Publications

(2025). Turning Up the Heat: Min-p Sampling for Creative and Coherent LLM Outputs. In ICLR.

PDF

(2024). Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning. In ICLR.

PDF

(2024). LiveBench: A Challenging, Contamination-Free LLM Benchmark. In ICLR.

PDF

(2024). OpenDebateEvidence: A Massive-Scale Argument Mining and Summarization Dataset. In NeurIPS.

PDF Dataset

(2024). Inheritune: Training Smaller Yet More Attentive Language Models. arXiv.

PDF Code

(2023). An Information Theory Perspective on Variance-Invariance-Covariance Regularization. In NeurIPS.

PDF

(2023). Back to Basics: Revisiting Standard Deep Learning Components for Class Imbalance. In NeurIPS.

PDF DOI

(2022). What Do We Maximize in Self-Supervised Learning?. In ICML 2022: Pre-training: Perspectives, Pitfalls, and Paths Forward workshop.

PDF

(2022). Pre-Train Your Loss: Easy Bayesian Transfer Learning with Informative Priors. In NeurIPS 2022.

PDF Code Video

(2022). Tabular Data: Deep Learning is Not All You Need. In Information Fusion.

PDF Video

(2020). The Dual Information Bottleneck.

PDF Code

(2020). Neural Correlates of Learning Pure Tones or Natural Sounds in the Auditory Cortex. Frontiers in Neural Circuits.

PDF

(2020). Information in Infinite Ensembles of Infinitely-Wide Neural Networks. In The Symposium on Advances in Approximate Bayesian Inference.

PDF Code

(2018). Attentioned Convolutional LSTM Inpaintingv Network for Anomaly Detection in Videos. NIPS 2018 Workshop on Systems for ML.

PDF

(2017). Sequence Modeling Using a Memory Controller Extension for LSTM. NIPS 2017 Time Series Workshop.

PDF

(2017). Tabular Data: Deep Learning is Not All You Need. In TICML 2021 Workshop AutoML.

PDF

(0001). Automated Testing of Graphics Units by Deep-Learning Detection of Visual Anomalies. NIPS 2018 Machine Learning for Systems Workshop.

PDF