Machine Learning Blog | ML@CMU | Carnegie Mellon University

machine learning Research

How to Regularize Your Regression

by Dravyansh Sharma / April 12, 2024

A series of regression instances in a pharmaceutical application. Can we learn how to set the regularization parameter \(\lambda\) from similar domain-specific data? Overview. Perhaps the simplest relation between a real dependent variable \(y\) and a vector of features \(X\) is a linear model \(y = \beta X\). Given some training examples or datapoints consisting of pairs of features and dependent variables \((X_1, y_1),(X_2, y_2),\dots,(X_m,y_m)\), we would like to learn \(\beta\) which would give the best prediction \(y’\) given features…

147 1887

machine learning Research

Beyond the Mud: Datasets, Benchmarks, and Methods for Computer Vision in Off-Road Racing

March 22, 2024

TL;DR: Off-the-shelf text spotting and re-identification models fail in basic off-road racing settings, even more so during muddy events. Making matters worse, there aren’t any public datasets to evaluate or improve models in this domain. To this end, we introduce…

182 2582

machine learning Research

NLPositionality: Characterizing Design Biases of Datasets and Models

March 1, 2024

TLDR; Design biases in NLP systems, such as performance differences for different populations, often stem from their creator’s positionality, i.e., views and lived experiences shaped by identity and background. Despite the prevalence and risks of design biases, they are hard…

244 2526

machine learning Research

On Noisy Evaluation in Federated Hyperparameter Tuning

December 29, 2023

Evaluating models in federated networks is challenging due to factors such as client subsampling, data heterogeneity, and privacy. These factors introduce noise that can affect hyperparameter tuning algorithms and lead to suboptimal model selection. Hyperparameter tuning is critical to the…

311 5282

machine learning Research

Creative Robot Tool Use with Large Language Models

December 8, 2023

TLDR: We introduce RoboTool, enabling robots to use tools creatively with large language models, which solves long-horizon hybrid discrete-continuous planning problems with the environment- and embodiment-related constraints. Tool use is an essential hallmark of advanced intelligence. Some animals can use…

364 4659

machine learning Research

Peer Reviews of Peer Reviews: A Randomized Controlled Trial and Other Experiments

December 1, 2023

Alexander Goldberg, Ivan Stelmakh, Kyunghyun Cho, Alice Oh, Alekh Agarwal, Danielle Belgrave, and Nihar Shah Is it possible to reliably evaluate the quality of peer reviews? We study peer reviewing of peer reviews driven by two primary motivations: (i) Incentivizing…

371 3470

machine learning Research

Supporting Human-AI Collaboration in Auditing LLMs with LLMs

September 22, 2023

Illustration depicting the process of a human and a large language model working together to find failure cases in a (not necessarily different) large language model. Overview In the era of ChatGPT, where people increasingly take assistance from a large…

451 7075

computer vision deep learning machine learning Research

Test-time Adaptation with Slot-Centric Models

September 15, 2023

TLDR: Current SOTA methods for scene understanding, though impressive, often fail to decompose out-of-distribution scenes. In our ICML paper, Slot-TTA (http://slot-tta.github.io) we find that optimizing per test sample over reconstruction loss improves scene decomposition accuracy. Problem Statement: In machine learning,…

457 4476

machine learning Research

Navigating to Objects in the Real World

June 30, 2023

Empirical study: We evaluated three approaches for robots to navigate to objects in six visually diverse homes. TLDR: Semantic navigation is necessary to deploy mobile robots in uncontrolled environments like our homes, schools, and hospitals. Many learning-based approaches have been…

506 6213

machine learning Research

Validating Large Language Models with ReLM

June 5, 2023

ReLM enables writing tests that are guaranteed to come from the set of valid strings, such as dates. Without ReLM, LLMs are free to complete prompts with non-date answers, which are difficult to assess. TL;DR: While large language models (LLMs)…

547 12662

Older Posts

Machine Learning Blog | ML@CMU | Carnegie Mellon University

Statistics:

Categories: