AI

Omnipredictors for Regression and the Approximate Rank of Convex Functions

1 Mins read

Consider the supervised learning setting where the goal is to learn to predict labels y given points x from a distribution. An omnipredictor for a class L of loss functions and a class C of hypotheses is a predictor whose predictions incur less expected loss than the best hypothesis in C for every loss in L. Since the work of [GKR+21] that introduced the notion, there has been a large body of work in the setting of binary labels where y∈{0,1}, but much less is known about the regression setting where y∈[0,1] can be continuous. Our main conceptual contribution is the notion of sufficient statistics for loss minimization over a family of loss functions: these are a set of statistics about a distribution such that knowing them allows one to take actions that minimize the expected loss for any loss in the family. The notion of sufficient statistics relates directly to the approximate rank of the family of loss functions.

Our key technical contribution is a bound of O(1/ε^{2/3}) on the ϵ-approximate rank of convex, Lipschitz functions on the interval [0,1], which we show is tight up to a factor of polylog(1/ϵ). This yields improved runtimes for learning omnipredictors for the class of all convex, Lipschitz loss functions under weak learnability assumptions about the class C. We also give efficient omnipredictors when the loss families have low-degree polynomial approximations, or arise from generalized linear models (GLMs). This translation from sufficient statistics to faster omnipredictors is made possible by lifting the technique of loss outcome indistinguishability introduced by [GKH+23] for Boolean labels to the regression setting.


Source link

Related posts
AI

Optimizing Training Data Allocation Between Supervised and Preference Finetuning in Large Language Models

3 Mins read
Large Language Models (LLMs) face significant challenges in optimizing their post-training methods, particularly in balancing Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL)…
AI

This AI Paper from Weco AI Introduces AIDE: A Tree-Search-Based AI Agent for Automating Machine Learning Engineering

3 Mins read
The development of high-performing machine learning models remains a time-consuming and resource-intensive process. Engineers and researchers spend significant time fine-tuning models, optimizing…
AI

What are AI Agents? Demystifying Autonomous Software with a Human Touch

8 Mins read
In today’s digital landscape, technology continues to advance at a steady pace. One development that has steadily gained attention is the concept…

 

 

Leave a Reply

Your email address will not be published. Required fields are marked *