BEDTime: A Unified Benchmark for Automatically Describing Time Series

Medhasweta Sen, Zachary Gottesman, Jiaxing Qiu, C. Bayan Bruss, Nam Nguyen, Tom Hartvigsen

cs.CL | Sep 5, 2025

One-line Summary

BEDTime introduces a unified benchmark for evaluating time series description models, highlighting the need for specialized architectures and revealing the strengths and weaknesses of various model types.

Plain-language Overview

The paper introduces BEDTime, a new benchmark designed to evaluate how well different models can describe time series data using natural language. It focuses on three tasks: recognizing whether a statement about a time series is true or false, choosing the correct description from multiple options, and generating an open-ended description. The study finds that models specifically designed for time series data tend to perform better than general language models, though there is still room for improvement. This benchmark helps researchers compare models more directly and understand which features contribute to their performance.
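To make the first two task formats concrete, here is a minimal Python sketch of how a true/false task and a multiple-choice task could be scored with plain accuracy. It is an illustration under assumptions, not the benchmark's own code: the names `RecognitionItem`, `ChoiceItem`, `recognition_accuracy`, `choice_accuracy`, and the toy trend baseline are all hypothetical, and the paper's actual data format and metrics may differ. Open-ended generation is left out because scoring free text needs a metric that this summary does not specify.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass
class RecognitionItem:
    """Hypothetical true/false item: does the statement describe the series?"""
    series: Sequence[float]
    statement: str
    label: bool


@dataclass
class ChoiceItem:
    """Hypothetical multiple-choice item: pick the correct description."""
    series: Sequence[float]
    options: Sequence[str]
    answer: int  # index of the correct option


def recognition_accuracy(
    items: Sequence[RecognitionItem],
    predict: Callable[[Sequence[float], str], bool],
) -> float:
    """Fraction of items where the predicted truth value matches the label."""
    if not items:
        return 0.0
    correct = sum(predict(it.series, it.statement) == it.label for it in items)
    return correct / len(items)


def choice_accuracy(
    items: Sequence[ChoiceItem],
    choose: Callable[[Sequence[float], Sequence[str]], int],
) -> float:
    """Fraction of items where the chosen option index is the correct one."""
    if not items:
        return 0.0
    correct = sum(choose(it.series, it.options) == it.answer for it in items)
    return correct / len(items)


def trend_predict(series: Sequence[float], statement: str) -> bool:
    """Toy baseline (assumption): a statement is true iff its 'rising' claim
    agrees with the sign of the change from the first to the last value."""
    rising = series[-1] > series[0]
    claims_rising = "rising" in statement.lower()
    return rising == claims_rising


def trend_choose(series: Sequence[float], options: Sequence[str]) -> int:
    """Toy baseline (assumption): return the first option the trend rule
    accepts, falling back to option 0 if none is accepted."""
    for i, option in enumerate(options):
        if trend_predict(series, option):
            return i
    return 0


if __name__ == "__main__":
    rec_items = [
        RecognitionItem([1.0, 2.0, 3.0], "The series is rising", True),
        RecognitionItem([3.0, 2.0, 1.0], "The series is rising", False),
    ]
    mc_items = [
        ChoiceItem(
            [1.0, 2.0, 3.0],
            ["The series is falling", "The series is rising"],
            1,
        ),
    ]
    print(recognition_accuracy(rec_items, trend_predict))
    print(choice_accuracy(mc_items, trend_choose))
```

Any model can be dropped in by replacing `trend_predict` or `trend_choose` with a function that has the same signature, which is what lets one benchmark compare very different model types on equal terms.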

Technical Details