Open sourced my work on LLMs and n-gram statistics

November 7, 2024 (updated November 10, 2024) ~ Timothy Nguyen

I just open sourced the datasets used in my NeurIPS 2024 paper, Understanding Transformers via N-Gram Statistics. The release includes the training data and the associated n-gram data, so that the research community can replicate and build upon my work measuring the extent to which LLM predictions can be described in terms of n-gram statistics.
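For readers new to the idea, here is a toy sketch of what comparing a model's predictions against n-gram statistics can look like. It is not the method or code from the paper: the tiny corpus, the helper names (`build_ngram_counts`, `ngram_top1`, `top1_agreement`), and the "model predictions" are all made up for illustration.

```python
from collections import Counter, defaultdict


def build_ngram_counts(tokens, n):
    """Map each (n-1)-token context to a Counter of the tokens that follow it."""
    counts = defaultdict(Counter)
    for i in range(len(tokens) - n + 1):
        context = tuple(tokens[i:i + n - 1])
        counts[context][tokens[i + n - 1]] += 1
    return counts


def ngram_top1(counts, context):
    """Most frequent next token for a context, or None if the context is unseen."""
    followers = counts.get(tuple(context))
    if not followers:
        return None
    return followers.most_common(1)[0][0]


def top1_agreement(counts, examples):
    """Fraction of (context, model_prediction) pairs where the n-gram
    top-1 token matches the model's predicted token."""
    matches = sum(ngram_top1(counts, ctx) == pred for ctx, pred in examples)
    return matches / len(examples)


# Toy corpus standing in for real training data (hypothetical).
corpus = "the cat sat on the mat and the cat ran".split()
bigram_counts = build_ngram_counts(corpus, n=2)

# Hypothetical model predictions; a real study would query an actual LLM.
model_examples = [(["the"], "cat"), (["sat"], "on"), (["on"], "a")]
agreement = top1_agreement(bigram_counts, model_examples)
print(f"top-1 agreement with bigram statistics: {agreement:.2f}")
```

In this toy setup the bigram statistics agree with two of the three made-up predictions. The released n-gram data is what makes this kind of comparison feasible at the scale of a real training set.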


ai machine-learning open source
