8:17
Studying GSM8K Leaderboard
151 views • 11 days ago
7:37
Think-and-execute prompting for LLMs
260 views • 2 weeks ago
9:17
AutoGen: Programming LLM Agents
300 views • 1 month ago
5:01
Fine-tuning LLMs encourages hallucinations
237 views • 1 month ago
8:05
Fine-tuning or RAG?
443 views • 1 month ago
15:04
Fixing RAG with GraphRAG
4K views • 1 month ago
6:59
LLMs improve writing-based knowledge work
270 views • 2 months ago
13:06
Co-intelligence: book review
389 views • 2 months ago
10:09
Winning prompt! $10k LLM reasoning challenge
464 views • 2 months ago
6:24
$10k for LLM reasoning
858 views • 2 months ago
7:40
LLM agents do software engineering
668 views • 3 months ago
11:02
LLM benchmarks
648 views • 3 months ago
9:06
LLMs eat entry-level SWEs
993 views • 3 months ago
8:54
LLMs can debug with prints
452 views • 3 months ago
10:07
Determinism ⇒ Fast LLMs (Groq)
572 views • 4 months ago
4:42
Self-discovery: Choosing the Best Prompt for the Problem
352 views • 4 months ago
6:50
Asleep at the wheel: can AI reduce performance?
123 views • 4 months ago
5:41
The Future of RAG in the Age of Large Context Windows
635 views • 4 months ago
6:58
GPT-4 passed the Turing Test!
2.1K views • 5 months ago
5:35
CS higher ed in North America: all the stats you should know
142 views • 5 months ago
4:18
I remember @AndrejKarpathy's deleted tweet
372 views • 5 months ago
9:28
LLMs for real world knowledge work
254 views • 5 months ago
7:38
Watch me build a GPT for journaling
292 views • 5 months ago
8:30
LLMs can "breed" their own prompts
1.1K views • 5 months ago
9:35
LLMs with infinite context?
758 views • 6 months ago
9:44
Can prompt engineering beat fine-tuning?
628 views • 6 months ago
7:48
Can LLMs discover new math and CS?
1K views • 6 months ago
10:53
How will CS educators adapt to AI tools?
407 views • 7 months ago
6:27
How does Copilot do on advanced CS courses?
135 views • 7 months ago
6:45
Fine-tuning Language Models for Factuality
368 views • 7 months ago