How to Save a Reddit Thread as Markdown (With Comments and Context)

·

Reddit is where the real answers live --- the troubleshooting thread that actually solved your problem, the AMA buried four years deep, the niche subreddit where the experts hang out. But Reddit gives you nothing to take with you. There’s no export, no “save as” button, no way to pull a thread out with its comment context intact. Copy-paste flattens the nesting and strips the karma. If you’ve ever tried to feed a Reddit thread to Claude or ChatGPT, you know the problem --- pasting the URL gives the model nothing, and pasting the visible text loses the structure that made the thread worth reading.

This guide covers every method to convert a Reddit thread to clean Markdown --- from a quick question post to a 2,000-comment AMA.

Why Save Reddit Threads as Markdown?

Reddit content is uniquely fragile, and uniquely valuable as research material:

  • Feed it to an LLM --- Claude, ChatGPT, Gemini, and local models all read Markdown natively as context
  • Archive before it disappears --- users nuke accounts, mods remove posts, subreddits go private overnight
  • Quote a specific comment chain --- the answer is rarely the top comment, it’s usually three replies deep
  • Drop it into Obsidian or Notion --- one file, fully searchable, with the discussion structure intact
  • Build a personal knowledge base --- product recommendations, troubleshooting, AMAs, niche expertise

The use case driving most Reddit-to-Markdown traffic in 2026 is the first one: people want to ask an LLM to summarize a 500-comment thread, find the consensus answer, or extract the most-upvoted recommendation. Pasting the URL doesn’t work, and pasting raw HTML floods the context window.

Method 1: Save (Fastest, One Click)

Save is a Chrome extension that turns any Reddit thread into a Markdown file with one click. It walks the comment tree, preserves the nesting, and produces a file that reads like a structured discussion instead of a wall of text.

How it works:

  1. Open the Reddit thread in Chrome (works on both old.reddit.com and www.reddit.com)
  2. Click the Save extension icon in your toolbar
  3. A .md file downloads instantly (or lands in your Save Vault if connected)

What you get:

  • Original post body, title, subreddit, author, and score at the top
  • Top-scoring comment branches, with full reply nesting preserved
  • Karma score, flair, and OP markers on every comment
  • Frontmatter with URL, subreddit, post date, and total comment count
  • The whole thread in one file --- post and comments together, not two exports

What gets removed:

  • Sidebar widgets, subreddit rules, recommended posts
  • Promoted posts and ad slots between comments
  • Reddit’s nav chrome, footer, and “What are your thoughts?” prompts
  • AutoModerator stickied comments (unless explicitly opted in)
  • Deleted/removed comment bodies (replaced with a placeholder so the thread structure stays readable)

Best for: Researchers, AI users, anyone who treats Reddit as a serious knowledge source. If you need a clean thread that you’ll paste into Claude or read in Obsidian, this is the cleanest path. The killer feature is the comment nesting --- copy-paste destroys it, every other method loses it, Save keeps it.

Example Output

Saving a popular r/selfhosted thread produces:

---
title: "What's your favorite self-hosted app that nobody talks about?"
subreddit: r/selfhosted
author: u/homelabdad
url: https://www.reddit.com/r/selfhosted/comments/abc123/...
score: 1247
comments: 384
date: 2026-03-12
---

## Post

I've been running a homelab for 5 years and the apps that get all the
attention (Plex, Home Assistant, Nextcloud) are great, but the ones I
actually rely on every day are tiny tools nobody mentions. What's yours?

## Top Comments

### u/devops_anna --- 892 points
[Mealie](https://mealie.io) is criminally underrated. Recipe manager
that scrapes from any URL, meal planning, shopping lists. Replaced
three apps for me.

  - **u/homelabdad** (OP) --- 124 points
    > Just installed it after seeing this. The URL scraper is magic.

  - **u/recipe_hoarder** --- 67 points
    > Been using it 2 years. The Nextcloud integration is solid too.

### u/raspberry_pi_4 --- 654 points
**Beszel** for monitoring. Lighter than Grafana, prettier than Uptime
Kuma, zero config. Single binary on every host.

  - **u/devops_anna** --- 203 points
    > Switched from Netdata last month. No regrets.

That file is one paste away from being usable Claude context, one keystroke away from being a permanent Obsidian note. The nesting renders correctly in any Markdown viewer, and the karma scores make it obvious which branches matter.

Method 2: Reddit’s Native Export (Limited)

Reddit lets you request a data export of your own account activity via Settings → Privacy & Security → Request Data.

Steps:

  1. Go to reddit.com/settings/data-request
  2. Submit a GDPR or CCPA request
  3. Wait up to 30 days for an email with a download link
  4. Receive a ZIP of CSV files

Problems with this approach:

  • Only covers your own activity (posts you made, comments you wrote, saved items)
  • You cannot export someone else’s thread or an AMA
  • Output is CSV, not Markdown --- no structure, no nesting, just rows
  • Comment bodies are dumped flat with no parent-child relationships
  • 30-day wait makes it useless for active research
  • No support for exporting a single thread in context

Useful exactly once, when you want your own Reddit history backed up. Useless for everything else.

Method 3: PRAW or Reddit API

For engineering teams, Reddit’s official API (and its excellent Python wrapper PRAW) lets you pull any thread programmatically.

import praw

reddit = praw.Reddit(client_id="...", client_secret="...", user_agent="...")
submission = reddit.submission(url="https://reddit.com/r/.../comments/...")
submission.comments.replace_more(limit=None)

for comment in submission.comments.list():
    print(comment.author, comment.score, comment.body)

Best for: Engineering teams building Reddit ingestion pipelines, training datasets, or large-scale research scrapers.

Problems with this approach:

  • Reddit API now requires registration and enforces strict rate limits (since the 2023 changes)
  • Free tier caps at 100 queries per minute per OAuth client, harsh for deep threads with replace_more
  • No Markdown formatting --- you get raw text, no nesting structure, no rendering of Reddit’s own Markdown
  • Building a tree from the flat comment list is a separate step
  • Commercial / high-volume use requires a paid agreement with Reddit

This is the right method if you’re building a product. Wildly overkill for one thread.

Method 4: Browser Copy + Pandoc

The manual fallback: copy the rendered thread out of the browser and reformat.

# After copying the thread to a file as HTML:
pandoc thread.html -f html -t markdown -o thread.md

Best for: One-off rescue of a thread that you absolutely need now, with no tools installed.

Problems with this approach:

  • Copy-paste from Reddit flattens the nesting --- Pandoc cannot recover what Chrome didn’t include
  • Karma scores, flair, OP markers, and timestamps land inline as visual noise
  • “Load more comments” expansions need to happen manually before copying
  • Image posts, polls, and video posts lose their content entirely
  • The output needs heavy hand-editing to be usable

Works in a pinch on a short thread. Falls apart on anything with depth.

Which Method Should You Use?

ScenarioBest Method
Paste a thread into Claude or ChatGPTSave --- one click, nesting preserved
Archive a long AMA before it’s lockedSave --- captures the whole tree in one file
Quote a specific comment chain in research notesSave --- karma + flair + OP markers intact
Build an internal Reddit ingestion pipelinePRAW --- programmatic, with API key in hand
Back up your own Reddit historyReddit data export --- only method that gives you your own data
Rescue a single thread with no toolsCopy + Pandoc --- manual but works offline

For most people --- especially anyone using Reddit as AI context or as research material --- Save is the answer. It produces the cleanest Markdown with zero setup, and it handles 2,000-comment AMAs at the same speed as a five-comment post.

Edge Cases Save Handles

  • Permalinked specific comments. If you open a URL like /comments/abc/_/xyz/, Reddit shows only that comment and its replies. Save respects this --- it captures the comment subtree from that anchor down, not the whole thread. Useful when the answer you want is six levels deep.
  • Very deep comment threads. Reddit collapses threads past a certain depth with “continue this thread.” Save follows those links and stitches the deep branches back in, with indentation that stays readable in Markdown. On extreme cases (10+ levels), it switches to a flat-with-breadcrumb format so the file doesn’t become a horizontal scroll bar.
  • Deleted and removed comments. When a comment shows [deleted] or [removed], Save keeps the placeholder so the tree structure stays intact. The reply chain underneath is preserved --- you can still read the responses even when the original comment is gone.
  • old.reddit.com vs new.reddit.com. Both layouts work. Save auto-detects which one you’re on and parses accordingly. old.reddit tends to give slightly cleaner output because the DOM is simpler, but new.reddit works fine.
  • Private and restricted subreddits. Save sees what your logged-in browser sees. If you’re approved in a private sub, Save can capture threads there. If you can’t see it, neither can Save.
  • Archived threads. Reddit archives threads after 6 months and locks new comments. Save captures them identically --- archive status doesn’t change the parsing.
  • Polls and image posts. Poll results (current vote counts, options) are captured as a list. Image posts include the image URL and alt text in the frontmatter. Video posts include the video URL and the auto-transcript if Reddit generated one.
  • Crossposts. When a thread is a crosspost, Save links to the original and captures the discussion happening on the current crosspost, not on the source.

Pair It With Your Workflow

The Markdown output works wherever you need it:

  • Claude / ChatGPT / Gemini --- paste the file in, ask follow-up questions like “what’s the consensus product pick?” or “summarize the top three troubleshooting paths”
  • Obsidian --- drop it in your vault, link it to related notes, search across every thread you’ve saved on a topic
  • Notion --- paste directly, the nested comment structure renders as toggles
  • Apple Notes --- clean import via the Markdown share extension
  • Save Vault --- if you’ve connected one, every Reddit save lands there automatically with backlinks and subreddit tags

FAQ

Does Save work on the Reddit mobile site or app? The extension is desktop Chrome only for now. On mobile, copy the URL and open it on desktop, or share it into a Save Vault on Mac (which has a URL handler).

What about the new Reddit redesign with infinite-scroll comments? Save handles both the classic and the redesigned layouts. On new Reddit, it triggers the “load more comments” expansions automatically before parsing, so you don’t have to scroll the whole thread first.

Can I save just the original post, without the comments? Yes. The extension lets you pick: post only, post + top comments, or full thread. Default is post + top comments, because that’s usually what you actually want.

Does it preserve Reddit Markdown formatting in comments? Yes. Bold, italic, links, code blocks, blockquotes, and lists in comments are kept as Markdown in the output --- they were Markdown to begin with on Reddit’s side.

How does it handle threads with thousands of comments? Save captures the top-scoring branches first and works down by score. On a 2,000-comment AMA, the default output keeps the top 200 or so comment chains --- the ones with karma above the threshold. You can configure the depth and threshold in the extension settings if you want everything or just the very top.

What about NSFW threads? Save captures them identically to any other thread --- it doesn’t filter by NSFW tag. If you can see the thread, Save can save it.

Is the karma score accurate at save time? Yes, scores are captured at the moment you click Save. Reddit fuzzy-rounds large scores (“12.4k”) on the page; Save uses the exact integer when the API exposes it, otherwise it keeps the displayed rounded value.

How much does it cost? Save has a free tier so you can try it on a few threads. After that, a small subscription covers the parsing and storage costs.

## Continue reading

Jean-Sébastien Wallez

Written by

Jean-Sébastien Wallez

I've been making internet products for 10+ years. Built Save on weekends because I wanted my own reading library in clean markdown for Claude and Obsidian. Write here about web clipping, AI workflows, and the small things that make a personal knowledge base actually useful.