HTML 转文本
通过去除所有标签将 HTML 转换为纯文本。可选择保留链接和换行符。
Features:
- • Strips all HTML tags
- • Decodes HTML entities (& → &)
- • Removes script and style content
- • Preserves list formatting with bullets
- • Optionally preserves links and line breaks
HTML to Plain Text - 技术详情
This tool removes HTML tags while preserving meaningful content. It decodes HTML entities, converts list items to bullet points, and can optionally show link URLs in parentheses.
命令行替代方案
# Using lynx lynx -dump -nolist page.html # Using w3m w3m -dump page.html # Using Python html2text pip install html2text html2text page.html