Files
FacereDataset/.gitignore
Knowit 5e63924474 oshwhub: pin listing index snapshot (33,695 rows, 29 MB) into git
Previous commit added the dump script + report but the actual jsonl was caught
by data/state/* gitignore. Add a targeted exception so the snapshot travels with
the repo — anyone who clones can do local filtering without re-hitting the API.

The data is regenerable (scripts/dump_listing_index.py is one-shot, ~1 min), but
pinning a dated snapshot lets us reason about "the state of the corpus on
2026-04-28" reproducibly. Future re-dumps overwrite the same path.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-28 23:32:58 +08:00

50 lines
798 B
Plaintext
Raw Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
# Derivative 数据(可从 raw 重建),不入库
data/processed/*
data/state/*
!data/processed/.gitkeep
!data/state/.gitkeep
# 例外oshwhub 全量 listing 索引快照入库28 MB jsonl可重抓但要钉个版本
!data/state/oshwhub_listing_full.jsonl
# data/raw 入库(工程二进制走 LFS见 .gitattributes
# Python
__pycache__/
*.py[cod]
*.egg-info/
.pytest_cache/
.ruff_cache/
.mypy_cache/
.venv/
venv/
.env
.env.*
!.env.example
# uv
uv.lock
# Node (if we add JS helpers)
node_modules/
# Editor / OS
.vscode/
.idea/
.DS_Store
Thumbs.db
*.swp
# Claude Code session-local state
.claude/
# Local scratch
/tmp/
/scratch/
*.log
# kicad-cli sch erc default output (when --output not given goes to cwd/<input>.rpt)
*.rpt
# Private keys — never commit
*.pem
*.key