打通 oshwhub origin=pro 现代 Pro 3.x 工程的 EPRO2 源抓取链路。3/5
modern Pro 项目完整解出(共 8423 docs / 542 MB plain):
- X86 主板 7374 docs / 481 MB plain (chain=85, editor=3.2.15)
- 220V 桌面电源 771 docs / 26 MB plain (chain=28, editor=3.2.69)
- ESP-VoCat 278 docs / 7.5 MB plain (chain=12, editor=3.2.91)
剩余 2/5 是 legacy Pro 2.x(立创泰山派 RK3566、梁山派),项目 meta
返回 branch_uuid=null + editorVersion="2.1.40",没有 git-style chain
模型,文档直接挂在 boards[].sch/pcb 字段上,访问端点暂未挖通;元
数据落库 metadata.json,source/ 留空。
实现要点:
- fetch_pro_source(): 4 步流程(project → branch HEAD → structures
→ /branches/<B>/histories/<HEAD> 即返完整 chain,无需 ?limit 批量
端点)+ 逐 history 走 AES-128-GCM 解密(16 字节 IV,pycryptodome
原生支持)+ gunzip + 按 DOCHEAD 切 per-doc EPRO2 流
- EPRO2 解析坑:行末单 `|` 是行终止符不是字段分隔符,必须先
rstrip("|") 再 split("||"),否则 payload JSON 解析失败 silently
swallow 导致 cur_doc 不设 → 第一轮 X86 板 7374 docs 抽出来只剩 2 个
- docType 实测远不止 BOARD/PCB/SCH/SCH_PAGE,还含 SYMBOL /
FOOTPRINT / DEVICE / BLOB / FONT / CONFIG —— Pro 把组件库快照也
随项目存到 history,下游做 EPRO2→KiCad 转换时必须先把这些 lib
doc 加载进 symbol cache
- Pro 2.x vs 3.x 是不同存储模型 —— 3.x 走 branch 模型(已打通),
2.x 走 boards[] 直链(未打通);判别条件:project meta 的
branch_uuid 是否为 null
CLI 新增 --with-pro-source / --backfill-pro-source / --pro-cookie /
--origin(按 origin 字段服务端过滤 listing API),crawl_one() 按
origin=pro 自动 dispatch 到 Pro fetcher。
schema:docType 类型从 integer 放宽到 [integer, string, null]
(兼容 Std 的 1/3 + Pro 的 BOARD/SCH 等),新增 message_count 字段。
License 注意:本批 5 个项目全是 NC-SA / GPL,未达 Pro source doc
§4.2 Forge 白名单(MIT/BSD/Apache/CC0/CC-BY/CERN-OHL-P/Unlicense)。
按 CLAUDE.md "研究用、不再分发" 原则 raw 入库无碍;Forge 投影时
另过白名单。
详细技术细节见 docs/sources/easyeda_pro_source.md rev 3 + log.md。
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
3 lines
1.9 KiB
Plaintext
3 lines
1.9 KiB
Plaintext
{"type":"DOCHEAD","ticket":3238}||{"docType":"DEVICE","client":"87d2123eb200f925","uuid":"6209565295f200ea","updateTime":1768981769058,"version":"1658348581"}|
|
|
{"type":"META","ticket":3239,"id":"META"}||{"title":"TS-1101-C-W","tags":["按键/开关","轻触开关"],"source":"92f5c854d43e40688877066f58abc5be|0819f05c4eef4c71ace90d822a990e87","images":["https://alimg.szlcsc.com/upload/public/product/middle/20230109/838630419820679F07E7748423BBADDC.jpg"],"attributes":{"LCSC Part Name":"6*3.6*2.5mm 立贴 轻触开关","Supplier Part":"C318938","Manufacturer":"XKB Connectivity(中国星坤)","Manufacturer Part":"TS-1101-C-W","Supplier Footprint":"SMD","JLCPCB Part Class":"","Datasheet":"https://atta.szlcsc.com/upload/public/pdf/source/20200227/C318938_C2E53B1EEDE37D980A33616AC086DE53.pdf","Supplier":"LCSC","Add into BOM":"yes","Convert to PCB":"yes","Symbol":"1ee89767b0920d5e","Designator":"SW?","Footprint":"d0ead2ae41ccd86f","3D Model":"b2f4ea9f0afc4b0786d53dba66ceb9e4","3D Model Title":"SW-SMD_L6.0-W3.3-H5-LS8.0","3D Model Transform":"314.96,129.921,0,0,0,0,0,0,0","Circuit":"单刀单掷","Actuator Style":"矩形按钮","Operating Force":"260gf","Mounting Style":"立贴","Switch Length":"6mm","Switch Width":"3.6mm","Switch Height":"2.5mm","Strike Gundam":"不带","Contact Current":"50mA","Voltage Rating (AC)":"-","Voltage Rating (DC)":"12V","Power Rating":"-","Insulation Resistance":"-","Actuator/Cap Color":"白色带淡绿","With Lamp":"无","IP Rating":"-","Pin Style":"-","Mechanical Life":"5万次","Operating Temperature":"-30℃~+85℃","Name":"={Manufacturer Part}","Description":"电路结构:单刀单掷;按钮形状:矩形按钮;作用力:260gf;安装方式:立贴;开关长度:6mm;开关宽度:3.6mm;开关高度:2.5mm;带支架:不带;触点电流:50mA;额定电压(AC):-;额定电压(DC):12V;额定功率:-;绝缘电阻:-;带灯:无;机械寿命:5万次;"}}|
|