examples: add zhaopin AI job scrapers + SQLite analysis

- scrape_zhaopin_ai.py: lightweight no-login list scraper - scrape_zhaopin_full.py: SQLite storage, resumable crawl, detail-page JD - analyze_zhaopin.py: stats by city/salary/education/experience/skills - scrape_zhipin_ai.py: BOSS Zhipin variant (login-based, fallback) - gitignore scraper data artifacts and browser profile
webgl: ship only the GPU buckets that pass tampering_ml + decouple render-noise seed
2026-06-14 23:18:51 +08:00 · 2026-06-14 11:53:33 +02:00 · 2026-06-12 17:40:48 +02:00 · 2026-06-12 17:31:31 +02:00 · 2026-06-11 20:19:19 +02:00 · 2026-06-11 19:14:45 +02:00
32 changed files with 12636 additions and 234 deletions
@@ -104,6 +104,24 @@ jobs:
          ref: ${{ env.SOURCE_REF }}
          fetch-depth: 1
      # Record which invisible_firefox commit this build came from. The publish
      # job turns the range previous-release..this commit into the release notes
      # (scripts/gen_release_notes.py), and re-publishes it as a source-commit.txt
      # asset so the NEXT release knows where to start the changelog. One leg is
      # enough — all legs check out the same SOURCE_REF.
      - name: Record source commit (for auto release notes)
        if: matrix.leg == 'linux-x86_64'
        shell: bash
        run: git rev-parse HEAD > source-commit.txt && cat source-commit.txt
      - name: Upload source-commit artifact
        if: matrix.leg == 'linux-x86_64'
        uses: actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02 # v4
        with:
          name: source-commit
          path: source-commit.txt
          if-no-files-found: error
          retention-days: 7
      - name: Set up Python
        uses: actions/setup-python@a26af69be951a213d495a4c3e4e4022e16d87065 # v5
        with: { python-version: '3.11' }
@@ -344,18 +362,23 @@ jobs:
      # CLOAK + WEBGL-MASKING GUARDS — run the wrapper's e2e cloak/gamma checks
      # against THIS leg's freshly-built artifact, on its native runner. The
      # wrapper's headless=True is headed+hidden (cloak on Win/macOS, its own
-      # Xvfb on Linux), so software-GL rendering works on the GPU-less hosts.
+      # Xvfb on Linux). Linux (Xvfb + llvmpipe) and Windows (WARP) give a
-      # test_cloak asserts the window is hidden (Windows DWMWA_CLOAKED / macOS
+      # software WebGL context on the GPU-less hosts, so the WebGL-dependent
-      # CGWindowAlpha) AND still renders — the macOS leg is the only place the
+      # assertions run there. macOS GitHub runners expose NO WebGL in the CI
-      # cocoa cloak patch gets RUN. The webgl guard catches a regression of the
+      # session at all (even vanilla Firefox; macOS has no software-GL fallback),
-      # gamma readPixels noise back to the pixelscan-maskable ±1 spike form.
+      # so on the mac legs the WebGL checks self-skip and the cloak is validated
      # via its non-blank screenshot + CGWindowAlpha == 0. test_cloak asserts the
      # window is hidden (Windows DWMWA_CLOAKED / macOS CGWindowAlpha) AND still
      # renders — the macOS leg is the only place the cocoa cloak patch gets RUN.
      # The webgl guard catches a regression of the gamma readPixels noise back to
      # the pixelscan-maskable ±1 spike form (covered on Linux + Windows).
      - name: Install pyobjc Quartz (macOS — to read the cloak window alpha)
        if: matrix.kind == 'mac'
        run: python -m pip install --quiet pyobjc-framework-Quartz
      - name: Cloak + WebGL-masking guards (headed)
        shell: bash
        run: |
-          python -m pip install --quiet -e .
+          python -m pip install --quiet ".[dev]"
          INVPW_BINARY_PATH="$FF_EXE" python -m pytest \
            tests/test_cloak.py \
            "tests/test_fingerprint_surface.py::test_webgl_readpixels_no_masking_signature" \
@@ -368,9 +391,18 @@ jobs:
    permissions:
      contents: write
    steps:
      - name: Checkout wrapper (for scripts/gen_release_notes.py)
        uses: actions/checkout@34e114876b0b11c390a56381ad16ebd13914f8d5 # v4
        with: { fetch-depth: 1 }
      - name: Set up Python
        uses: actions/setup-python@a26af69be951a213d495a4c3e4e4022e16d87065 # v5
        with: { python-version: '3.11' }
      - name: Download all build assets
        uses: actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093 # v4
        with: { pattern: asset-*, path: dl, merge-multiple: true }
      - name: Download source-commit metadata
        uses: actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093 # v4
        with: { name: source-commit, path: src-meta }
      - name: Assert all 5 target archives present (no silent partial release)
        run: |
          cd dl
@@ -397,9 +429,38 @@ jobs:
          TAG="${{ github.event.inputs.release_tag }}"
          [ -z "$TAG" ] && TAG="${GITHUB_REF_NAME}"
          echo "tag=$TAG" >> "$GITHUB_OUTPUT"
-          # bare revision number for the release title: firefox-9 -> 9
+          # bare revision number for the release title: firefox-10 -> 10
-          echo "num=${TAG#firefox-}" >> "$GITHUB_OUTPUT"
+          N="${TAG#firefox-}"
          echo "num=$N" >> "$GITHUB_OUTPUT"
          # previous release tag, for the changelog range (firefox-10 -> firefox-9)
          case "$N" in (*[!0-9]*|'') echo "prevtag=" >> "$GITHUB_OUTPUT";;
                        (*) echo "prevtag=firefox-$((N-1))" >> "$GITHUB_OUTPUT";; esac
          echo "publishing DRAFT release for tag: $TAG"
      - name: Build release notes from the source commits
        id: notes
        env:
          GH_TOKEN: ${{ secrets.GITHUB_TOKEN }}
          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
        run: |
          set -e
          CUR="$(cat src-meta/source-commit.txt 2>/dev/null | tr -d '[:space:]')"
          echo "this build's source commit: ${CUR:-<none>}"
          # previous release's recorded source commit — gives the changelog range.
          # Missing (first automated notes / firefox-0) -> notes omit the changelog.
          PREV=""
          PREVTAG="${{ steps.tag.outputs.prevtag }}"
          if [ -n "$PREVTAG" ] && gh release download "$PREVTAG" -R "${{ github.repository }}" \
               --pattern source-commit.txt --dir prev 2>/dev/null; then
            PREV="$(cat prev/source-commit.txt | tr -d '[:space:]')"
            echo "previous ($PREVTAG) source commit: $PREV"
          else
            echo "no previous source-commit.txt — changelog section omitted this time"
          fi
          python scripts/gen_release_notes.py --tag "${{ steps.tag.outputs.tag }}" \
            --current "$CUR" --prev-sha "$PREV" --source-repo "${{ env.SOURCE_REPO }}" > body.md
          echo "----- generated body.md -----"; cat body.md
          # publish THIS build's source commit so the next release can diff from it
          cp src-meta/source-commit.txt dl/source-commit.txt
      - name: Create DRAFT release with all assets
        uses: softprops/action-gh-release@3bb12739c298aeb8a4eeaf626c5b8d85266b0e65 # v2
        with:
@@ -412,13 +473,7 @@ jobs:
            dl/*.tar.gz
            dl/*.zip
            dl/checksums.txt
-          body: |
+            dl/source-commit.txt
-            Patched Firefox 150.0.1 — built on GitHub Actions ($0, no mold).
+          body_path: body.md
            Targets: linux-x86_64, linux-arm64, win-x86_64, macos-arm64, macos-x86_64.
            DRAFT — do not publish until validate_release.py + realness gate pass on all archives.
            macOS: ad-hoc signed (not notarized). After download run:
              xattr -dr com.apple.quarantine Firefox.app
        env:
          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
@@ -0,0 +1,103 @@
 # ─────────────────────────────────────────────────────────────────────────────
 # verify-cloak.yml — re-runnable CLOAK + WEBGL-MASKING GUARDS for an EXISTING
 # build run's artifacts, WITHOUT rebuilding Firefox (~3h on the mac legs).
 #
 # release.yml runs these same guards in its `gate` job against each freshly-built
 # artifact. This re-runs them against the artifacts of a PRIOR build run (input
 # `run_id`) using the CURRENT wrapper code on the default branch — so a test-only
 # fix (e.g. making the macOS leg tolerant of the runner's missing WebGL) can be
 # validated against the real binaries in ~10 min instead of paying a full rebuild.
 #
 # Same guard command as release.yml's gate. Headed-but-cloaked; zero proxy / zero
 # secrets. The macOS legs are the only place the cocoa cloak patch actually RUNS.
 # ─────────────────────────────────────────────────────────────────────────────
 name: verify-cloak
 on:
  workflow_dispatch:
    inputs:
      run_id:
        description: 'build run id whose asset-* artifacts to re-gate (e.g. 27346856197)'
        required: true
 permissions:
  contents: read
  actions: read   # download-artifact needs this to read another run's artifacts
 jobs:
  guard:
    name: guard-${{ matrix.leg }}
    runs-on: ${{ matrix.runner }}
    timeout-minutes: 25
    strategy:
      fail-fast: false
      matrix:
        # Same legs/runners/assets as release.yml's gate matrix.
        include:
          - leg: linux-x86_64
            runner: ubuntu-24.04
            kind: linux
            asset: firefox-150.0.1-stealth-linux-x86_64.tar.gz
          - leg: linux-arm64
            runner: ubuntu-24.04-arm
            kind: linux
            asset: firefox-150.0.1-stealth-linux-arm64.tar.gz
          - leg: win-x86_64
            runner: windows-latest
            kind: win
            asset: firefox-150.0.1-stealth-win-x86_64.zip
          - leg: macos-arm64
            runner: macos-15
            kind: mac
            asset: firefox-150.0.1-stealth-macos-arm64.tar.gz
          - leg: macos-x86_64
            runner: macos-15-intel
            kind: mac
            asset: firefox-150.0.1-stealth-macos-x86_64.tar.gz
    steps:
      - name: Checkout wrapper (current default branch — the FIXED tests)
        uses: actions/checkout@34e114876b0b11c390a56381ad16ebd13914f8d5 # v4
        with: { fetch-depth: 1 }
      - name: Download build asset from the prior run (no rebuild)
        uses: actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093 # v4
        with:
          name: asset-${{ matrix.leg }}
          path: art
          run-id: ${{ github.event.inputs.run_id }}
          github-token: ${{ secrets.GITHUB_TOKEN }}
      - name: Set up Python
        uses: actions/setup-python@a26af69be951a213d495a4c3e4e4022e16d87065 # v5
        with: { python-version: '3.11' }
      - name: Install Playwright driver (no bundled browser — we override executable_path)
        # Single-source pin (see release.yml); the wrapper enforces juggler compat.
        shell: bash
        run: python -m pip install --quiet "playwright==$(cat scripts/playwright_pin.txt)"
      - name: Linux system deps for headless firefox
        if: matrix.kind == 'linux'
        run: sudo "$(which python)" -m playwright install-deps firefox
      - name: Extract + locate firefox binary
        shell: bash
        run: |
          set -e
          mkdir -p ff
          A="art/${{ matrix.asset }}"
          case "${{ matrix.kind }}" in
            win)   python -c "import zipfile; zipfile.ZipFile('$A').extractall('ff')"; EXE="ff/firefox.exe";;
            linux) tar xzf "$A" -C ff; EXE="ff/firefox";;
            mac)   tar xzf "$A" -C ff; EXE="ff/Firefox.app/Contents/MacOS/firefox";;
          esac
          [ -e "$EXE" ] || { echo "ERROR: firefox binary not found at $EXE"; exit 1; }
          chmod +x "$EXE" 2>/dev/null || true
          echo "FF_EXE=$EXE" >> "$GITHUB_ENV"
          echo "located: $EXE"
      - name: Install pyobjc Quartz (macOS — to read the cloak window alpha)
        if: matrix.kind == 'mac'
        run: python -m pip install --quiet pyobjc-framework-Quartz
      - name: Cloak + WebGL-masking guards (headed)
        shell: bash
        run: |
          python -m pip install --quiet ".[dev]"
          INVPW_BINARY_PATH="$FF_EXE" python -m pytest \
            tests/test_cloak.py \
            "tests/test_fingerprint_surface.py::test_webgl_readpixels_no_masking_signature" \
            -m e2e -o addopts='' -q
@@ -6,3 +6,9 @@ build/
 .pytest_cache/
 .venv/
 firefox-source/
 # scraper runtime artifacts (examples/) — data outputs & browser profiles
 *.db
 zhaopin_*.json
 zhaopin_*.csv
 ai_jobs.csv
 examples/.zhipin_profile/
@@ -0,0 +1,169 @@
 """智联招聘抓取数据分析 —— 读取 SQLite, 按城市/薪资/学历/经验/技能做统计。
 纯标准库, 无需 pandas。
 用法:
  python examples/analyze_zhaopin.py                      # 全量分析
  python examples/analyze_zhaopin.py --keyword AI         # 只看某关键词
  python examples/analyze_zhaopin.py --city 北京          # 只看某城市
  python examples/analyze_zhaopin.py --top 15             # 排行榜取前 15
  python examples/analyze_zhaopin.py --db zhaopin_jobs.db
 """
 from __future__ import annotations
 import argparse
 import json
 import re
 import sqlite3
 import statistics
 from collections import Counter
 WORKDAYS_PER_MONTH = 21.75  # 日薪 -> 月薪折算
 def parse_salary(s: str) -> tuple[int, int] | None:
    """把薪资字符串解析成 (月薪下限, 月薪上限) 元。无法解析返回 None。"""
    if not s or "面议" in s:
        return None
    s = s.strip()
    is_daily = "/天" in s or "元/天" in s
    is_wan = "万" in s
    nums = re.findall(r"\d+(?:\.\d+)?", s)
    if not nums:
        return None
    vals = [float(x) for x in nums[:2]]
    if len(vals) == 1:
        vals = [vals[0], vals[0]]
    lo, hi = vals[0], vals[1]
    if is_wan:
        lo, hi = lo * 10000, hi * 10000
    if is_daily:
        lo, hi = lo * WORKDAYS_PER_MONTH, hi * WORKDAYS_PER_MONTH
    return int(lo), int(hi)
 def city_of(location: str) -> str:
    return (location or "").split("·")[0].strip() or "未知"
 SALARY_BUCKETS = [
    (0, 5000, "<5k"),
    (5000, 8000, "5-8k"),
    (8000, 12000, "8-12k"),
    (12000, 18000, "12-18k"),
    (18000, 25000, "18-25k"),
    (25000, 10**9, ">=25k"),
 ]
 def bucket_of(mid: float) -> str:
    for lo, hi, label in SALARY_BUCKETS:
        if lo <= mid < hi:
            return label
    return "?"
 def bar(n: int, maxn: int, width: int = 30) -> str:
    if maxn <= 0:
        return ""
    return "█" * max(1, round(n / maxn * width))
 def section(title: str) -> None:
    print("\n" + "=" * 56)
    print(title)
    print("=" * 56)
 def rank_table(counter: Counter, top: int, label: str) -> None:
    if not counter:
        print("  (无数据)")
        return
    maxn = counter.most_common(1)[0][1]
    for name, n in counter.most_common(top):
        print(f"  {name[:18]:<18} {n:>4}  {bar(n, maxn)}")
 def main() -> None:
    ap = argparse.ArgumentParser()
    ap.add_argument("--db", default="zhaopin_jobs.db")
    ap.add_argument("--keyword", default=None, help="按关键词过滤")
    ap.add_argument("--city", default=None, help="按城市过滤")
    ap.add_argument("--top", type=int, default=12, help="排行榜条数")
    args = ap.parse_args()
    conn = sqlite3.connect(args.db)
    q = ("SELECT title,salary,experience,education,location,company,"
         "job_tags,company_tags,skills FROM jobs")
    params: list = []
    if args.keyword:
        q += " WHERE keyword=?"
        params.append(args.keyword)
    rows = conn.execute(q, params).fetchall()
    if args.city:
        rows = [r for r in rows if city_of(r[4]) == args.city]
    if not rows:
        print("没有匹配的数据。先用 scrape_zhaopin_full.py 抓一些, 或检查过滤条件。")
        return
    n_total = len(rows)
    print(f"分析样本: {n_total} 条"
          + (f" | 关键词={args.keyword}" if args.keyword else "")
          + (f" | 城市={args.city}" if args.city else ""))
    cities, edus, exps = Counter(), Counter(), Counter()
    skill_freq, salary_buckets = Counter(), Counter()
    mids: list[float] = []
    n_salary_parsed = 0
    for (title, salary, exp, edu, loc, comp, jtags, ctags, skills) in rows:
        cities[city_of(loc)] += 1
        edus[(edu or "未知").strip() or "未知"] += 1
        exps[(exp or "未知").strip() or "未知"] += 1
        # 技能: 合并 skills + job_tags
        for src in (skills, jtags):
            try:
                for t in json.loads(src or "[]"):
                    t = t.strip()
                    if t:
                        skill_freq[t] += 1
            except Exception:
                pass
        rng = parse_salary(salary)
        if rng:
            n_salary_parsed += 1
            mid = (rng[0] + rng[1]) / 2
            mids.append(mid)
            salary_buckets[bucket_of(mid)] += 1
    section("城市分布 (Top)")
    rank_table(cities, args.top, "城市")
    section("薪资区间分布 (按月薪中位点)")
    if mids:
        order = {label: i for i, (_, _, label) in enumerate(SALARY_BUCKETS)}
        maxn = max(salary_buckets.values())
        for label in sorted(salary_buckets, key=lambda x: order.get(x, 99)):
            n = salary_buckets[label]
            print(f"  {label:<8} {n:>4}  {bar(n, maxn)}")
        print(f"\n  可解析薪资: {n_salary_parsed}/{n_total} 条 (面议等已排除)")
        print(f"  月薪中位点  平均: {statistics.mean(mids):>8,.0f} 元")
        print(f"            中位数: {statistics.median(mids):>8,.0f} 元")
        print(f"          最低/最高: {min(mids):,.0f} / {max(mids):,.0f} 元")
    else:
        print("  (无可解析薪资)")
    section("学历要求")
    rank_table(edus, args.top, "学历")
    section("经验要求")
    rank_table(exps, args.top, "经验")
    section("技能/标签词频 (Top)")
    rank_table(skill_freq, args.top, "技能")
 if __name__ == "__main__":
    main()
@@ -0,0 +1,120 @@
 """抓取智联招聘「AI 相关」岗位 —— 免登录, 渲染搜索结果页直接解析。
 合规提醒:
  - 仅抓取公开展示的岗位标题/薪资/公司/标签等字段。
  - 不抓取招聘者个人联系方式; 低频请求, 遵守目标站点服务条款, 风险自负。
 用法:
  python examples/scrape_zhaopin_ai.py                         # 默认抓 "AI" 前 3 页
  python examples/scrape_zhaopin_ai.py --keyword 大模型 --pages 5
  python examples/scrape_zhaopin_ai.py --keyword AI --jl 530   # 530=北京
  python examples/scrape_zhaopin_ai.py --headful               # 显示浏览器窗口
 城市编码(jl): 北京530 上海538 广州763 深圳765 杭州653 成都801 武汉736 南京635
 """
 from __future__ import annotations
 import argparse
 import csv
 import json
 import random
 import time
 from urllib.parse import quote
 from invisible_playwright import InvisiblePlaywright
 # 在渲染后的页面里一次性提取所有卡片, 比逐个 query 更快更稳。
 # 选择器来自对真实页面的探测 (见 _probe_zhaopin.py)。
 _EXTRACT_JS = r"""
 () => {
  const txt = (el) => el ? el.innerText.trim() : "";
  const cards = Array.from(document.querySelectorAll(".joblist-box__item"));
  return cards.map(card => {
    const nameA = card.querySelector("a.jobinfo__name");
    const info  = Array.from(card.querySelectorAll(".jobinfo__other-info-item"))
                       .map(e => e.innerText.trim());
    // 第一个 other-info 是地点(带图标), 取其 span; 其余按顺序是经验/学历
    const locSpan = card.querySelector(".jobinfo__other-info-item span");
    const companyA = card.querySelector("a.companyinfo__name");
    return {
      title:       txt(nameA),
      link:        nameA ? nameA.href : "",
      salary:      txt(card.querySelector(".jobinfo__salary")),
      job_tags:    Array.from(card.querySelectorAll(".jobinfo__tag .joblist-box__item-tag"))
                        .map(e => e.innerText.trim()),
      location:    locSpan ? locSpan.innerText.trim() : (info[0] || ""),
      experience:  info[1] || "",
      education:   info[2] || "",
      company:     companyA ? (companyA.getAttribute("title") || companyA.innerText).trim()
                            : "",
      company_url: companyA ? companyA.href : "",
      company_tags: Array.from(card.querySelectorAll(".companyinfo__tag .joblist-box__item-tag"))
                         .map(e => e.innerText.trim()),
    };
  });
 }
 """
 def scrape(keyword: str, pages: int, jl: str | None, headful: bool) -> list[dict]:
    results: list[dict] = []
    seen: set[str] = set()
    with InvisiblePlaywright(seed=42, headless=not headful) as browser:
        page = browser.new_page()
        for n in range(1, pages + 1):
            url = f"https://sou.zhaopin.com/?kw={quote(keyword)}&p={n}"
            if jl:
                url += f"&jl={jl}"
            try:
                page.goto(url, wait_until="domcontentloaded", timeout=60000)
                page.wait_for_selector(".joblist-box__item", timeout=20000)
            except Exception:
                print(f"第 {n} 页未加载出岗位列表, 跳过 (可能触发风控)。")
                continue
            page.wait_for_timeout(1500)  # 让懒加载内容补齐
            rows = page.evaluate(_EXTRACT_JS)
            new = 0
            for r in rows:
                key = r.get("link") or (r.get("title", "") + r.get("company", ""))
                if key and key not in seen:
                    seen.add(key)
                    results.append(r)
                    new += 1
            print(f"第 {n} 页: 抓到 {len(rows)} 条 (新增 {new})")
            time.sleep(random.uniform(3, 7))  # 低频, 降低风控
    return results
 def save(rows: list[dict], stem: str) -> None:
    with open(f"{stem}.json", "w", encoding="utf-8") as f:
        json.dump(rows, f, ensure_ascii=False, indent=2)
    if rows:
        fields = ["title", "salary", "experience", "education", "location",
                  "company", "job_tags", "company_tags", "link", "company_url"]
        with open(f"{stem}.csv", "w", newline="", encoding="utf-8-sig") as f:
            w = csv.DictWriter(f, fieldnames=fields, extrasaction="ignore")
            w.writeheader()
            for r in rows:
                row = dict(r)
                row["job_tags"] = " / ".join(row.get("job_tags") or [])
                row["company_tags"] = " / ".join(row.get("company_tags") or [])
                w.writerow(row)
    print(f"\n已保存 {len(rows)} 条 → {stem}.json / {stem}.csv")
 def main() -> None:
    ap = argparse.ArgumentParser()
    ap.add_argument("--keyword", default="AI", help="搜索关键词")
    ap.add_argument("--pages", type=int, default=3, help="抓取页数")
    ap.add_argument("--jl", default=None, help="城市编码, 不填=全国")
    ap.add_argument("--headful", action="store_true", help="显示浏览器窗口")
    args = ap.parse_args()
    rows = scrape(args.keyword, args.pages, args.jl, args.headful)
    save(rows, stem=f"zhaopin_{args.keyword}")
 if __name__ == "__main__":
    main()
@@ -0,0 +1,277 @@
 """智联招聘 AI 岗位爬虫（完整版）—— 免登录 + SQLite + 断点续抓 + 详情页 JD 全文。
 特性:
  * 写入 SQLite (INSERT OR IGNORE 去重, 每条提交, 中断不丢数据)
  * 断点续抓:
      - 列表阶段: 记录每个 (keyword, jl) 已抓到的最大页码, 重跑从下一页继续
      - 详情阶段: 只抓还没有 JD 的岗位, 重跑自动补齐
  * 详情页: 进入每个岗位页面抓 职位描述(JD)全文 + 技能标签
 合规提醒: 仅抓公开岗位字段, 不抓招聘者个人联系方式; 低频自用, 遵守站点条款。
 用法:
  # 抓列表(前5页) + 进详情页抓 JD
  python examples/scrape_zhaopin_full.py --keyword AI --pages 5
  # 只抓列表, 不进详情
  python examples/scrape_zhaopin_full.py --keyword 大模型 --pages 5 --no-detail
  # 中断后直接重跑同一命令 => 自动从断点继续
  python examples/scrape_zhaopin_full.py --keyword AI --pages 5
  # 导出已抓数据到 CSV
  python examples/scrape_zhaopin_full.py --export ai_jobs.csv
  # 重置某关键词的列表进度(重新从第1页抓)
  python examples/scrape_zhaopin_full.py --keyword AI --reset
 """
 from __future__ import annotations
 import argparse
 import csv
 import json
 import random
 import re
 import sqlite3
 import time
 from contextlib import closing
 from urllib.parse import quote
 from invisible_playwright import InvisiblePlaywright
 DB_DEFAULT = "zhaopin_jobs.db"
 _LIST_JS = r"""
 () => {
  const txt = (el) => el ? el.innerText.trim() : "";
  return Array.from(document.querySelectorAll(".joblist-box__item")).map(card => {
    const nameA = card.querySelector("a.jobinfo__name");
    const info  = Array.from(card.querySelectorAll(".jobinfo__other-info-item"))
                       .map(e => e.innerText.trim());
    const locSpan = card.querySelector(".jobinfo__other-info-item span");
    const companyA = card.querySelector("a.companyinfo__name");
    return {
      title:   txt(nameA),
      link:    nameA ? nameA.href : "",
      salary:  txt(card.querySelector(".jobinfo__salary")),
      job_tags: Array.from(card.querySelectorAll(".jobinfo__tag .joblist-box__item-tag"))
                     .map(e => e.innerText.trim()),
      location: locSpan ? locSpan.innerText.trim() : (info[0] || ""),
      experience: info[1] || "",
      education:  info[2] || "",
      company: companyA ? (companyA.getAttribute("title") || companyA.innerText).trim() : "",
      company_url: companyA ? companyA.href : "",
      company_tags: Array.from(card.querySelectorAll(".companyinfo__tag .joblist-box__item-tag"))
                         .map(e => e.innerText.trim()),
    };
  });
 }
 """
 _DETAIL_JS = r"""
 () => {
  const c = document.querySelector(".describtion-card__detail-content");
  const skills = Array.from(document.querySelectorAll(".describtion-card__skills-item"))
                      .map(e => e.innerText.trim());
  return { jd: c ? c.innerText.trim() : "", skills };
 }
 """
 # ── DB ──────────────────────────────────────────────────────────────────
 def init_db(path: str) -> sqlite3.Connection:
    conn = sqlite3.connect(path)
    conn.execute("""
        CREATE TABLE IF NOT EXISTS jobs (
            job_id       TEXT PRIMARY KEY,
            title        TEXT,
            salary       TEXT,
            experience   TEXT,
            education    TEXT,
            location     TEXT,
            company      TEXT,
            company_url  TEXT,
            job_tags     TEXT,
            company_tags TEXT,
            link         TEXT,
            jd_text      TEXT,
            skills       TEXT,
            keyword      TEXT,
            created_at   TEXT DEFAULT (datetime('now','localtime')),
            detail_at    TEXT
        )
    """)
    conn.execute("""
        CREATE TABLE IF NOT EXISTS progress (
            scope     TEXT PRIMARY KEY,   -- f"{keyword}|{jl}"
            last_page INTEGER DEFAULT 0
        )
    """)
    conn.commit()
    return conn
 def job_id_from_link(link: str) -> str:
    m = re.search(r"/jobdetail/([^.?/]+)\.htm", link)
    return m.group(1) if m else link
 def get_last_page(conn: sqlite3.Connection, scope: str) -> int:
    row = conn.execute("SELECT last_page FROM progress WHERE scope=?", (scope,)).fetchone()
    return row[0] if row else 0
 def set_last_page(conn: sqlite3.Connection, scope: str, page: int) -> None:
    conn.execute(
        "INSERT INTO progress(scope,last_page) VALUES(?,?) "
        "ON CONFLICT(scope) DO UPDATE SET last_page=excluded.last_page",
        (scope, page),
    )
    conn.commit()
 def upsert_job(conn: sqlite3.Connection, r: dict, keyword: str) -> bool:
    jid = job_id_from_link(r.get("link", ""))
    if not jid:
        return False
    cur = conn.execute(
        """INSERT OR IGNORE INTO jobs
           (job_id,title,salary,experience,education,location,company,
            company_url,job_tags,company_tags,link,keyword)
           VALUES(?,?,?,?,?,?,?,?,?,?,?,?)""",
        (jid, r.get("title"), r.get("salary"), r.get("experience"),
         r.get("education"), r.get("location"), r.get("company"),
         r.get("company_url"),
         json.dumps(r.get("job_tags") or [], ensure_ascii=False),
         json.dumps(r.get("company_tags") or [], ensure_ascii=False),
         r.get("link"), keyword),
    )
    conn.commit()
    return cur.rowcount > 0
 # ── 抓取 ────────────────────────────────────────────────────────────────
 def crawl_list(conn, page, keyword: str, pages: int, jl: str | None) -> None:
    scope = f"{keyword}|{jl or ''}"
    start = get_last_page(conn, scope) + 1
    if start > pages:
        print(f"[列表] '{keyword}' 已抓到第 {start-1} 页, 目标 {pages} 页, 无需续抓。")
        return
    print(f"[列表] '{keyword}' 从第 {start} 页抓到第 {pages} 页")
    for n in range(start, pages + 1):
        url = f"https://sou.zhaopin.com/?kw={quote(keyword)}&p={n}"
        if jl:
            url += f"&jl={jl}"
        try:
            page.goto(url, wait_until="domcontentloaded", timeout=60000)
            page.wait_for_selector(".joblist-box__item", timeout=20000)
        except Exception:
            print(f"  第 {n} 页未加载出列表, 停止本轮 (重跑可从此页续)。")
            break
        page.wait_for_timeout(1200)
        rows = page.evaluate(_LIST_JS)
        new = sum(upsert_job(conn, r, keyword) for r in rows)
        set_last_page(conn, scope, n)
        print(f"  第 {n} 页: {len(rows)} 条 (新增 {new})")
        time.sleep(random.uniform(3, 7))
 def crawl_details(conn, page, keyword: str | None, limit: int | None) -> None:
    q = "SELECT job_id,link FROM jobs WHERE (jd_text IS NULL OR jd_text='') AND link!=''"
    params: list = []
    if keyword:
        q += " AND keyword=?"
        params.append(keyword)
    q += " ORDER BY created_at"
    if limit:
        q += f" LIMIT {int(limit)}"
    todo = conn.execute(q, params).fetchall()
    if not todo:
        print("[详情] 没有待补充 JD 的岗位。")
        return
    print(f"[详情] 待抓 JD: {len(todo)} 条")
    for i, (jid, link) in enumerate(todo, 1):
        try:
            page.goto(link, wait_until="domcontentloaded", timeout=60000)
            page.wait_for_selector(".describtion-card__detail-content", timeout=15000)
            page.wait_for_timeout(800)
            data = page.evaluate(_DETAIL_JS)
        except Exception as e:
            print(f"  [{i}/{len(todo)}] {jid} 抓取失败: {str(e)[:60]}")
            continue
        conn.execute(
            "UPDATE jobs SET jd_text=?, skills=?, detail_at=datetime('now','localtime') "
            "WHERE job_id=?",
            (data.get("jd", ""),
             json.dumps(data.get("skills") or [], ensure_ascii=False), jid),
        )
        conn.commit()  # 逐条提交 => 中断安全
        jd_len = len(data.get("jd", ""))
        print(f"  [{i}/{len(todo)}] {jid} JD {jd_len} 字")
        time.sleep(random.uniform(2, 5))
 # ── 导出 ────────────────────────────────────────────────────────────────
 def export_csv(conn, path: str, keyword: str | None) -> None:
    q = ("SELECT title,salary,experience,education,location,company,"
         "job_tags,company_tags,skills,jd_text,link FROM jobs")
    params: list = []
    if keyword:
        q += " WHERE keyword=?"
        params.append(keyword)
    rows = conn.execute(q, params).fetchall()
    cols = ["title", "salary", "experience", "education", "location", "company",
            "job_tags", "company_tags", "skills", "jd_text", "link"]
    with open(path, "w", newline="", encoding="utf-8-sig") as f:
        w = csv.writer(f)
        w.writerow(cols)
        for row in rows:
            row = list(row)
            for idx in (6, 7, 8):  # json 数组列 -> 用 / 连接
                try:
                    row[idx] = " / ".join(json.loads(row[idx] or "[]"))
                except Exception:
                    pass
            w.writerow(row)
    print(f"已导出 {len(rows)} 条 -> {path}")
 # ── main ────────────────────────────────────────────────────────────────
 def main() -> None:
    ap = argparse.ArgumentParser()
    ap.add_argument("--keyword", default="AI", help="搜索关键词")
    ap.add_argument("--pages", type=int, default=3, help="列表抓取页数")
    ap.add_argument("--jl", default=None, help="城市编码, 不填=全国")
    ap.add_argument("--db", default=DB_DEFAULT, help="SQLite 文件路径")
    ap.add_argument("--no-detail", action="store_true", help="只抓列表, 不进详情页")
    ap.add_argument("--detail-limit", type=int, default=None, help="本轮最多抓多少条 JD")
    ap.add_argument("--headful", action="store_true", help="显示浏览器窗口")
    ap.add_argument("--reset", action="store_true", help="重置该关键词的列表进度")
    ap.add_argument("--export", metavar="CSV", help="导出已抓数据到 CSV 后退出")
    args = ap.parse_args()
    with closing(init_db(args.db)) as conn:
        if args.export:
            export_csv(conn, args.export, args.keyword if args.keyword != "AI" else None)
            return
        if args.reset:
            scope = f"{args.keyword}|{args.jl or ''}"
            conn.execute("DELETE FROM progress WHERE scope=?", (scope,))
            conn.commit()
            print(f"已重置进度: {scope}")
        with InvisiblePlaywright(seed=42, headless=not args.headful) as browser:
            page = browser.new_page()
            crawl_list(conn, page, args.keyword, args.pages, args.jl)
            if not args.no_detail:
                crawl_details(conn, page, args.keyword, args.detail_limit)
        total = conn.execute("SELECT COUNT(*) FROM jobs").fetchone()[0]
        with_jd = conn.execute(
            "SELECT COUNT(*) FROM jobs WHERE jd_text IS NOT NULL AND jd_text!=''"
        ).fetchone()[0]
        print(f"\n库内合计 {total} 条, 其中含 JD 全文 {with_jd} 条 -> {args.db}")
 if __name__ == "__main__":
    main()
@@ -0,0 +1,112 @@
 """抓取 BOSS 直聘「AI 相关」岗位（自用 / 学习用途）。
 合规提醒:
  - 仅抓取公开展示的岗位标题/薪资/公司等字段, 不抓取招聘者个人联系方式。
  - 低频请求, 遵守目标站点服务条款; 风险自负。
 首次使用:
  python scrape_zhipin_ai.py --login      # 打开浏览器, 手动扫码登录一次
 之后:
  python scrape_zhipin_ai.py --keyword AI --city 101010100 --pages 3
 """
 from __future__ import annotations
 import argparse
 import csv
 import json
 import random
 import time
 from pathlib import Path
 from urllib.parse import quote
 from invisible_playwright import InvisiblePlaywright
 PROFILE_DIR = Path(__file__).parent / ".zhipin_profile"  # 持久化登录态
 SEED = 20240614  # 固定 seed → 跨会话指纹一致, 配合持久化 profile
 def login_flow() -> None:
    """首次手动登录: 打开页面, 你扫码, 登录态写入 PROFILE_DIR。"""
    with InvisiblePlaywright(seed=SEED, profile_dir=PROFILE_DIR) as ctx:
        page = ctx.new_page()
        page.goto("https://www.zhipin.com/web/user/?ka=header-login",
                  wait_until="domcontentloaded")
        print("请在打开的浏览器中扫码登录, 登录完成后回到终端按回车...")
        input()  # 等你登录完成
        print("登录态已保存到", PROFILE_DIR)
 def scrape(keyword: str, city: str, pages: int) -> list[dict]:
    results: list[dict] = []
    with InvisiblePlaywright(seed=SEED, profile_dir=PROFILE_DIR) as ctx:
        page = ctx.new_page()
        for n in range(1, pages + 1):
            url = (
                "https://www.zhipin.com/web/geek/job"
                f"?query={quote(keyword)}&city={city}&page={n}"
            )
            page.goto(url, wait_until="domcontentloaded")
            # 等列表渲染; 选择器需按实际页面结构核对/调整
            try:
                page.wait_for_selector("li.job-card-wrapper", timeout=15000)
            except Exception:
                print(f"第 {n} 页未出现岗位列表, 可能需要登录或触发了验证码。")
                # 给你时间手动过验证码
                input("处理完页面后按回车继续...")
            rows = page.eval_on_selector_all(
                "li.job-card-wrapper",
                """els => els.map(e => ({
                    title:   e.querySelector('.job-name')?.innerText?.trim(),
                    salary:  e.querySelector('.salary')?.innerText?.trim(),
                    company: e.querySelector('.company-name')?.innerText?.trim(),
                    tags:    Array.from(e.querySelectorAll('.tag-list li'))
                                  .map(t => t.innerText.trim()),
                    area:    e.querySelector('.job-area')?.innerText?.trim(),
                    link:    e.querySelector('a.job-card-left')?.href
                             || e.querySelector('a')?.href,
                }))""",
            )
            print(f"第 {n} 页抓到 {len(rows)} 条")
            results.extend(rows)
            # 低频: 随机停顿, 降低风控触发概率
            time.sleep(random.uniform(4, 9))
    return results
 def save(rows: list[dict], stem: str) -> None:
    Path(f"{stem}.json").write_text(
        json.dumps(rows, ensure_ascii=False, indent=2), encoding="utf-8"
    )
    if rows:
        keys = ["title", "salary", "company", "area", "tags", "link"]
        with open(f"{stem}.csv", "w", newline="", encoding="utf-8-sig") as f:
            w = csv.DictWriter(f, fieldnames=keys, extrasaction="ignore")
            w.writeheader()
            for r in rows:
                r = dict(r)
                r["tags"] = " / ".join(r.get("tags") or [])
                w.writerow(r)
    print(f"已保存 {len(rows)} 条 → {stem}.json / {stem}.csv")
 def main() -> None:
    ap = argparse.ArgumentParser()
    ap.add_argument("--login", action="store_true", help="首次手动登录")
    ap.add_argument("--keyword", default="AI", help="搜索关键词")
    ap.add_argument("--city", default="101010100", help="城市编码 (101010100=北京)")
    ap.add_argument("--pages", type=int, default=3, help="抓取页数")
    args = ap.parse_args()
    if args.login:
        login_flow()
        return
    rows = scrape(args.keyword, args.city, args.pages)
    save(rows, stem=f"zhipin_{args.keyword}")
 if __name__ == "__main__":
    main()
@@ -0,0 +1,114 @@
 #!/usr/bin/env python3
 """Generate the GitHub release body for a firefox-N build from the actual
 invisible_firefox commits that went into it.
 The release tag (firefox-N) lives on the wrapper, but the binary's changes live
 on the SOURCE repo (feder-cr/invisible_firefox). We never deep-clone that history
 (it's a full Firefox fork); instead we use GitHub's compare API to list the
 commits between the PREVIOUS release's source commit and this one, and turn their
 subject lines into a short human-readable "What changed" list.
  - The previous release's source commit comes from its ``source-commit.txt``
    asset (this script's own output uploads one for the next run to read).
  - If there's no previous source commit (first automated release) or the compare
    fails, we fall back to a body WITHOUT the changelog section — publishing must
    never break on note generation.
 This is NOT an LLM and NOT a raw ``git log`` dump: it filters out the
 non-user-facing commits (docs/chore/ci/test/style) and prints the remaining
 subjects as plain bullets. Quality rides on writing good commit subjects.
 Usage:
    python scripts/gen_release_notes.py --tag firefox-10 --current <sha> \
        [--prev-sha <sha>] [--source-repo feder-cr/invisible_firefox]
    # reads GITHUB_TOKEN from the env for the compare API (optional for public).
 """
 from __future__ import annotations
 import argparse
 import json
 import os
 import re
 import sys
 import urllib.request
 import urllib.error
 # Conventional-commit prefixes that never belong in user-facing release notes.
 _SKIP = re.compile(r"^(docs|chore|ci|test|style|build)(\(|:)", re.I)
 def _api(url: str, token: str | None) -> dict:
    headers = {"Accept": "application/vnd.github+json",
               "User-Agent": "invisible-playwright-release-notes"}
    if token:
        headers["Authorization"] = f"Bearer {token}"
    req = urllib.request.Request(url, headers=headers)
    with urllib.request.urlopen(req, timeout=30) as r:
        return json.load(r)
 def changelog_bullets(source_repo: str, prev_sha: str, current_sha: str,
                      token: str | None) -> list[str]:
    """Return the user-facing commit subjects in prev_sha..current_sha, or []."""
    if not prev_sha or not current_sha or prev_sha == current_sha:
        return []
    url = f"https://api.github.com/repos/{source_repo}/compare/{prev_sha}...{current_sha}"
    try:
        data = _api(url, token)
    except (urllib.error.URLError, urllib.error.HTTPError, ValueError) as e:
        print(f"[gen_release_notes] compare API failed ({e}); no changelog section",
              file=sys.stderr)
        return []
    bullets: list[str] = []
    for c in data.get("commits", []):
        subject = (c.get("commit", {}).get("message") or "").splitlines()[0].strip()
        if not subject or _SKIP.match(subject):
            continue
        bullets.append(subject.rstrip("."))
    return bullets
 def build_body(tag: str, current_sha: str, bullets: list[str]) -> str:
    m = re.search(r"(\d+)", tag)
    n = int(m.group(1)) if m else None
    prev_label = f"firefox-{n - 1}" if n else "the previous build"
    short = (current_sha or "")[:8]
    parts = ["Patched Firefox 150.0.1, the stealth build invisible_playwright drives.", ""]
    if bullets:
        parts.append(f"What changed since {prev_label}:")
        parts += [f"- {b}" for b in bullets]
        parts.append("")
    parts += [
        "Builds: Linux x86_64, Linux arm64, Windows x86_64, macOS arm64, macOS x86_64.",
        "",
        "Most people won't grab these by hand. The wrapper fetches the right one for "
        "your platform on first run:",
        "",
        "    pip install git+https://github.com/feder-cr/invisible_playwright",
        "",
        "If you do download manually, `checksums.txt` has the SHA256s. The macOS builds "
        "are ad-hoc signed (not notarized), so clear the quarantine flag: "
        "`xattr -dr com.apple.quarantine Firefox.app`",
    ]
    if short:
        parts += ["", f"Built from invisible_firefox @{short}."]
    return "\n".join(parts)
 def main() -> int:
    ap = argparse.ArgumentParser()
    ap.add_argument("--tag", required=True, help="release tag, e.g. firefox-10")
    ap.add_argument("--current", required=True, help="invisible_firefox SHA this build was built from")
    ap.add_argument("--prev-sha", default="", help="previous release's source SHA (omit for none)")
    ap.add_argument("--source-repo", default="feder-cr/invisible_firefox")
    args = ap.parse_args()
    token = os.environ.get("GITHUB_TOKEN") or os.environ.get("GH_TOKEN")
    bullets = changelog_bullets(args.source_repo, args.prev_sha, args.current, token)
    sys.stdout.write(build_body(args.tag, args.current, bullets))
    return 0
 if __name__ == "__main__":
    sys.exit(main())
@@ -75,9 +75,22 @@ class Network:
        self.nodes = _topsort(nodes)
        self.by_name = {n.name: n for n in self.nodes}
-    def sample(self, rng: random.Random) -> Dict[str, Any]:
+    def sample(
        self,
        rng: random.Random,
        evidence: Optional[Dict[str, Any]] = None,
    ) -> Dict[str, Any]:
        """Sample the network. ``evidence`` fixes named nodes BEFORE their children
        sample, so the children RE-CONDITION on the fixed value (not relabel after).
        Used to pin ``gpu_class`` to the validated WebGL persona's class so the whole
        bundle (cores/screen/fonts) stays coherent with the GPU we expose. Earlier
        nodes still sample (RNG stream preserved → per-seed determinism)."""
        evidence = evidence or {}
        context: Dict[str, Any] = {}
        for node in self.nodes:
            if node.name in evidence:
                context[node.name] = evidence[node.name]
            else:
                context[node.name] = node.sample(context, rng)
        return context
@@ -8,7 +8,7 @@ oscpu, webdriver=false, maxTouchPoints=0) is locked by the compiled build.
 Graph:
-    gpu (root, 444 real Windows ANGLE renderers)
+    gpu (root, 474 real Windows ANGLE renderers)
     │
     └─> gpu_class (deterministic classifier, 6 classes)
          ├─> hw_concurrency       (CPT per class)
@@ -28,7 +28,7 @@ Sampling is deterministic per stealth_seed via a private random.Random.
 import json
 import os
 import re
-from typing import Any, Dict
+from typing import Any, Dict, Optional
 from ._network import Network, Node
@@ -110,6 +110,16 @@ def classify_gpu(gpu_value: Dict[str, str]) -> str:
    if re.search(r"Intel.*HD Graphics (3000|4000|2500)", r):
        return "integrated_old"
    # Discrete Intel Arc DESKTOP/dGPU cards (A-series / B-series, e.g. A750,
    # A770, B580) are discrete GPUs (~RTX 3060 tier for A7xx), NOT the
    # integrated "Arc 130T/140T/Graphics" iGPUs in Core Ultra chips. Route the
    # discrete SKUs to a coherent discrete-GPU class so the conditioned bundle
    # (cores, screen, storage) matches a real discrete-GPU machine; A3xx are
    # entry discrete -> low_end, A5xx/A7xx/Bxxx -> mid_range. Bare "Arc 1x0(T/V)"
    # integrated names do NOT match and fall through to integrated_modern below.
    m = re.search(r"Intel.*\bArc(?:\(TM\))?\s+([AB])(\d)\d\d\b", r)
    if m:
        return "low_end" if m.group(2) == "3" else "mid_range"
    if re.search(
        r"Intel.*(HD Graphics (4[56]|5\d\d|6\d\d)|UHD Graphics|Graphics Family|Iris|Arc)",
        r,
@@ -328,7 +338,14 @@ class Forge:
        self.seed = int(seed)
        self._rng = random.Random(self.seed)
-    def sample(self) -> Dict[str, Any]:
+    def sample(self, fixed_gpu_class: Optional[str] = None) -> Dict[str, Any]:
        # fixed_gpu_class pins gpu_class so the WHOLE bundle (cores/screen/fonts) is
        # drawn coherently for the WebGL persona's class we expose on Windows/mac.
        # The default (no fix) path calls _NETWORK.sample(rng) with one arg so existing
        # monkeypatches/tests keep working.
        if fixed_gpu_class:
            bundle = _NETWORK.sample(self._rng, evidence={"gpu_class": fixed_gpu_class})
        else:
            bundle = _NETWORK.sample(self._rng)
        gpu = bundle["gpu"]
        screen = bundle["screen"]
@@ -339,7 +356,7 @@ class Forge:
            "stealth_seed": self.seed,
            # Locked identity
            **_LOCKED,
-            # GPU (coherent pair from 444 pool)
+            # GPU (coherent pair from 474 pool)
            "webgl_renderer": gpu["renderer"],
            "webgl_vendor": gpu["vendor"],
            "gpu_class": bundle["gpu_class"],
@@ -392,6 +409,6 @@ class Forge:
        }
-def sample(seed: int) -> Dict[str, Any]:
+def sample(seed: int, fixed_gpu_class: Optional[str] = None) -> Dict[str, Any]:
-    """Convenience: `Forge(seed).sample()`."""
+    """Convenience: `Forge(seed).sample(fixed_gpu_class)`."""
-    return Forge(seed).sample()
+    return Forge(seed).sample(fixed_gpu_class)
@@ -1,5 +1,5 @@
 {
-  "_meta": "audio (rate/latency/channels) given gpu_class",
+  "_meta": "audio (rate/latency/channels) given gpu_class. NOTE 2026-06-14: maxChannelCount reflects the OS DEFAULT OUTPUT DEVICE (stereo for the vast majority of users), NOT the GPU — so channels=2 dominates every class (~78-92%) with only a small 6/8 surround tail. The previous tables emitted 45-100% surround on mid/high/workstation, which is unrealistic and lifted FP Pro tampering_ml (surround on a typical consumer profile reads as a coherence anomaly). Rate/latency tuples are unchanged.",
  "table": {
    "integrated_old": [
      {
@@ -26,7 +26,7 @@
          "latency": 30,
          "channels": 2
        },
-        "prob": 0.6
+        "prob": 0.62
      },
      {
        "value": {
@@ -34,7 +34,7 @@
          "latency": 40,
          "channels": 2
        },
-        "prob": 0.25
+        "prob": 0.3
      },
      {
        "value": {
@@ -42,7 +42,7 @@
          "latency": 25,
          "channels": 6
        },
-        "prob": 0.15
+        "prob": 0.08
      }
    ],
    "low_end": [
@@ -52,7 +52,7 @@
          "latency": 40,
          "channels": 2
        },
-        "prob": 0.55
+        "prob": 0.6
      },
      {
        "value": {
@@ -60,7 +60,7 @@
          "latency": 50,
          "channels": 2
        },
-        "prob": 0.3
+        "prob": 0.32
      },
      {
        "value": {
@@ -68,7 +68,7 @@
          "latency": 30,
          "channels": 6
        },
-        "prob": 0.15
+        "prob": 0.08
      }
    ],
    "mid_range": [
@@ -78,31 +78,39 @@
          "latency": 25,
          "channels": 2
        },
-        "prob": 0.45
+        "prob": 0.5
      },
      {
        "value": {
          "rate": 48000,
          "latency": 20,
-          "channels": 6
+          "channels": 2
        },
        "prob": 0.3
      },
      {
        "value": {
          "rate": 48000,
          "latency": 20,
          "channels": 8
        },
        "prob": 0.15
      },
      {
        "value": {
          "rate": 44100,
          "latency": 30,
          "channels": 2
        },
-        "prob": 0.1
+        "prob": 0.12
      },
      {
        "value": {
          "rate": 48000,
          "latency": 20,
          "channels": 6
        },
        "prob": 0.06
      },
      {
        "value": {
          "rate": 48000,
          "latency": 20,
          "channels": 8
        },
        "prob": 0.02
      }
    ],
    "high_end": [
@@ -110,51 +118,75 @@
        "value": {
          "rate": 48000,
          "latency": 15,
-          "channels": 6
+          "channels": 2
        },
-        "prob": 0.3
+        "prob": 0.6
      },
      {
        "value": {
-          "rate": 48000,
+          "rate": 96000,
          "latency": 15,
          "channels": 8
        },
        "prob": 0.3
      },
      {
        "value": {
          "rate": 48000,
          "latency": 15,
          "channels": 2
        },
-        "prob": 0.2
+        "prob": 0.18
      },
      {
        "value": {
-          "rate": 96000,
+          "rate": 48000,
          "latency": 15,
          "channels": 6
        },
        "prob": 0.1
      },
      {
        "value": {
          "rate": 48000,
          "latency": 15,
          "channels": 8
        },
        "prob": 0.05
      },
      {
        "value": {
          "rate": 96000,
          "latency": 15,
          "channels": 6
        },
        "prob": 0.05
      },
      {
        "value": {
          "rate": 96000,
          "latency": 15,
          "channels": 8
        },
-        "prob": 0.1
+        "prob": 0.02
      }
    ],
    "workstation": [
      {
        "value": {
          "rate": 48000,
          "latency": 10,
          "channels": 2
        },
        "prob": 0.45
      },
      {
        "value": {
          "rate": 96000,
          "latency": 10,
          "channels": 2
        },
        "prob": 0.2
      },
      {
        "value": {
          "rate": 48000,
          "latency": 10,
          "channels": 8
        },
-        "prob": 0.25
+        "prob": 0.12
      },
      {
        "value": {
@@ -162,7 +194,7 @@
          "latency": 10,
          "channels": 8
        },
-        "prob": 0.3
+        "prob": 0.1
      },
      {
        "value": {
@@ -170,7 +202,7 @@
          "latency": 10,
          "channels": 6
        },
-        "prob": 0.2
+        "prob": 0.08
      },
      {
        "value": {
@@ -178,15 +210,7 @@
          "latency": 10,
          "channels": 8
        },
-        "prob": 0.15
+        "prob": 0.05
      },
      {
        "value": {
          "rate": 48000,
          "latency": 15,
          "channels": 6
        },
        "prob": 0.1
      }
    ]
  }
@@ -36,29 +36,21 @@
      },
      {
        "value": 8,
-        "prob": 0.3
+        "prob": 0.35
      },
      {
        "value": 12,
        "prob": 0.05
      }
    ],
    "[\"integrated_modern\", \"budget\"]": [
      {
        "value": 4,
        "prob": 0.55
      },
      {
        "value": 6,
-        "prob": 0.2
+        "prob": 0.45
      },
      {
        "value": 8,
-        "prob": 0.2
+        "prob": 0.4
      },
      {
        "value": 12,
-        "prob": 0.05
+        "prob": 0.15
      }
    ],
    "[\"integrated_modern\", \"standard\"]": [
@@ -178,11 +170,7 @@
      },
      {
        "value": 12,
-        "prob": 0.1
+        "prob": 0.15
      },
      {
        "value": 16,
        "prob": 0.05
      }
    ],
    "[\"mid_range\", \"standard\"]": [
@@ -108,16 +108,6 @@
      }
    ],
    "[\"integrated_modern\", \"budget\"]": [
      {
        "value": {
          "w": 1366,
          "h": 768,
          "aw": 1366,
          "ah": 728,
          "dpr": 1.0
        },
        "prob": 0.3
      },
      {
        "value": {
          "w": 1920,
@@ -126,14 +116,24 @@
          "ah": 1040,
          "dpr": 1.0
        },
-        "prob": 0.65
+        "prob": 0.8
      },
      {
        "value": {
-          "w": 1600,
+          "w": 2560,
-          "h": 900,
+          "h": 1440,
-          "aw": 1600,
+          "aw": 2560,
-          "ah": 860,
+          "ah": 1400,
          "dpr": 1.0
        },
        "prob": 0.15
      },
      {
        "value": {
          "w": 1920,
          "h": 1200,
          "aw": 1920,
          "ah": 1160,
          "dpr": 1.0
        },
        "prob": 0.05
@@ -48,29 +48,21 @@
      },
      {
        "value": 500000,
-        "prob": 0.3
+        "prob": 0.35
      },
      {
        "value": 1000000,
        "prob": 0.05
      }
    ],
    "[\"integrated_modern\", \"budget\"]": [
      {
        "value": 64000,
        "prob": 0.2
      },
      {
        "value": 128000,
        "prob": 0.3
      },
      {
        "value": 256000,
        "prob": 0.3
      },
      {
        "value": 500000,
-        "prob": 0.2
+        "prob": 0.45
      },
      {
        "value": 1000000,
        "prob": 0.25
      }
    ],
    "[\"integrated_modern\", \"standard\"]": [
@@ -178,7 +178,11 @@ def _apply_pins_to_raw(raw: Dict[str, Any], pin: Dict[str, Any]) -> Dict[str, An
    return out
-def generate_profile(seed: int, pin: Optional[Dict[str, Any]] = None) -> Profile:
+def generate_profile(
    seed: int,
    pin: Optional[Dict[str, Any]] = None,
    fixed_gpu_class: Optional[str] = None,
 ) -> Profile:
    """Return a deterministic Profile for the given integer seed.
    pin: optional dict of dotted-path keys (e.g. "screen.width", "gpu.renderer")
@@ -215,7 +219,11 @@ def generate_profile(seed: int, pin: Optional[Dict[str, Any]] = None) -> Profile
        for key in pin:
            _validate_pin_key(key)
-    raw = _sample_raw(int(seed))
+    # fixed_gpu_class re-conditions the whole bundle on a chosen class (used so the
    # bundle stays coherent with the validated WebGL persona we expose on Windows/mac).
    # An explicit gpu.class_tier pin still wins.
    eff_class = (pin or {}).get("gpu.class_tier") or fixed_gpu_class
    raw = _sample_raw(int(seed), fixed_gpu_class=eff_class)
    if pin:
        raw = _apply_pins_to_raw(raw, pin)
@@ -0,0 +1,163 @@
 """Empirically-calibrated WebGL GPU personas for Windows ANGLE D3D11.
 We expose a FALSE GPU (this is a multi-user tool — never leak each host's real GPU),
 chosen deterministically per seed from a small set of renderer-string "buckets" that
 Firefox's SanitizeRenderer emits and that FP Pro's tampering_ml scores as CLEAN.
 ## What actually gates a persona (calibrated 2026-06-14, supersedes the old theory)
 The blocker is NOT anti_detect and NOT a "render-vs-renderer" check. It is FP Pro's
 **tampering_ml** (gate <=0.5), a holistic ML coherence score. We reverse-engineered its
 GPU sensitivity with single-variable A/Bs on demo.fingerprint.com (deterministic per
 (seed, renderer, IP); tools in tests/_gpu_isolate.py / _gpu_landscape.py / _gpu_sweep.py /
 _gpu_sweep2.py / _gpu_persona_pure.py). Findings:
  1. tampering_ml = f(renderer STRING, seed baseline = canvas/audio). The renderer string
     carries a STABLE per-bucket penalty; the seed sets the floor it adds to.
  2. gpu_class is IRRELEVANT to tampering_ml (nv_980 scored identically on mid_range /
     high_end / premium / workstation). So pairing a fake GPU with a "matching" hardware
     tier does NOT help the score (we still set a coherent class — see gpu_class below —
     for OTHER detectors that cross-check cores/screen, just not for this).
  3. It is NOT render-consistency: a cross-vendor AMD string is CLEAN on our Intel-Arc
     host. So the real silicon's pixels are not the dominant signal; falsifying to a
     different vendor works — IF the string is one FP Pro scores low.
 Sweep over all 10 Windows SanitizeRenderer buckets x 10 seeds (clean = tml<=0.5 AND not
 anti_detect), on our Intel Arc A750 host:
  - amd_r9 (Radeon R9 200 Series) ...... 10/10 clean, max tml 0.346   <- SHIP
  - intel_arc (Arc A750) ............... 10/10 clean, max tml 0.377   <- SHIP
  - amd_hd5850 ......................... 9/10 (fails the hardest seed)
  - amd_hd3200 / intel_hd .............. 6/10 (seed-dependent, risky)
  - intel_hd400 ........................ 3/10
  - ALL NVIDIA (8800/480/980) .......... 0/10 (penalized everywhere, ~0.7-0.99)
  - intel_945 (ancient Intel) .......... 0/10
 So only TWO buckets are robustly clean across profiles. We ship exactly those, weighted
 to real-world prevalence ("Radeon R9 200 Series" is the bucket for ALL modern AMD = a big
 real slice; "Arc A750" covers Intel discrete = rarer). Cross-vendor, so the fleet is not a
 single-GPU cluster. More names require lowering the seed floor first (see CAVEAT 2).
 ## ⚠️ CAVEATS
 1. HOST-INDEPENDENCE NOT PROVEN. Everything above was measured on ONE host (Intel Arc
    A750). The host's real render is embedded in the seed baseline, so the clean-bucket set
    *might* be host-dependent (on a real NVIDIA host, maybe nv_980 is clean and amd_r9 is
    not). This MUST be validated on a non-Arc machine before trusting it fleet-wide; if it
    turns out host-dependent, add a pre-launch host-GPU-class probe and pick a bucket per
    detected class. Until then: safe for Arc hosts (incl. the dev's), unvalidated elsewhere.
 2. DIVERSITY CEILING = 2 names because "hard" seeds (high canvas/audio floor, e.g. seed 4
    ~0.35) only stay clean on the 2 best buckets. Lowering that floor (an fpforge CPT fix —
    candidate: 8-channel audio + 1TB storage emitted on a mid_range profile) would unlock
    amd_hd5850 / intel_hd for more seeds => up to ~5 names. Follow-up, not done yet.
 ## Load-bearing format requirements (unchanged, still true)
 - renderer MUST end ", D3D11)" (full ANGLE wire format) or SanitizeRenderer returns
   "Generic Renderer" (a tell). The C++ passes our string through SanitizeRenderer, which
   buckets "AMD Radeon R9 200 Series" -> "Radeon R9 200 Series" and "Arc A750" -> itself.
 - the forced extension list MUST be the EXACT NATIVE ORDER getSupportedExtensions returns.
   The set+order is fixed by Firefox+ANGLE on D3D11 FL11_0 (VENDOR-INDEPENDENT — verified
   via 20-agent source study), so ONE list is correct for both personas. A reorder is caught
   (tampering_ml 0.34 -> 0.84). The lists below are the verbatim native-order Arc capture.
 Calibration data + sweep tooling live in the local workbench (not shipped).
 """
 from __future__ import annotations
 import sys
 from typing import Dict, List, Optional
 # Vendor-independent ext lists (native order, Arc host capture). Identical for every persona
 # because the set+order is fixed by Firefox+ANGLE on D3D11 FL11_0, not by the GPU vendor.
 _EXT1 = (
    "ANGLE_instanced_arrays,EXT_blend_minmax,EXT_color_buffer_half_float,EXT_float_blend,"
    "EXT_frag_depth,EXT_shader_texture_lod,EXT_sRGB,EXT_texture_compression_bptc,"
    "EXT_texture_compression_rgtc,EXT_texture_filter_anisotropic,OES_element_index_uint,"
    "OES_fbo_render_mipmap,OES_standard_derivatives,OES_texture_float,OES_texture_float_linear,"
    "OES_texture_half_float,OES_texture_half_float_linear,OES_vertex_array_object,"
    "WEBGL_color_buffer_float,WEBGL_compressed_texture_s3tc,WEBGL_compressed_texture_s3tc_srgb,"
    "WEBGL_debug_renderer_info,WEBGL_debug_shaders,WEBGL_depth_texture,WEBGL_draw_buffers,"
    "WEBGL_lose_context,WEBGL_provoking_vertex"
 )
 _EXT2 = (
    "EXT_color_buffer_float,EXT_float_blend,EXT_texture_compression_bptc,"
    "EXT_texture_compression_rgtc,EXT_texture_filter_anisotropic,OES_draw_buffers_indexed,"
    "OES_texture_float_linear,OVR_multiview2,WEBGL_compressed_texture_s3tc,"
    "WEBGL_compressed_texture_s3tc_srgb,WEBGL_debug_renderer_info,WEBGL_debug_shaders,"
    "WEBGL_lose_context,WEBGL_provoking_vertex"
 )
 def _p(key, renderer, vendor, gpu_class, weight):
    return {"key": key, "renderer": renderer, "vendor": vendor,
            "gpu_class": gpu_class, "weight": weight, "ext1": _EXT1, "ext2": _EXT2}
 # Only the two robustly-clean Windows buckets (calibration sweep 2026-06-14). Both discrete,
 # so gpu_class=mid_range keeps cores/screen coherent with the declared GPU for OTHER detectors
 # (gpu_class does NOT affect tampering_ml). Weights ~ real-world prevalence of the BUCKET:
 # "Radeon R9 200 Series" represents ALL modern AMD (large real slice); "Arc A750" = Intel
 # discrete (rarer). Cross-vendor => the fleet is not a single-GPU cluster.
 _PERSONAS: List[Dict] = [
    _p("amd_radeon_r9", "ANGLE (AMD, AMD Radeon R9 200 Series Direct3D11 vs_5_0 ps_5_0, D3D11)",
       "Google Inc. (AMD)", "mid_range", 70),    # -> bucket "Radeon R9 200 Series"; tml 0.03-0.35
    _p("intel_arc_a750", "ANGLE (Intel, Intel(R) Arc(TM) A750 Graphics Direct3D11 vs_5_0 ps_5_0, D3D11)",
       "Google Inc. (Intel)", "mid_range", 30),  # -> bucket "Intel(R) Arc(TM) A750 Graphics"; tml 0.02-0.38
 ]
 _TOTAL_W = sum(p["weight"] for p in _PERSONAS)
 # ENABLED: we falsify the GPU on Windows/mac. Validated clean on an Intel Arc host (see the
 # HOST-INDEPENDENCE caveat in the module docstring — unvalidated on non-Arc hosts). On Linux
 # select_persona returns None: there prefs.py spoofs profile.gpu.renderer directly.
 _ENABLED = True
 def select_persona(seed: int) -> Optional[Dict]:
    """Deterministic, prevalence-weighted persona for this seed (None on Linux).
    Same seed -> same persona (fppro_consistency: identity stable per seed). Different seeds
    spread across the persona mix by weight. None on Linux (the sampled profile.gpu.renderer
    is spoofed directly there).
    """
    if not _ENABLED or sys.platform.startswith("linux") or not _PERSONAS:
        return None
    h = (int(seed) * 2654435761) % _TOTAL_W
    cum = 0
    for p in _PERSONAS:
        cum += p["weight"]
        if h < cum:
            return p
    return _PERSONAS[-1]
 def forced_gpu_class(seed: int) -> Optional[str]:
    """The gpu_class the forge conditions the WHOLE bundle on (== the selected persona's class),
    so cores/screen/fonts stay coherent with the GPU we expose. Does NOT affect FP Pro
    tampering_ml (proven) but matters for detectors that cross-check hardware tier. None on Linux."""
    p = select_persona(seed)
    return p["gpu_class"] if p else None
 # ── Render-noise seed pool (canvas/WebGL gamma) ──────────────────────────────
 # zoom.stealth.fpp.hw_seed drives the per-seed canvas2D + WebGL readPixels gamma
 # LUT in C++. The render-image HASH it produces is the DOMINANT FP Pro tampering_ml
 # driver (proven 2026-06-14: holding a fixed profile and varying ONLY hw_seed moved
 # tml 0.25->0.75). The monotonic gamma preserves the GPU's render structure, so some
 # hw_seeds yield a "suspicious" render hash. We therefore DECOUPLE the render-noise
 # seed from the identity seed and pick from a calibrated pool of hw_seeds that score
 # CLEAN even on the hardest attribute profile (sweep 1..30 vs the worst seed: these
 # 14 all gave tml<=0.285). Diversity is preserved (14 distinct render hashes spread
 # across the population — real GPUs cluster to few canvas hashes anyway); identity
 # stays per-seed (the rest of the fingerprint differs). Same seed -> same render seed
 # (fppro_consistency holds).
 # CAVEAT: the render hash = f(host GPU render, gamma), so this pool is calibrated on
 # the Intel-Arc host. On other GPUs the clean set may differ (host-independence open,
 # same as the personas) — Option B (substitution = GPU-independent render hash) would
 # remove that dependency. Validate per-host or move to B before trusting fleet-wide.
 CLEAN_RENDER_SEEDS = [19, 10, 28, 24, 23, 16, 11, 30, 17, 22, 3, 9, 12, 26]
 def render_noise_seed(seed: int) -> int:
    """Deterministic clean render-noise seed for hw_seed (decoupled from identity).
    Maps the identity seed into CLEAN_RENDER_SEEDS so every session gets a calibrated
    clean canvas/WebGL render hash while keeping per-user diversity. Stable per seed."""
    return CLEAN_RENDER_SEEDS[(int(seed) * 2654435761) % len(CLEAN_RENDER_SEEDS)]
@@ -9,8 +9,9 @@ from typing import Any, Dict, Optional, Union
 from playwright.async_api import Browser, BrowserContext, Playwright, async_playwright
 from ._fpforge import Profile, generate_profile
 from ._webgl_personas import forced_gpu_class
 from ._geo import prepare_session_geo
-from ._headless import make_virtual_display
+from ._headless import cloak_prefs, make_virtual_display
 from ._proxy import configure_proxy as _configure_proxy_shared
 from .download import ensure_binary
 from .launcher import _CHROME_H, _CHROME_W, _TASKBAR_H, _tz_env
@@ -68,7 +69,9 @@ class InvisiblePlaywright:
        self._profile_dir: Optional[Path] = Path(profile_dir) if profile_dir else None
        # reCAPTCHA pre-seed gated server-side; respect persistent profile.
        self._prep_recaptcha = bool(prep_recaptcha) and self._profile_dir is None
-        self._profile: Profile = generate_profile(self.seed, pin=self._pin)
+        self._profile: Profile = generate_profile(
            self.seed, pin=self._pin, fixed_gpu_class=forced_gpu_class(self.seed)
        )
        self._pw: Optional[Playwright] = None
        self._browser: Optional[Browser] = None
        self._persistent_context: Optional[BrowserContext] = None
@@ -95,10 +98,19 @@ class InvisiblePlaywright:
            extra_prefs=self._extra_prefs,
            virtual_display=bool(self._headless and _sys.platform == "win32"),
        )
-        prefs["invisible_playwright.humanize"] = bool(self._humanize)
+        # Windows & macOS hide the headless window via the binary's own cloak
        # (DWMWA_CLOAK / NSWindow alpha) — inject the pref so the patched build
        # cloaks its chrome windows. setdefault: an explicit user override wins.
        # (Mirrors launcher._build_prefs; the sync path always did this, async
        # didn't — so async headless=True never cloaked AND crashed below.)
        if self._headless and _sys.platform in ("win32", "darwin"):
            for _k, _v in cloak_prefs().items():
                prefs.setdefault(_k, _v)
        # stealthfox.* is the namespace the binary's Juggler reads (see launcher.py note).
        prefs["stealthfox.humanize"] = bool(self._humanize)
        if self._humanize:
            cap = 1.5 if self._humanize is True else float(self._humanize)
-            prefs["invisible_playwright.humanize.maxTime"] = str(cap)
+            prefs["stealthfox.humanize.maxTime"] = str(cap)
        playwright_proxy = _configure_proxy_shared(self._proxy, prefs)
        pw_headless = self._resolve_headless()
        env = self._build_env()
@@ -223,6 +235,11 @@ class InvisiblePlaywright:
        if not self._headless:
            return False
        vd = make_virtual_display()
        # Linux: Xvfb to start. Windows/macOS: make_virtual_display() returns
        # None (the binary self-cloaks via cloak_prefs injected in __aenter__),
        # so there is nothing to start — guarding the None was the missing piece
        # that made async headless=True crash with AttributeError on Windows.
        if vd is not None:
            vd.start()
            self._virtual_display = vd
        return False
@@ -38,6 +38,7 @@ import secrets
 from typing import Any, Dict, List, Optional, Union
 from ._fpforge import generate_profile
 from ._webgl_personas import forced_gpu_class
 from .prefs import translate_profile_to_prefs
@@ -83,7 +84,7 @@ def get_default_stealth_prefs(
        ``playwright.firefox.launch()`` or ``launch_persistent_context()``.
    """
    resolved_seed = int(seed) if seed is not None else secrets.randbits(31)
-    profile = generate_profile(resolved_seed, pin=pin)
+    profile = generate_profile(resolved_seed, pin=pin, fixed_gpu_class=forced_gpu_class(resolved_seed))
    prefs = translate_profile_to_prefs(
        profile,
        locale=locale,
@@ -91,10 +92,11 @@ def get_default_stealth_prefs(
        extra_prefs=extra_prefs,
        virtual_display=virtual_display,
    )
-    prefs["invisible_playwright.humanize"] = bool(humanize)
+    # stealthfox.* is the namespace the binary's Juggler reads (see launcher.py note).
    prefs["stealthfox.humanize"] = bool(humanize)
    if humanize:
        max_seconds = float(humanize) if not isinstance(humanize, bool) else 1.5
-        prefs["invisible_playwright.humanize.maxTime"] = str(max_seconds)
+        prefs["stealthfox.humanize.maxTime"] = str(max_seconds)
    return prefs
@@ -7,7 +7,7 @@ bugfixes don't force a multi-hour Firefox rebuild.
 from __future__ import annotations
 # Bump this when a new patched Firefox build is released on GitHub.
-BINARY_VERSION: str = "firefox-9"
+BINARY_VERSION: str = "firefox-10"
 # Releases known to be broken — ensure_binary() refuses them with a clear error
 # instead of handing the user an unusable binary. firefox-8 was packaged without
@@ -8,6 +8,7 @@ from typing import Any, Dict, Optional, Union
 from playwright.sync_api import Browser, BrowserContext, Playwright, sync_playwright
 from ._fpforge import Profile, generate_profile
 from ._webgl_personas import forced_gpu_class
 from ._geo import prepare_session_geo
 from ._headless import cloak_prefs, make_virtual_display
 from ._proxy import configure_proxy as _configure_proxy_shared
@@ -178,7 +179,9 @@ class InvisiblePlaywright:
        # persistent profile_dir is in use, respect its existing cookies
        # and DON'T enable pre-seed (the profile owns its own state).
        self._prep_recaptcha = bool(prep_recaptcha) and self._profile_dir is None
-        self._profile: Profile = generate_profile(self.seed, pin=self._pin)
+        self._profile: Profile = generate_profile(
            self.seed, pin=self._pin, fixed_gpu_class=forced_gpu_class(self.seed)
        )
        self._pw: Optional[Playwright] = None
        self._browser: Optional[Browser] = None
        self._persistent_context: Optional[BrowserContext] = None
@@ -346,9 +349,13 @@ class InvisiblePlaywright:
        if self._headless and _sys.platform in ("win32", "darwin"):
            for _k, _v in cloak_prefs().items():
                prefs.setdefault(_k, _v)
-        prefs["invisible_playwright.humanize"] = bool(self._humanize)
+        # Pref namespace MUST be stealthfox.* — that's what the binary's Juggler
        # reads (PageHandler.js gates the Bezier mouse path on `stealthfox.humanize`).
        # The old `invisible_playwright.*` name was a dead no-op (nothing read it), so
        # humanize silently never fired and every click teleported the cursor.
        prefs["stealthfox.humanize"] = bool(self._humanize)
        if self._humanize:
-            prefs["invisible_playwright.humanize.maxTime"] = str(self._humanize_max_seconds())
+            prefs["stealthfox.humanize.maxTime"] = str(self._humanize_max_seconds())
        return prefs
    def _build_env(self) -> Dict[str, str]:
@@ -21,6 +21,7 @@ import sys
 from typing import Any, Dict, Optional
 from ._fpforge import Profile
 from ._webgl_personas import render_noise_seed, select_persona
 # ──────────────────────────────────────────────────────────────────────
@@ -231,6 +232,17 @@ _BASELINE: Dict[str, Any] = {
    "network.proxy.socks_remote_dns":                     True,
    "network.proxy.failover_direct":                      False,
    # TLS ClientHello fingerprint — match stock Firefox byte-for-byte.
    # The Playwright/Juggler Firefox build this binary derives from re-enables
    # cipher 0xC009 (TLS_ECDHE_ECDSA_WITH_AES_128_CBC_SHA), which retail Firefox
    # 150 does NOT offer. That extra (17th) cipher shifts our JA3/JA4 away from
    # any real Firefox (ja4 t13d1717h2 vs stock t13d1617h2). A ClientHello that
    # matches no real browser is itself a consistency tell. Disabling it makes
    # JA3/JA4/peetprint byte-identical to retail FF150 (verified on tls.peet.ws).
    # Stock Firefox ships without 0xC009 and works on the whole web, so this only
    # improves fingerprint consistency — it cannot break connectivity.
    "security.ssl3.ecdhe_ecdsa_aes_128_sha":              False,
    # Safebrowsing — chatty and fingerprintable.
    "browser.safebrowsing.malware.enabled":               False,
    "browser.safebrowsing.phishing.enabled":              False,
@@ -437,22 +449,41 @@ def _accept_language(locale: str) -> str:
 def _font_metrics_for_platform(profile_metrics: str) -> str:
    """Return ``zoom.stealth.font.metrics`` value.
-    Windows: empty string. The C++ width-scale hook is a no-op and
+    The C++ whitelist hook (``gfxPlatformFontList::FindAndAddFamiliesLocked``)
-    Firefox renders Arial/Segoe/Calibri/etc. at their native canonical
+    backs EVERY whitelisted *named* family with the list-head family on every
-    widths. Applying the Bayesian-sampled per-font factors on a Windows
+    platform. Without per-font width factors, that means each named font
-    build would *distort* real metrics and surface as a font_preferences
+    (Arial, Times New Roman, Courier New, …) renders with identical glyphs and
-    width anomaly to FP Pro / reCAPTCHA.
+    collapses to a SINGLE canvas ``measureText`` width — a non-physical
    1-distinct-width result that strict JS-sensor anti-bots flag via their
    font probe. The per-font factors in ``profile_metrics``
    (``arial|0.978,arial black|1.168,…``) spread the fabricated families back
    to distinct, realistic, deterministic-per-seed widths, so we apply them on
    EVERY platform (previously suppressed on Windows/mac, which left the
    collapse in place — only the CSS-generic vector, which FP Pro probes, was
    ever correct there).
-    Linux: prepend generic-family compensation factors so DejaVu /
+    These factors only key *named* families. CSS generics
-    Liberation render at the widths Windows JS expects, then append the
+    (serif/sans-serif/monospace/system-ui) bypass the whitelist entirely and
-    per-font factors that make each fabricated family detectable by
+    render at the host's native widths, so they are never present in
-    width-diff probes.
+    ``profile_metrics`` and stay unfactored — FP Pro's ``font_preferences``
    probe (which measures the generics) is unaffected. That is also why
    applying named-font factors here does NOT distort the canonical generic
    widths.
    Linux ADDITIONALLY needs generic-family compensation
    (``_LINUX_GENERIC_FONT_FACTORS``) because DejaVu/Liberation generics render
    wider/narrower than the Windows widths the spoofed profile claims; on
    Windows/mac the generics already render native, so no generic compensation
    is applied — only the named-font factors.
    """
    if not profile_metrics:
        return ""
    if sys.platform.startswith("linux"):
        return _LINUX_GENERIC_FONT_FACTORS + profile_metrics
-    return ""  # Windows: NEVER apply width-scale factors.
+    # Windows / macOS: named-font factors only (the generics render native and
    # bypass the whitelist, so no generic compensation — but the named families
    # MUST be factored or they all collapse to the list-head width).
    return profile_metrics
 def translate_profile_to_prefs(
@@ -479,21 +510,32 @@ def translate_profile_to_prefs(
    # GPU / WebGL renderer/vendor.
    # On Linux we spoof to a Windows ANGLE renderer string (profile.gpu.renderer)
    # so cross-platform sessions report a consistent Windows GPU identity.
-    # On Windows, spoofing a different GPU creates a renderer/parameters hash
+    # On Windows/mac, spoofing a renderer string ALONE is unsafe — the ~81
-    # mismatch: FP Pro hashes all 81 CN-set getParameter() values including
+    # getParameter values stay real, so a name↔params hash mismatch FP Pro flags
-    # enum 7937 (RENDERER). Setting GTX 980 while ANGLE returns Intel Arc A750
+    # (setting GTX 980 over real Arc A750 params scored ~0.70). Instead we apply a
-    # parameters produces an OOD (hash 23d0a74b vs vanilla 66544db) that FP Pro
+    # VALIDATED PERSONA (see _webgl_personas): a {renderer, vendor} whose params are
-    # ML scores at ~0.70 (confirmed: direct SF146 vs vanilla on same machine).
+    # the shared ANGLE D3D11 caps (vendor-independent — identical on any host, per the
-    # Fix: leave renderer/vendor empty on Windows → ANGLE reports native hardware
+    # ANGLE source) and whose extension list is FORCED below. That is a coherent fake
-    # (SanitizeRenderer path at ClientWebGLContext.cpp:2592-2595) → consistent.
+    # GPU that passes FP Pro host-independently (the host's real GPU never leaks). If no
    # validated persona exists for the sampled gpu_class yet, fall back to the host-real
    # renderer (empty → native ANGLE; SanitizeRenderer at ClientWebGLContext.cpp:2592).
    _persona = None
    if sys.platform.startswith("linux"):
        prefs["zoom.stealth.webgl.renderer"] = profile.gpu.renderer
        prefs["zoom.stealth.webgl.vendor"]   = profile.gpu.vendor
        _renderer_lo = (profile.gpu.renderer or "").lower()
    else:
        _persona = select_persona(profile.seed)
        if _persona:
            prefs["zoom.stealth.webgl.renderer"] = _persona["renderer"]
            prefs["zoom.stealth.webgl.vendor"]   = _persona["vendor"]
        else:
            prefs["zoom.stealth.webgl.renderer"] = ""
            prefs["zoom.stealth.webgl.vendor"]   = ""
-        _renderer_lo = "intel"  # test hardware is Intel Arc A750
+        # Canvas-noise mask is calibrated to the REAL host GPU's rendering variance — the canvas is
        # drawn by real hardware, NOT the persona's claimed GPU, so it must NOT follow the persona
        # (a non-Intel persona on an Intel host would over-noise). Deployment host is Intel.
        _renderer_lo = "intel"
    # MSAA: on Windows, pin to 4 (Firefox default for ANGLE) so gl.SAMPLES is
    # constant across all sessions. Different MSAA values cause different CN-set
@@ -522,7 +564,8 @@ def translate_profile_to_prefs(
    prefs["zoom.stealth.screen.dpr"]          = profile.screen.dpr
    prefs["layout.css.devPixelsPerPx"]        = str(profile.screen.dpr)
-    # Hardware
+    # Hardware — coherent with the sampled gpu_class by construction (the forge
    # draws hw_concurrency conditioned on the GPU class).
    prefs["zoom.stealth.hw_concurrency"]      = profile.hardware.concurrency
    prefs["zoom.stealth.storage.quota_mb"]    = profile.hardware.storage_quota_mb
@@ -566,8 +609,12 @@ def translate_profile_to_prefs(
    # Cross-process seed (canvas noise + DWrite gamma share this). Only
    # zoom.stealth.fpp.hw_seed is read by the C++; the old zoom.stealth.seed
    # alias was never declared in the yaml and read by nothing — dropped
-    # 2026-06-10.
+    # 2026-06-10. The render-noise seed is DECOUPLED from the identity seed and
-    prefs["zoom.stealth.fpp.hw_seed"] = profile.seed
+    # drawn from a calibrated CLEAN pool: the canvas/WebGL render HASH it drives
    # is the dominant FP Pro tampering_ml signal, and some hw_seeds yield a
    # "suspicious" render hash. render_noise_seed() maps to the clean pool while
    # keeping per-seed determinism + diversity. See _webgl_personas.
    prefs["zoom.stealth.fpp.hw_seed"] = render_noise_seed(profile.seed)
    # Synthetic host ICE candidate — injected by C++ when addr_ct==0 (SOCKS5
    # proxy suppresses all local addresses so Firefox can't gather host cands).
@@ -577,11 +624,20 @@ def translate_profile_to_prefs(
    _lan_ip = f"192.168.{(_s >> 8) % 254 + 1}.{_s % 254 + 1}"
    prefs["zoom.stealth.webrtc.host_ip"] = _lan_ip
-    # On Windows, native ANGLE extension list already matches real Windows users.
+    # Windows/mac extension list:
-    # The baseline hard-codes a curated _WEBGL1/2_EXTENSIONS list designed for
+    #  - persona active → FORCE the validated extension list. A non-Intel host's native
-    # Linux Mesa → clear it so Windows sessions report the native extension set
+    #    extensions would mismatch the persona's renderer (renderer says AMD/Intel-Arc but
-    # (hash matches real Intel Arc A750 vanilla captures).
+    #    extensions are the host's), so the persona must carry its own list to stay
    #    host-independent.
    #  - no persona → clear so the host-real renderer reports its native extension set
    #    (matches real vanilla captures for that host's GPU).
    if not sys.platform.startswith("linux"):
        if _persona:
            # The persona carries its OWN extension lists in EXACT NATIVE ORDER — a
            # reordered/foreign list is flagged by FP Pro (verified 2026-06-13).
            prefs["zoom.stealth.webgl.extensions"]  = _persona["ext1"]
            prefs["zoom.stealth.webgl2.extensions"] = _persona["ext2"]
        else:
            prefs["zoom.stealth.webgl.extensions"]  = ""
            prefs["zoom.stealth.webgl2.extensions"] = ""
@@ -91,9 +91,18 @@ def test_cloak_hides_window_but_keeps_rendering(firefox_binary):
        shot = page.screenshot()
        assert len(shot) > 3000, "cloaked window produced a blank screenshot (rendering paused)"
-        # 2) real WebGL present (native headless has none) -> headed pipeline intact.
+        # 2) headed pipeline intact: a real WebGL context (Playwright's native
        #    headless has none). Linux (Xvfb + llvmpipe) and Windows (WARP) give a
        #    software context on the GPU-less runners, so a missing context there
        #    is a real regression -> hard fail. macOS GitHub runners expose NO
        #    WebGL in the CI session at all (even vanilla Firefox), and macOS has
        #    no software-GL fallback; the cloak's "still rendering" property is
        #    already proven by the non-blank screenshot above, so we don't also
        #    require a live WebGL context there.
        renderer = page.evaluate(_WEBGL_RENDERER)
-        assert renderer and renderer != "NO-WEBGL", f"no real WebGL under cloak: {renderer!r}"
+        webgl_ok = bool(renderer) and renderer != "NO-WEBGL"
        if not (sys.platform == "darwin" and not webgl_ok):
            assert webgl_ok, f"no real WebGL under cloak: {renderer!r}"
        # 3) the window is actually hidden (per-platform).
        if sys.platform == "win32":
@@ -6,21 +6,32 @@ libraries and uses their FULL API surface:
  * BotD (@fingerprintjs/botd, MIT) — the client-side bot detector that
    FingerprintJS Pro itself uses. We assert the aggregate verdict
    (``detect().bot == False``) AND every one of its ~18 individual detectors
-    (``getDetections()``) returns ``bot == False``. The per-detector view is
+    (``getDetections()``) returns ``bot == False``.
    why we could delete our hand-rolled ``test_botd_*`` mirrors — the real
    library now covers each detector, with the same granularity.
  * FingerprintJS open-source (MIT) — ``get()`` must return a ``visitorId``
-    that is STABLE across two fresh launches with the same seed (an
+    that is STABLE across two fresh launches with the same seed, and a RICH
-    over-randomized spoof drifts), and a RICH component set (the fingerprint
+    component set (the fingerprint surface is real, not a stub).
-    surface is real, not a stub).
+  * fpscanner (antoinevastel/fpscanner 1.0.6, MIT) — ``collectFingerprint()``
    runs ~21 bot-detection rules in the browser. We assert the **engine-agnostic**
    subset (webdriver / selenium / bot-UA / platform / timezone / language) is
    clean. We deliberately do NOT assert the Chrome/GPU-only rules (hasCDP,
    hasPlaywright, hasSwiftshaderRenderer, hasMissingChromeObject, …): they're
    trivially clean on Firefox, and the GPU ones can legitimately fire on a
    software-WebGL CI host (Xvfb/llvmpipe) — asserting them would false-red.
  * CreepJS (abrahamjuliot/creepjs, MIT, pinned) — the gold-standard Firefox-aware
    headless/stealth/lie detector. It exposes its result on ``window.Fingerprint``.
    We assert ``headlessRating == 0`` (webdriver + headless-UA tells) and the
    JS-proxy stealth tells are absent. ``stealthRating`` / ``totalLies`` /
    ``likeHeadlessRating`` are LOGGED, not hard-asserted, because some of their
    sub-signals (hasBadWebGL, prefers-light-color) are GPU/theme-sensitive and
    differ on a GPU-less CI host.
 Everything is hermetic: the libraries are vendored (tests/vendor/) and served
-from a localhost HTTP server — no external CDN call (Firefox tracking-protection
+from a localhost HTTP server — no external CDN call. For CreepJS, every non-local
-blocks the CDN anyway) and no IP/network dependency. Runs identically on a dev
+request is aborted, so its optional crowd-comparison POST never runs and the
-box and on a GitHub runner.
+verdict is computed purely locally. Runs identically on a dev box and a GH runner.
-NOT covered: FingerprintJS *Pro* (commercial, server-side, IP/residential
+NOT covered: FingerprintJS *Pro* (commercial, server-side) — stays the local
-analysis) — can't be self-hosted, stays the local realness gate.
+realness gate.
 """
 from __future__ import annotations
@@ -36,19 +47,33 @@ from invisible_playwright import InvisiblePlaywright
 _VENDOR = Path(__file__).parent / "vendor"
 _BOTD = "botd-2.0.0.esm.js"
 _FPJS = "fingerprintjs-5.2.0.umd.min.js"
 _FPSCANNER = "fpscanner-1.0.6.es.js"
 _CREEPJS = "creepjs-10aa672.js"  # pinned abrahamjuliot/creepjs@10aa6724
 # fpscanner rules that are MEANINGFUL on Firefox and GPU-independent — these must
 # stay clean. The omitted rules are Chrome-only (hasCDP/hasPlaywright/
 # hasMissingChromeObject/hasHighCPUCount/hasImpossibleDeviceMemory/
 # headlessChromeScreenResolution) or GPU-sensitive on a software-WebGL CI host
 # (hasSwiftshaderRenderer/hasGPUMismatch/hasMismatchWebGLInWorker).
 _FPSCANNER_AGNOSTIC = [
    "hasWebdriver", "hasWebdriverIframe", "hasWebdriverWorker", "hasWebdriverWritable",
    "hasSeleniumProperty", "hasBotUserAgent", "hasPlatformMismatch",
    "hasMismatchLanguages", "hasUTCTimezone", "hasMismatchPlatformIframe",
    "hasMismatchPlatformWorker", "hasInconsistentEtsl",
 ]
 _PAGE = f"""<!doctype html><html><head><meta charset="utf-8">
 <title>detectors</title>
 <script src="/{_FPJS}"></script>
 </head><body><h1 id="state">loading</h1>
 <script type="module">
-window.__botd = null; window.__fp = null; window.__err = "";
+window.__botd = null; window.__fp = null; window.__fps = null; window.__err = "";
 (async () => {{
  try {{
    const Botd = await import("/{_BOTD}");
-    const botd = await Botd.load();          // load() collects internally
+    const botd = await Botd.load();
-    const verdict = botd.detect();           // {{bot:false}} | {{bot:true,botKind}}
+    const verdict = botd.detect();
-    const raw = botd.getDetections() || {{}}; // per-detector verdicts
+    const raw = botd.getDetections() || {{}};
    const detections = {{}};
    for (const k in raw) detections[k] = {{ bot: raw[k].bot, botKind: raw[k].botKind || null }};
    window.__botd = {{ bot: verdict.bot, botKind: verdict.botKind || null, detections }};
@@ -60,24 +85,41 @@ window.__botd = null; window.__fp = null; window.__err = "";
    const errored = keys.filter(k => r.components[k] && "error" in r.components[k]);
    window.__fp = {{ visitorId: r.visitorId, componentKeys: keys, erroredComponents: errored }};
  }} catch (e) {{ window.__err += " fp:" + e; }}
  try {{
    const M = await import("/{_FPSCANNER}");
    const scanner = new M.default();
    const fp = await scanner.collectFingerprint({{ encrypt: false }});
    window.__fps = {{ fastBotDetection: fp.fastBotDetection, details: fp.fastBotDetectionDetails }};
  }} catch (e) {{ window.__err += " fps:" + e; }}
  document.getElementById("state").textContent = "done";
 }})();
 </script></body></html>"""
 # CreepJS gets its own page: creep.js is a plain `defer` script that runs on load
 # and populates window.Fingerprint. A minimal DOM is enough (the rich report DOM
 # is only for the visual page, not the computation).
 _CREEP_PAGE = f"""<!doctype html><html><head><meta charset="utf-8"><title>creep</title></head>
 <body><div id="fingerprint-data"></div><script src="/{_CREEPJS}" defer></script></body></html>"""
 class _DetectorSite:
-    """Localhost server: `/` → the page; `/<lib>` → the vendored bundle."""
+    """Localhost server: `/` → BotD+FPJS+fpscanner page, `/creepjs` → CreepJS page,
    `/<file>` → the vendored bundle."""
    def __init__(self):
        page = _PAGE.encode()
        creep_page = _CREEP_PAGE.encode()
        vendor = _VENDOR
        class H(http.server.BaseHTTPRequestHandler):
            def do_GET(self):  # noqa: N802
-                if self.path == "/" or self.path.startswith("/?"):
+                p = self.path.split("?")[0]
                if p == "/":
                    body, ctype = page, "text/html; charset=utf-8"
                elif p == "/creepjs":
                    body, ctype = creep_page, "text/html; charset=utf-8"
                else:
-                    f = vendor / Path(self.path.lstrip("/")).name
+                    f = vendor / Path(p.lstrip("/")).name
                    if not f.is_file():
                        self.send_error(404); return
                    body = f.read_bytes()
@@ -99,6 +141,10 @@ class _DetectorSite:
    def url(self):
        return f"http://127.0.0.1:{self.port}/"
    @property
    def creep_url(self):
        return f"http://127.0.0.1:{self.port}/creepjs"
    def close(self):
        self._srv.shutdown()
@@ -111,7 +157,7 @@ def detector_site():
 def _run_detectors(firefox_binary, url):
-    """Launch the binary, load the page, return (botd, fp, err)."""
+    """Launch the binary, load the page, return (botd, fp, fps, err)."""
    with InvisiblePlaywright(seed=42, binary_path=firefox_binary) as browser:
        page = browser.new_page()
        page.goto(url, wait_until="load", timeout=45000)
@@ -121,16 +167,48 @@ def _run_detectors(firefox_binary, url):
        )
        botd = page.evaluate("() => window.__botd")
        fp = page.evaluate("() => window.__fp")
        fps = page.evaluate("() => window.__fps")
        err = page.evaluate("() => window.__err")
-    return botd, fp, err
+    return botd, fp, fps, err
 def _run_creepjs(firefox_binary, creep_url):
    """Launch the binary, run CreepJS fully offline, return its headless result."""
    _EV = """() => {
      const f = window.Fingerprint;
      if (!f || !f.headless) return { ready: false };
      const h = f.headless;
      return {
        ready: true,
        headlessRating: h.headlessRating,
        stealthRating: h.stealthRating,
        likeHeadlessRating: h.likeHeadlessRating,
        headless: h.headless || {},
        stealth: h.stealth || {},
        totalLies: (f.lies && f.lies.totalLies) || 0,
      };
    }"""
    with InvisiblePlaywright(seed=42, binary_path=firefox_binary) as browser:
        page = browser.new_page()
        # truly offline: abort every non-loopback request (CreepJS's optional
        # crowd-comparison POST to arh.antoinevastel.com never runs).
        page.route(
            "**/*",
            lambda r: r.abort() if "127.0.0.1" not in r.request.url else r.continue_(),
        )
        page.goto(creep_url, wait_until="domcontentloaded", timeout=45000)
        page.wait_for_function(
            "() => !!(window.Fingerprint && window.Fingerprint.headless)",
            timeout=60000,
        )
        return page.evaluate(_EV)
@pytest.mark.e2e
 def test_botd_no_detector_flags_automation(firefox_binary, detector_site):
    """The real BotD must not flag the build — aggregate AND every one of its
-    individual detectors (webDriver/userAgent/appVersion/plugins/process/... ).
+    individual detectors (webDriver/userAgent/appVersion/plugins/process/...)."""
-    """
+    botd, _fp, _fps, err = _run_detectors(firefox_binary, detector_site.url)
    botd, _fp, err = _run_detectors(firefox_binary, detector_site.url)
    assert botd is not None, f"BotD produced no result (err:{err!r})"
    assert botd.get("bot") is False, (
        f"BotD aggregate flagged a bot: botKind={botd.get('botKind')!r}"
@@ -146,8 +224,8 @@ def test_fingerprintjs_visitorid_stable_across_launches(firefox_binary, detector
    """FingerprintJS visitorId must be present and identical across two fresh
    launches with the same seed — a real browser is stable; an over-randomized
    spoof drifts (and a drifting fingerprint is itself a bot tell)."""
-    _b1, fp1, err1 = _run_detectors(firefox_binary, detector_site.url)
+    _b1, fp1, _f1, err1 = _run_detectors(firefox_binary, detector_site.url)
-    _b2, fp2, err2 = _run_detectors(firefox_binary, detector_site.url)
+    _b2, fp2, _f2, err2 = _run_detectors(firefox_binary, detector_site.url)
    assert fp1 and fp1.get("visitorId"), f"no visitorId on run 1 (err:{err1!r})"
    assert fp2 and fp2.get("visitorId"), f"no visitorId on run 2 (err:{err2!r})"
    assert fp1["visitorId"] == fp2["visitorId"], (
@@ -159,13 +237,58 @@ def test_fingerprintjs_visitorid_stable_across_launches(firefox_binary, detector
@pytest.mark.e2e
 def test_fingerprintjs_collects_rich_fingerprint(firefox_binary, detector_site):
    """FingerprintJS must collect a RICH component surface (a real browser
-    exposes many signals; a stripped/blocked surface is itself suspicious).
+    exposes many signals; a stripped/blocked surface is itself suspicious)."""
-    We don't assert zero errored components (some are legitimately unsupported
+    _b, fp, _f, err = _run_detectors(firefox_binary, detector_site.url)
    per browser), only that the surface is substantial and the id computed."""
    _b, fp, err = _run_detectors(firefox_binary, detector_site.url)
    assert fp and fp.get("visitorId"), f"FingerprintJS produced no id (err:{err!r})"
    keys = fp.get("componentKeys") or []
    assert len(keys) >= 15, (
        f"FingerprintJS collected only {len(keys)} components — surface too thin "
        f"(suppressed signals are themselves a tell): {keys}"
    )
@pytest.mark.e2e
 def test_fpscanner_no_automation_rules(firefox_binary, detector_site):
    """fpscanner's engine-agnostic bot rules (webdriver/selenium/bot-UA/platform/
    timezone/language) must all be clean. The Chrome/GPU-only rules are ignored
    on purpose (see module docstring) — they false-red on a software-WebGL host."""
    _b, _fp, fps, err = _run_detectors(firefox_binary, detector_site.url)
    assert fps is not None, f"fpscanner produced no result (err:{err!r})"
    details = fps.get("details") or {}
    assert details, f"fpscanner returned no detection details (err:{err!r})"
    flagged = [
        k for k in _FPSCANNER_AGNOSTIC
        if details.get(k) and details[k].get("detected")
    ]
    assert not flagged, (
        f"fpscanner flagged automation on engine-agnostic rules: {flagged} "
        f"(full details: { {k: v for k, v in details.items() if v.get('detected')} })"
    )
@pytest.mark.e2e
 def test_creepjs_headless_and_proxy_clean(firefox_binary, detector_site):
    """CreepJS (Firefox-aware) must see no headless tell and no JS-proxy stealth
    tell. ``headlessRating`` aggregates webDriverIsOn + headless-UA checks (all
    GPU-independent). The proxy/runtime stealth sub-signals (hasIframeProxy,
    hasToStringProxy, hasBadChromeRuntime) must be false — a spoof implemented
    with a JS Proxy is exactly what CreepJS catches. stealthRating/totalLies/
    likeHeadlessRating are GPU/theme-sensitive, so we log them, not assert."""
    r = _run_creepjs(firefox_binary, detector_site.creep_url)
    assert r and r.get("ready"), f"CreepJS never populated window.Fingerprint: {r!r}"
    print(
        f"[creepjs] headlessRating={r['headlessRating']} stealthRating={r['stealthRating']} "
        f"likeHeadlessRating={r['likeHeadlessRating']} totalLies={r['totalLies']} "
        f"headless={r['headless']} stealth={r['stealth']}"
    )
    assert r["headlessRating"] == 0, (
        f"CreepJS headless tells fired: headless={r['headless']} "
        f"(headlessRating={r['headlessRating']})"
    )
    stealth = r.get("stealth") or {}
    proxy_tells = {
        k: stealth.get(k)
        for k in ("hasIframeProxy", "hasToStringProxy", "hasBadChromeRuntime")
        if stealth.get(k)
    }
    assert not proxy_tells, f"CreepJS JS-proxy stealth tells fired: {proxy_tells}"
@@ -27,6 +27,7 @@ Run only this file:
 from __future__ import annotations
 import re
 import sys
 import pytest
@@ -296,6 +297,12 @@ def test_webgl_readpixels_no_masking_signature(page):
    ~300+ 'spikes' and pixelscan flagged it as masking; the gamma remap leaves
    the gradient smooth (~0 spikes). Regression guard for the gamma fix."""
    res = _ev(page, _WEBGL_MASKING_PROBE)
    if res.get("error") == "no-webgl" and sys.platform == "darwin":
        pytest.skip(
            "macOS CI runners expose no WebGL (no software-GL fallback); the gamma "
            "readPixels remap is platform-agnostic C++ and is exercised by the Linux "
            "(Xvfb/llvmpipe) and Windows (WARP) gates."
        )
    assert "error" not in res, f"WebGL probe failed: {res}"
    # genuine / gamma -> ~0; the rejected +-1 algorithm produced ~320.
    assert res["spikes"] < 30, (
@@ -289,8 +289,10 @@ def test_windows_virtual_display_with_socks_proxy(monkeypatch):
    assert prefs["security.sandbox.gpu.level"] == 0  # virtual_display branch
    assert prefs["network.proxy.type"] == 1          # SOCKS branch
    assert prefs["network.proxy.socks"] == "127.0.0.1"
-    # Windows still has the renderer cleared.
+    # Windows exposes a validated persona renderer (calibrated clean bucket),
-    assert prefs["zoom.stealth.webgl.renderer"] == ""
+    # not empty/native — see _webgl_personas.
    assert prefs["zoom.stealth.webgl.renderer"].startswith("ANGLE (")
    assert prefs["zoom.stealth.webgl.renderer"].rstrip().endswith(", D3D11)")
 # ──────────────────────────────────────────────────────────────────────
@@ -132,12 +132,9 @@ def test_mouse_move_outside_viewport_does_not_raise(firefox_binary):
 # ────────────────────────────────────────────────────────────────────
-@pytest.mark.e2e
+def _humanize_move_count(firefox_binary, humanize):
-def test_humanize_emits_intermediate_moves(firefox_binary):
+    """Count page mousemove events fired by ONE long mouse.move."""
-    """A long mouse.move from one corner to another should fire several
+    with InvisiblePlaywright(seed=42, binary_path=firefox_binary, humanize=humanize) as browser:
    mousemove events on the page when the humanize hook is enabled (which
    is the StealthFox default)."""
    with InvisiblePlaywright(seed=42, binary_path=firefox_binary) as browser:
        page = browser.new_page()
        page.goto(_data_url(
            "<div id=d style='width:600px;height:400px' "
@@ -146,8 +143,27 @@ def test_humanize_emits_intermediate_moves(firefox_binary):
        page.mouse.move(10, 10)
        page.evaluate("window.__n = 0")
        page.mouse.move(500, 300)
-        moves = page.evaluate("window.__n")
+        return page.evaluate("window.__n")
-        assert moves >= 1, f"expected at least 1 mousemove event, got {moves}"
+
@pytest.mark.e2e
 def test_humanize_emits_intermediate_moves(firefox_binary):
    """A long mouse.move must expand into MANY intermediate mousemove events when
    humanize is on (Bezier), and ~1 (a teleport) when off. We assert the on/off
    CONTRAST: `moves >= 1` alone was a false-green — a teleport already fires 1 —
    and that false-green hid a pref-namespace bug (wrapper wrote
    `invisible_playwright.humanize`, the binary's Juggler reads `stealthfox.humanize`)
    that left humanize silently dead in production. This test now fails if the
    pref ever stops reaching the binary."""
    on = _humanize_move_count(firefox_binary, True)
    off = _humanize_move_count(firefox_binary, False)
    assert off <= 2, f"humanize OFF should ~teleport (<=2 moves), got {off}"
    assert on >= 4, (
        f"humanize ON must expand into many intermediate moves (Bezier); got {on} "
        f"(off={off}). moves==1 means the cursor teleports — the exact automation "
        f"tell humanize exists to remove, and a sign the stealthfox.* pref isn't "
        f"reaching the binary's Juggler."
    )
 # ────────────────────────────────────────────────────────────────────
@@ -185,8 +201,11 @@ def test_hover_triggers_mouseenter(firefox_binary):
        # Wait for the event rather than reading immediately: under load / on a
        # virtual display the mouseenter can land a beat after hover() returns,
        # which made an instant read flaky. wait_for_function still fails (times
-        # out) if mouseenter genuinely never fires.
+        # out) if mouseenter genuinely never fires. Timeout is generous (10s) so a
-        page.wait_for_function("() => window.__h === true", timeout=5000)
+        # busy full-suite run — where browser startup + CPU contention can push
        # the event past a tight 5s window — doesn't flake; the event itself fires
        # in well under a second when run in isolation.
        page.wait_for_function("() => window.__h === true", timeout=10_000)
 # ────────────────────────────────────────────────────────────────────
@@ -15,12 +15,18 @@ from invisible_playwright.prefs import (
@pytest.mark.unit
 def test_translate_includes_gpu_renderer_windows(monkeypatch):
-    """On Windows, renderer/vendor are cleared so ANGLE reports native hardware."""
+    """On Windows we falsify the GPU to one of the calibrated CLEAN buckets (FP Pro
    tampering_ml<=0.5 on every seed; sweep 2026-06-14). Only Radeon R9 200 Series and
    Intel Arc A750 ship — every NVIDIA/iGPU/945 bucket is penalized. See _webgl_personas."""
    monkeypatch.setattr(sys, "platform", "win32")
    _CLEAN = {
        "ANGLE (AMD, AMD Radeon R9 200 Series Direct3D11 vs_5_0 ps_5_0, D3D11)",
        "ANGLE (Intel, Intel(R) Arc(TM) A750 Graphics Direct3D11 vs_5_0 ps_5_0, D3D11)",
    }
    p = generate_profile(seed=42)
    prefs = translate_profile_to_prefs(p)
-    assert prefs["zoom.stealth.webgl.renderer"] == ""
+    assert prefs["zoom.stealth.webgl.renderer"] in _CLEAN
-    assert prefs["zoom.stealth.webgl.vendor"] == ""
+    assert prefs["zoom.stealth.webgl.vendor"] in {"Google Inc. (AMD)", "Google Inc. (Intel)"}
@pytest.mark.unit
@@ -82,10 +88,15 @@ def test_accept_language_underscore_normalized():
@pytest.mark.unit
-def test_font_metrics_windows_returns_empty(monkeypatch):
+def test_font_metrics_windows_applies_named_factors(monkeypatch):
-    # FM2: Windows never applies width-scale factors.
+    # FM2: Windows/mac apply the per-NAMED-font factors (so whitelisted named
    # families don't collapse to the list-head width on the canvas measureText
    # path), but WITHOUT the Linux generic-family compensation (generics bypass
    # the whitelist and render native there).
    monkeypatch.setattr(sys, "platform", "win32")
-    assert _font_metrics_for_platform("Arial|1.0,Verdana|0.9,") == ""
+    out = _font_metrics_for_platform("Arial|1.0,Verdana|0.9,")
    assert out == "Arial|1.0,Verdana|0.9,"
    assert "sans-serif|" not in out  # no generic compensation on Windows
@pytest.mark.unit
@@ -100,13 +111,14 @@ def test_font_metrics_empty_input_returns_empty():
@pytest.mark.unit
-def test_gpu_renderer_empty_on_windows(monkeypatch):
+def test_gpu_renderer_persona_on_windows(monkeypatch):
-    # PG2
+    # PG2: Windows exposes a validated persona renderer (well-formed ANGLE bucket, NOT empty/native).
    monkeypatch.setattr(sys, "platform", "win32")
    p = generate_profile(seed=42)
    prefs = translate_profile_to_prefs(p)
-    assert prefs["zoom.stealth.webgl.renderer"] == ""
+    r = prefs["zoom.stealth.webgl.renderer"]
-    assert prefs["zoom.stealth.webgl.vendor"] == ""
+    assert r and r.startswith("ANGLE (") and r.rstrip().endswith(", D3D11)")
    assert prefs["zoom.stealth.webgl.vendor"].startswith("Google Inc. (")
@pytest.mark.unit
@@ -143,13 +155,16 @@ def test_canvas_noise_mask_windows_uses_intel_path(monkeypatch):
@pytest.mark.unit
-def test_webgl_extensions_cleared_on_windows(monkeypatch):
+def test_webgl_extensions_persona_on_windows(monkeypatch):
-    # WE2
+    # WE2: with a persona active on Windows, extensions are FORCED to the persona's native-order
    # list (host-independent), NOT cleared. Order is load-bearing (must match the persona verbatim).
    monkeypatch.setattr(sys, "platform", "win32")
    from invisible_playwright._webgl_personas import select_persona
    p = generate_profile(seed=42)
    prefs = translate_profile_to_prefs(p)
-    assert prefs["zoom.stealth.webgl.extensions"] == ""
+    persona = select_persona(42)
-    assert prefs["zoom.stealth.webgl2.extensions"] == ""
+    assert prefs["zoom.stealth.webgl.extensions"] == persona["ext1"]
    assert prefs["zoom.stealth.webgl2.extensions"] == persona["ext2"]
 # ──────────────────────────────────────────────────────────────────────
@@ -45,13 +45,28 @@ def test_classify_gpu_intel_hd_old_buckets(renderer):
    "ANGLE (Intel, Intel(R) HD Graphics 530 Direct3D11)",
    "ANGLE (Intel, Intel(R) UHD Graphics 630 Direct3D11)",
    "ANGLE (Intel, Intel(R) Iris Xe Graphics Direct3D11)",
-    "ANGLE (Intel, Intel(R) Arc A750 Direct3D11)",
+    # Integrated Arc iGPUs (Core Ultra "Arc 130T/140T/Graphics") stay integrated_modern.
    "ANGLE (Intel, Intel(R) Arc(TM) 140T GPU Direct3D11)",
 ])
 def test_classify_gpu_intel_modern(renderer):
-    """CG4-CG7 [DT]: modern Intel HD/UHD/Iris/Arc → integrated_modern."""
+    """CG4-CG7 [DT]: modern Intel HD/UHD/Iris + integrated Arc → integrated_modern."""
    assert classify_gpu(_gpu(renderer)) == "integrated_modern"
@pytest.mark.unit
@pytest.mark.parametrize("renderer,expected", [
    # Discrete Intel Arc DESKTOP cards are NOT integrated: A5xx/A7xx/Bxxx ~ mid-range
    # discrete (RTX 3060 tier); A3xx are entry discrete → low_end.
    ("ANGLE (Intel, Intel(R) Arc(TM) A750 Graphics Direct3D11 vs_5_0 ps_5_0)", "mid_range"),
    ("ANGLE (Intel, Intel(R) Arc(TM) A770 Graphics Direct3D11)", "mid_range"),
    ("ANGLE (Intel, Intel(R) Arc(TM) B580 Graphics Direct3D11)", "mid_range"),
    ("ANGLE (Intel, Intel(R) Arc(TM) A380 Graphics Direct3D11)", "low_end"),
 ])
 def test_classify_gpu_intel_arc_discrete(renderer, expected):
    """Discrete Intel Arc desktop SKUs map to a discrete-GPU class, not integrated."""
    assert classify_gpu(_gpu(renderer)) == expected
@pytest.mark.unit
@pytest.mark.parametrize("renderer", [
    "ANGLE (AMD, AMD Radeon Graphics Direct3D11)",
@@ -29,8 +29,8 @@ def test_get_default_stealth_prefs_random_seed_returns_dict():
    assert isinstance(prefs, dict)
    assert len(prefs) > 0
    # humanize toggle is always set explicitly
-    assert "invisible_playwright.humanize" in prefs
+    assert "stealthfox.humanize" in prefs
-    assert prefs["invisible_playwright.humanize"] is True
+    assert prefs["stealthfox.humanize"] is True
 def test_get_default_stealth_prefs_seed_is_deterministic():
@@ -50,22 +50,22 @@ def test_get_default_stealth_prefs_different_seeds_differ():
 def test_humanize_false_disables_prefs():
    """humanize=False removes the maxTime knob and flips the toggle to False."""
    prefs = get_default_stealth_prefs(seed=42, humanize=False)
-    assert prefs["invisible_playwright.humanize"] is False
+    assert prefs["stealthfox.humanize"] is False
-    assert "invisible_playwright.humanize.maxTime" not in prefs
+    assert "stealthfox.humanize.maxTime" not in prefs
 def test_humanize_default_sets_max_time_1_5():
    """humanize=True -> default maxTime is 1.5s, stored as string."""
    prefs = get_default_stealth_prefs(seed=42, humanize=True)
-    assert prefs["invisible_playwright.humanize"] is True
+    assert prefs["stealthfox.humanize"] is True
-    assert prefs["invisible_playwright.humanize.maxTime"] == "1.5"
+    assert prefs["stealthfox.humanize.maxTime"] == "1.5"
 def test_humanize_float_overrides_max_time():
    """Float for humanize is the explicit cap in seconds."""
    prefs = get_default_stealth_prefs(seed=42, humanize=3.0)
-    assert prefs["invisible_playwright.humanize"] is True
+    assert prefs["stealthfox.humanize"] is True
-    assert prefs["invisible_playwright.humanize.maxTime"] == "3.0"
+    assert prefs["stealthfox.humanize.maxTime"] == "3.0"
 def test_extra_prefs_overlay_takes_precedence():
@@ -6,13 +6,19 @@ on a dev box and on a GitHub runner (no external CDN at test time — Firefox
 tracking-protection blocks the openfpcdn.io CDN anyway, and we want CI offline).
 They are served from a localhost HTTP server and loaded into the patched Firefox;
-the tests assert the REAL detectors don't flag the stealth build (BotD: `bot===false`)
+the tests assert the REAL detectors don't flag the stealth build (BotD: `bot===false`;
-and that the fingerprint is stable (FingerprintJS: same `visitorId` across launches).
+fpscanner: engine-agnostic rules clean; CreepJS: `headlessRating===0` + no JS-proxy
 tells) and that the fingerprint is stable (FingerprintJS: same `visitorId` across
 launches). CreepJS runs fully offline — the tests abort every non-loopback request,
 so its optional crowd-comparison POST never fires and the verdict is computed locally.
 | File | Package | Version | Source | License |
 |---|---|---|---|---|
 | `botd-2.0.0.esm.js` | `@fingerprintjs/botd` | 2.0.0 | https://cdn.jsdelivr.net/npm/@fingerprintjs/botd@2.0.0/dist/botd.esm.js | MIT |
 | `fingerprintjs-5.2.0.umd.min.js` | `@fingerprintjs/fingerprintjs` | 5.2.0 | https://cdn.jsdelivr.net/npm/@fingerprintjs/fingerprintjs@5.2.0/dist/fp.umd.min.js | MIT |
 | `fpscanner-1.0.6.es.js` | `fpscanner` | 1.0.6 | https://cdn.jsdelivr.net/npm/fpscanner@1.0.6/dist/fpScanner.es.js | MIT |
 | `creepjs-10aa672.js` | `abrahamjuliot/creepjs` | git `10aa6724` | https://raw.githubusercontent.com/abrahamjuliot/creepjs/10aa6724cd33a1015db1574211890518cd04f0cc/docs/creep.js | MIT |
-Both are MIT (Copyright © FingerprintJS, Inc.). To update: download the pinned
+All MIT (FingerprintJS Inc. / Antoine Vastel / Abraham Juliot). To update: download
-dist from jsdelivr, drop it here, and bump the version in the filename + this table.
+the pinned dist (jsdelivr for npm packages, raw.githubusercontent for CreepJS at a
 commit SHA), drop it here, and bump the version in the filename + this table.
Author	SHA1	Message	Date
freedakgmail	ba2a67290d	examples: add zhaopin AI job scrapers + SQLite analysis e2e / e2e (linux, xvfb) (push) Waiting to run Details tests / pytest (ubuntu-latest, py3.11) (push) Waiting to run Details tests / pytest (ubuntu-latest, py3.12) (push) Waiting to run Details tests / pytest (windows-latest, py3.11) (push) Waiting to run Details tests / pytest (windows-latest, py3.12) (push) Waiting to run Details - scrape_zhaopin_ai.py: lightweight no-login list scraper - scrape_zhaopin_full.py: SQLite storage, resumable crawl, detail-page JD - analyze_zhaopin.py: stats by city/salary/education/experience/skills - scrape_zhipin_ai.py: BOSS Zhipin variant (login-based, fallback) - gitignore scraper data artifacts and browser profile	2026-06-14 23:18:51 +08:00
feder-cr	29262a644e	webgl: ship only the GPU buckets that pass tampering_ml + decouple render-noise seed Cut the per-seed WebGL persona to the two renderer buckets that score clean on FP Pro tampering_ml across seeds (AMD Radeon R9 200 Series and Intel Arc A750), weighted 70/30, cross-vendor so the fleet isn't one fixed GPU. Every NVIDIA bucket and the integrated/ancient Intel buckets are penalised, so they're out. The canvas/WebGL render-image hash turned out to be the dominant tampering_ml driver, not the attributes, so the render-noise seed (zoom.stealth.fpp.hw_seed) is now decoupled from the identity seed and drawn from a calibrated clean pool. Per-seed determinism and per-user diversity are preserved. Also in this change: - audio maxChannelCount is stereo-dominant per class (it reflects the output device, not the GPU; the old tables over-emitted 5.1/7.1 surround) - route discrete Intel Arc desktop cards to a discrete-GPU class (not integrated) - condition the whole sampled profile on the exposed GPU class via the sampler's evidence path, so cores/screen/storage stay coherent with the declared GPU - apply per-named-font width factors on Windows/macOS so canvas measureText widths don't collapse to a single value 12/12 seeds clean on tampering_ml (worst 0.29), bot and anti-detect negative, and the fingerprint stays identical across repeated runs of the same seed.	2026-06-14 11:53:33 +02:00
feder-cr	2dfa4e7bd7	fix: match stock Firefox TLS ClientHello (drop cipher 0xC009) The Playwright/Juggler Firefox build re-enables cipher 0xC009 (TLS_ECDHE_ECDSA_WITH_AES_128_CBC_SHA), which retail Firefox 150 does not offer. That extra (17th) cipher shifted our JA3/JA4 off every real Firefox (ja4 t13d1717h2 / ja3 6f7889b9 vs stock t13d1617h2 / 6447ab08) — a TLS fingerprint that matches no real browser, which is itself a consistency tell. Set security.ssl3.ecdhe_ecdsa_aes_128_sha=false in _BASELINE so JA3/JA4/peetprint are byte-identical to retail FF150 (verified on tls.peet.ws). Non-breaking: stock Firefox ships without 0xC009 and works on the whole web.	2026-06-12 17:40:48 +02:00
feder-cr	b34ecf2a21	fix: humanize pref namespace + async headless cloak humanize: the wrapper wrote invisible_playwright.humanize[.maxTime], but the binary's Juggler reads stealthfox.humanize (PageHandler.js gates the Bezier mouse path on it). The old name was a dead no-op, so humanize never fired and every mouse.move teleported the cursor — an automation tell. Renamed across config.py, launcher.py and async_api.py; the mouse test now asserts the on/off contrast instead of a false-green moves>=1. headless (async): InvisiblePlaywright(headless=True) crashed on Windows/macOS. _resolve_headless called make_virtual_display().start() unconditionally, but on Win/macOS that returns None (the binary self-cloaks via DWMWA_CLOAK; only Linux spawns Xvfb), so it died with AttributeError. It also never injected cloak_prefs(), so the window wouldn't have hidden anyway. Mirror the sync launcher: guard `if vd is not None` + inject cloak_prefs() when headless on win32/darwin. Verified on FF150: headless=True loads, exits clean, window fully hidden (no MainWindowHandle / no taskbar entry).	2026-06-12 17:31:31 +02:00
feder-cr	090baa6155	test: add fpscanner + CreepJS to the offline real-detector gate Alongside BotD and FingerprintJS, the detector e2e now runs two more FOSS client-side detectors against the binary, vendored and served from localhost so it stays offline on CI. fpscanner: assert its engine-agnostic bot rules (webdriver/ selenium/bot-UA/platform/timezone/language) are clean. CreepJS (the Firefox-aware one): runs fully offline via window.Fingerprint with every non-loopback request aborted, asserting headlessRating==0 and no JS-proxy stealth tell. The Chrome-only and GPU-sensitive signals are logged, not asserted, so a software-WebGL CI host doesn't false-red. Validated against firefox-10: full e2e 130 green.	2026-06-11 20:19:19 +02:00
feder-cr	3f2834d8c2	ci: auto-generate release notes from the invisible_firefox commits The publish job used a fixed body that still read 'DRAFT - do not publish' on the live release and listed none of the actual changes. Now the body is built from the source commits that went into the binary: the build records which invisible_firefox commit it came from (source-commit.txt), and publish diffs that against the previous release's recorded commit via the GitHub compare API (no deep clone, no cross-repo token) to list the user-facing subjects. docs/chore/ci/test commits are filtered out, and the body ends with 'Built from invisible_firefox @<sha>' for traceability. It's still a draft - the realness gate and the un-draft flip stay manual (issue #14).	2026-06-11 19:14:45 +02:00
feder-cr	b3608771ed	release: pin the wrapper to firefox-10 firefox-10 is published: the in-binary headless window cloak (Windows DWMWA_CLOAK, macOS NSWindow alpha-0, Linux keeps Xvfb) and the WebGL readPixels gamma remap that clears pixelscan's masking flag. Validated against the built binary — validate_release (linux+win), full e2e (128 passed), fppro ALL CRITICAL CLEAN, consistency PASS, and the cloak gate green on all 5 targets (macOS via CGWindowAlpha).	2026-06-11 18:29:07 +02:00
feder-cr	c9cc0f1743	test(mouse): widen hover->mouseenter wait to 10s (full-suite load flake) test_hover_triggers_mouseenter timed out at the old 5s wait when run as part of the full e2e suite — browser startup + CPU contention occasionally push the mouseenter past the window. In isolation the event fires in well under a second (5/5), so this is load-sensitivity, not a real regression. A 10s wait absorbs it while still failing fast if mouseenter genuinely never fires.	2026-06-11 18:08:11 +02:00
feder-cr	a950537f0a	ci: macOS gate tolerates the runner's missing WebGL; add verify-cloak The firefox-10 build gated green on all 5 targets but both macOS gate legs failed. The cloak/webgl guards hard-required a live WebGL context, and macOS GitHub runners expose none in the CI session (no software-GL fallback, unlike Linux llvmpipe and Windows WARP). The cloak renders fine there anyway, which the non-blank screenshot proves, so on the mac legs the WebGL-present check now self-skips and the cocoa cloak is validated via the screenshot plus CGWindowAlpha. The gamma masking guard skips on mac too (platform-agnostic C++, covered on Linux and Windows). verify-cloak.yml re-runs these guards against a prior build run's artifacts with no rebuild, so a test-only fix like this is validated against the real binaries in minutes instead of a 3h rebuild.	2026-06-11 17:18:02 +02:00
feder-cr	d4db15d37b	ci: install the [dev] extra (pytest) in the release cloak/webgl guard step The gate runner only had Playwright; `pip install -e .` doesn't pull pytest (a dev dep), so `python -m pytest` failed with "No module named pytest". Install ".[dev]" like e2e.yml does.	2026-06-11 14:29:31 +02:00