Skip to content

Actions: datajuicer/data-juicer

Actions

Deploy Sphinx documentation to Pages

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
291 workflow runs
291 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

feat(agent): interaction quality ops & recipe, bad-case HTML report, …
Deploy Sphinx documentation to Pages #489: Commit 4472789 pushed by yxdyc
22m 29s main
In PyArrow 20.0.0+, when using open_json to read data in batches, an …
Deploy Sphinx documentation to Pages #488: Commit 003e2a8 pushed by HYLcool
30m 43s main
feat(semantic_ops): MVP extract + condition filter; join/agg/top-k pl…
Deploy Sphinx documentation to Pages #487: Commit 5e64e8b pushed by HYLcool
29m 51s main
fix(service): propagate text_keys to ops via get_init_configs in _set…
Deploy Sphinx documentation to Pages #486: Commit 4698741 pushed by cmgzn
30m 28s main
feat: add DocumentLineDeduplicator for cross-document line-level dedu…
Deploy Sphinx documentation to Pages #485: Commit f9f64ce pushed by HYLcool
29m 44s main
[Env] Reduce the size of default deps to speed up the installation. (…
Deploy Sphinx documentation to Pages #484: Commit 2bc2774 pushed by yxdyc
31m 4s main
hotfix: optimize model_utils.py sampling params and API client initia…
Deploy Sphinx documentation to Pages #483: Commit 0ca465c pushed by HYLcool
21m 49s main
fix: use broader substring match for aesthetics-predictor normalizati…
Deploy Sphinx documentation to Pages #482: Commit b8ea51a pushed by HYLcool
22m 10s main
fix: add fallback for temp dir removal in RayExecutor (#943)
Deploy Sphinx documentation to Pages #481: Commit b37020b pushed by HYLcool
21m 59s main
Release/v1.5.1 (#939)
Deploy Sphinx documentation to Pages #480: Commit 11c7679 pushed by HYLcool
21m 3s v1.5.1
Release/v1.5.1 (#939)
Deploy Sphinx documentation to Pages #479: Commit 11c7679 pushed by HYLcool
23m 22s main
feat: Enhance op_search with BM25/Regex retrieval & upgrade MCP serve…
Deploy Sphinx documentation to Pages #478: Commit 006ed6c pushed by cmgzn
19m 57s main
docs: add cache, export, and tracing docs (#935)
Deploy Sphinx documentation to Pages #477: Commit c5a116a pushed by HYLcool
19m 55s main
feat: add latex_figure_context_extractor_mapper operator (#923)
Deploy Sphinx documentation to Pages #476: Commit 2a2249f pushed by yxdyc
19m 48s main
Add support for json[l].gz, and make ray dataset support reading json…
Deploy Sphinx documentation to Pages #475: Commit 53322ba pushed by HYLcool
19m 49s main
feat(mapper): add LatexMergeTexMapper to extract and merge .tex files…
Deploy Sphinx documentation to Pages #474: Commit 22331f3 pushed by HYLcool
19m 27s main
fix(ops): fix NlpaugEnMapper only augmenting first sample in batch (#…
Deploy Sphinx documentation to Pages #473: Commit bdb5662 pushed by HYLcool
19m 10s main
fix(ops): prevent shared mutable _default_kwargs pollution across ope…
Deploy Sphinx documentation to Pages #472: Commit 55124b1 pushed by HYLcool
20m 19s main
perf: optimize TokenNumFilter with batch tokenization (#929)
Deploy Sphinx documentation to Pages #471: Commit b56c124 pushed by HYLcool
17m 38s main
Deploy Sphinx documentation to Pages
Deploy Sphinx documentation to Pages #470: by HYLcool
19m 49s main
feat(mapper): add custom tokenizer support to RemoveRepeatSentencesMa…
Deploy Sphinx documentation to Pages #469: Commit ef4bbac pushed by HYLcool
19m 25s main
perf: cache redundant sum() calls in repetition filters (#924)
Deploy Sphinx documentation to Pages #468: Commit 2c4bc60 pushed by HYLcool
19m 18s main
feat(config): add load_dataset_kwargs for passing extra args to datas…
Deploy Sphinx documentation to Pages #467: Commit 3152ad3 pushed by HYLcool
19m 13s main
fix: correct cache key in ImageFaceCountFilter (#921)
Deploy Sphinx documentation to Pages #466: Commit fa4d7b2 pushed by yxdyc
19m 33s main
Release v1.5.0 (#918)
Deploy Sphinx documentation to Pages #465: Commit 2e62d2a pushed by yxdyc
19m 39s v1.5.0