Regrouping pages on model tuning in the DataOps User Guide by emassoulie · Pull Request #2014 · skrub-data/skrub

emassoulie · 2026-04-02T10:25:35Z

The five pages on validation and tuning at the end of the DataOps section of the User Guide have been fused into three main parts:

Validating a DataOps model
Hyperparameter tuning
Going further with Optuna

These parts have examples that should be shortened (or made into "example" pages), and subsection titles that have been made more explicit.

emassoulie · 2026-04-02T10:26:29Z

Marked as draft because, in addition to the restructuring, the pages also need to be trimmed a little (with the larger examples perhaps moved to their own section).

rcap107

Thanks for the PR @emassoulie, overall I think it's an improvement over the current documentation. I left a couple of comments where I was not convinced by the wording, but aside from that I think we can merge this soon.

rcap107 · 2026-04-14T11:43:36Z


-Here are the different kinds of choices, along with their default outcome when
-we are not using hyperparameter search:
+Skrub provides over 10 different ``choose`` methods for tuning use cases, all detailed


I wouldn't say there are 10 "different" choose methods: the choose methods are 4 (from, int, float and bool), but they can be used in different ways

rcap107 · 2026-04-14T11:46:29Z


-Splitting the data in train and test sets
-=========================================
+More advanced train/test splitting


I think the original title should be kept here: this section isn't about a more advanced way of defining train and test splits, we really are just splitting the data in two

First commit: rearranged page contents and subsection titles

2cd3f78

rcap107 reviewed Apr 14, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Regrouping pages on model tuning in the DataOps User Guide#2014

Regrouping pages on model tuning in the DataOps User Guide#2014
emassoulie wants to merge 1 commit intomainfrom
issue-2003-regroup-dataops-hyperparameter-pages

emassoulie commented Apr 2, 2026

Uh oh!

emassoulie commented Apr 2, 2026

Uh oh!

rcap107 left a comment

Uh oh!

rcap107 Apr 14, 2026

Uh oh!

rcap107 Apr 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

emassoulie commented Apr 2, 2026

Uh oh!

emassoulie commented Apr 2, 2026

Uh oh!

rcap107 left a comment

Choose a reason for hiding this comment

Uh oh!

rcap107 Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

rcap107 Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants