Package: smallsets 2.0.0

smallsets: Visual Documentation for Data Preprocessing

Data practitioners regularly use the 'R' and 'Python' programming languages to prepare data for analyses. Thus, they encode important data preprocessing decisions in 'R' and 'Python' code. The 'smallsets' package subsequently decodes these decisions into a Smallset Timeline, a static, compact visualisation of data preprocessing decisions (Lucchesi et al. (2022) <doi:10.1145/3531146.3533175>). The visualisation consists of small data snapshots of different preprocessing steps. The 'smallsets' package builds this visualisation from a user's dataset and preprocessing code located in an 'R', 'R Markdown', 'Python', or 'Jupyter Notebook' file. Users simply add structured comments with snapshot instructions to the preprocessing code. One optional feature in 'smallsets' requires installation of the 'Gurobi' optimisation software and 'gurobi' 'R' package, available from <https://www.gurobi.com>. More information regarding the optional feature and 'gurobi' installation can be found in the 'smallsets' vignette.

Authors:Lydia R. Lucchesi [aut, cre], Petra M. Kuhnert [ths], Jenny L. Davis [ths], Lexing Xie [ths]

smallsets_2.0.0.tar.gz
smallsets_2.0.0.zip(r-4.5)smallsets_2.0.0.zip(r-4.4)smallsets_2.0.0.zip(r-4.3)
smallsets_2.0.0.tgz(r-4.5-any)smallsets_2.0.0.tgz(r-4.4-any)smallsets_2.0.0.tgz(r-4.3-any)
smallsets_2.0.0.tar.gz(r-4.5-noble)smallsets_2.0.0.tar.gz(r-4.4-noble)
smallsets_2.0.0.tgz(r-4.4-emscripten)smallsets_2.0.0.tgz(r-4.3-emscripten)
smallsets.pdf |smallsets.html
smallsets/json (API)
NEWS

# Install 'smallsets' in R:
install.packages('smallsets', repos = c('https://lydialucchesi.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/lydialucchesi/smallsets/issues

Pkgdown site:https://lydialucchesi.github.io

Datasets:

On CRAN:

Conda:

data-sciencedata-visualizationdocumentation-toolmachine-learningpreprocessingpythonvisualization-tools

5.19 score 14 stars 11 scripts 194 downloads 4 exports 86 dependencies

Last updated 2 months agofrom:9dc8bedaa7. Checks:4 OK, 5 NOTE. Indexed: yes.

TargetResultLatest binary
Doc / VignettesOKMar 24 2025
R-4.5-winOKMar 24 2025
R-4.5-macOKMar 24 2025
R-4.5-linuxOKMar 24 2025
R-4.4-winNOTEMar 24 2025
R-4.4-macNOTEMar 24 2025
R-4.4-linuxNOTEMar 24 2025
R-4.3-winNOTEMar 24 2025
R-4.3-macNOTEMar 24 2025

Exports:sets_labellingsets_sizingsets_spacingSmallset_Timeline

Dependencies:askpassbase64encbslibcachemcallrclicolorspacecommonmarkcpp11curldata.tabledigestevaluatefansifarverfastmapflextablefontawesomefontBitstreamVerafontLiberationfontquiverfsgdtoolsggplot2ggtextgluegridtextgtableherehighrhtmltoolsisobandjpegjquerylibjsonliteknitrlabelinglatticelifecyclelitedownmagrittrmarkdownMASSMatrixmemoisemgcvmimemunsellnlmeofficeropensslpatchworkpillarpkgconfigplotrixpngprocessxpsR6raggrappdirsRColorBrewerRcppRcppTOMLreticulaterlangrmarkdownrprojrootsassscalesstringistringrsyssystemfontstextshapingtibbletinytexutf8uuidvctrsviridisLitewithrxfunxml2yamlzip

smallsets User Guide

Rendered fromsmallsets.Rmdusingknitr::rmarkdownon Mar 24 2025.

Last update: 2023-12-04
Started: 2023-01-29

Readme and manuals

Help Manual

Help pageTopics
Synthetic datasets_data
Sets labellingsets_labelling
Sets sizingsets_sizing
Sets spacingsets_spacing
Smallset TimelineSmallset_Timeline