AI & ML interests
None defined yet.
Recent Activity
textcleanlm/essentialweb-1.0-10B-clean-content
Viewer
•
Updated
•
9.32M
•
116
textcleanlm/essentialweb-1.0-10B-raw-content
Viewer
•
Updated
•
9.32M
•
135
textcleanlm/essentialweb-1.0-sample-10B
Viewer
•
Updated
•
9.32M
•
481
Viewer
•
Updated
•
2.98M
•
262
textcleanlm/med-domain-5b
Viewer
•
Updated
•
4.07M
•
242
textcleanlm/med-domain-data-sample1
Viewer
•
Updated
•
814k
•
120
textcleanlm/med-domain-data-sample
Viewer
•
Updated
•
8.1k
•
15
textcleanlm/fineweb-sample-10BT
Viewer
•
Updated
•
14.9M
•
119
textcleanlm/textclean-10B
Viewer
•
Updated
•
9.77M
•
422
textcleanlm/textclean-2B-raw-cleaned
Viewer
•
Updated
•
1.95M
•
477
textcleanlm/textclean-2B-raw-sample
Viewer
•
Updated
•
100
•
17
textcleanlm/textclean-2B-raw
Viewer
•
Updated
•
1.97M
•
33
textcleanlm/textclean-sft
Viewer
•
Updated
•
894k
•
33
Viewer
•
Updated
•
91.7k
•
39
textcleanlm/textclean-200M
Viewer
•
Updated
•
581k
•
38
textcleanlm/100M-raw-webtext-to-denoised-text
Viewer
•
Updated
•
179k
•
39
textcleanlm/annotation_example
Viewer
•
Updated
•
1.82k
•
17
Viewer
•
Updated
•
1.82k
•
119
textcleanlm/textclean-20M
Viewer
•
Updated
•
18.3k
•
24
textcleanlm/textclean-corpus-10M-deepseek-ablation
Viewer
•
Updated
•
18.1k
•
30
textcleanlm/textclean-corpus-1M-variant-ablation-research
Viewer
•
Updated
•
1.82k
•
12
textcleanlm/textclean-corpus-1M-old
Viewer
•
Updated
•
1.82k
•
12
•
1
textcleanlm/textclean-corpus-1M-o4-mini
Viewer
•
Updated
•
1.82k
•
16