Explore projects
-
Updated
-
This software enables to damage texts written in any natural language by applying OCR degradation (phantom characters, character degradation, etc.) and by over-segmenting texts (this means splitting regularly the texts in equal parts).
This is useful to reproduce common errors found in historical documents when historical data is missing.
Archived 0Updated -
Methods to take into account digit preference (heaping) in count data of wildlife
Updated -
Updated
-
Updated
-
Updated
-
Updated
-
Updated
-
-
Updated
-
Updated
-
Updated
-
Updated
-
S4 Structure de Données TP1 Immutable Linked List
Updated -
Requests to collect documents relating real-world events (themselves described using wikivents) stored in a global index (provided by database_infrastructure_text_mining).
Archived 0Updated -
Updated