Explore projects
Thèse Guillaume Bernard / Développement / from events to documents / database_infrastructure_text_mining
GNU General Public License v3.0 or later
Textual search engine infrastructure based on Elasticsearch (https://www.elastic.co/fr/elasticsearch/) and Lucene (https://lucene.apache.org/). Includes the import scripts that load datasets into the index.
Archived
Thèse Guillaume Bernard / Jeux de données / dataset_manipulation_tools / synthesise_ocr_and_segmentation_errors_in_texts
GNU General Public License v3.0 or later
This software damages texts written in any natural language by applying OCR degradation (phantom characters, character degradation, etc.) and by over-segmenting them (that is, splitting each text at regular intervals into equal parts).
This is useful for reproducing common errors found in historical documents when historical data is missing.
Archived
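The two kinds of damage named in this description can be illustrated with a minimal sketch. It is not the project's code: the function names, the confusion table, the phantom character set, and the `rate` parameter are all assumptions made for this example.

```python
import random

# Hypothetical confusion table: visually similar replacements an OCR engine might produce.
CONFUSIONS = {"e": "c", "l": "1", "o": "0", "m": "rn", "i": "l"}
# Hypothetical set of spurious marks inserted as "phantom characters".
PHANTOMS = ".,'`"


def degrade(text: str, rate: float, rng: random.Random) -> str:
    """Damage roughly `rate` of the non-space characters of `text`."""
    out = []
    for ch in text:
        if ch.isspace() or rng.random() >= rate:
            out.append(ch)  # character left intact
        elif rng.random() < 0.5:
            out.append(CONFUSIONS.get(ch, ch))  # character degradation (substitution)
        else:
            out.append(ch)
            out.append(rng.choice(PHANTOMS))  # phantom character inserted after ch
    return "".join(out)


def over_segment(text: str, size: int) -> list[str]:
    """Split `text` into consecutive chunks of `size` characters (the last may be shorter)."""
    if size < 1:
        raise ValueError("size must be >= 1")
    return [text[i:i + size] for i in range(0, len(text), size)]
```

Passing a seeded `random.Random` keeps the damage reproducible, which matters when the degraded texts are used as an evaluation dataset.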
Thèse Guillaume Bernard / Jeux de données / dataset_manipulation_tools / compute_tf_idf_weights
GNU General Public License v3.0 or later
This software computes TF-IDF weights for texts stored in the document_tracking_resources format. Vectors and weights are computed using a resource file that represents the language as it is used in the same context as the texts to weight (for example, features extracted from news to weight texts published in the news).
Archived
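As a rough illustration of weighting a text against a background resource, the sketch below derives document frequencies from a small background corpus and applies them to a new text. It is not the project's implementation: the function names are hypothetical, the in-memory corpus stands in for the resource file, and the smoothed IDF formula is one common variant chosen for the example.

```python
import math
from collections import Counter


def document_frequencies(background: list[list[str]]) -> tuple[Counter, int]:
    """Count, for each term, how many background documents contain it."""
    df = Counter()
    for doc in background:
        df.update(set(doc))  # each document counts at most once per term
    return df, len(background)


def tf_idf(tokens: list[str], df: Counter, n_docs: int) -> dict[str, float]:
    """Weight the terms of one text using background document frequencies."""
    if not tokens:
        return {}
    tf = Counter(tokens)
    total = len(tokens)
    return {
        term: (count / total) * (math.log((1 + n_docs) / (1 + df[term])) + 1.0)
        for term, count in tf.items()
    }


# Usage: terms that are frequent in the background corpus receive lower weights.
background = [["the", "cat"], ["the", "dog"], ["the", "election"]]
df, n_docs = document_frequencies(background)
weights = tf_idf(["the", "election"], df, n_docs)
```

Separating the background statistics from the text being weighted mirrors the idea in the description: the IDF part comes from a corpus of the same genre, not from the text itself.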
Joudieh Noura / Moodle2EventLog
Apache License 2.0
Learning Management Systems like Moodle generate detailed logs from student interactions, offering significant potential for learning analytics and educational process mining. However, raw logs capture interaction-based actions rather than actual learning processes, limiting their pedagogical relevance. To address this, we developed Moodle2EventLog, a tool that automates the cleaning, preprocessing, and semantic enrichment of Moodle logs. The tool operates in two modules: the first cleans and structures logs by generating event logs with key elements (case IDs, activities, timestamps), and the second enriches them by grouping low-level events into context-aware sub-processes and mapping them to "Semantic Activities" based on Bloom’s Taxonomy.
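The two-module flow described above can be sketched as follows. This is an illustration only, not Moodle2EventLog's code: the row keys (`user`, `event`, `time`), the function names, and above all the mapping from event names to Bloom-style labels are hypothetical placeholders.

```python
from datetime import datetime
from itertools import groupby

# Hypothetical mapping from low-level event names to semantic activities.
SEMANTIC = {
    "Course module viewed": "Remember",
    "Quiz attempt submitted": "Apply",
}


def to_event_log(rows: list[dict]) -> list[dict]:
    """Module 1 (sketch): drop incomplete rows and keep case ID, activity, timestamp."""
    events = [
        {
            "case_id": r["user"],
            "activity": r["event"],
            "timestamp": datetime.fromisoformat(r["time"]),
        }
        for r in rows
        if r.get("user") and r.get("event") and r.get("time")
    ]
    events.sort(key=lambda e: (e["case_id"], e["timestamp"]))
    return events


def enrich(events: list[dict]) -> list[dict]:
    """Module 2 (sketch): merge consecutive events of a case sharing a semantic label."""
    def key(e):
        return e["case_id"], SEMANTIC.get(e["activity"], "Other")

    enriched = []
    for (case_id, label), group in groupby(events, key=key):
        group = list(group)
        enriched.append({
            "case_id": case_id,
            "semantic_activity": label,
            "start": group[0]["timestamp"],
            "end": group[-1]["timestamp"],
            "n_events": len(group),
        })
    return enriched


rows = [
    {"user": "s1", "event": "Course module viewed", "time": "2024-01-08T09:00:00"},
    {"user": "s1", "event": "Course module viewed", "time": "2024-01-08T09:05:00"},
    {"user": "s1", "event": "Quiz attempt submitted", "time": "2024-01-08T09:10:00"},
    {"user": "s2", "event": "Course module viewed", "time": "2024-01-08T09:01:00"},
    {"user": "", "event": "Course module viewed", "time": "2024-01-08T09:02:00"},
]
event_log = to_event_log(rows)
semantic_log = enrich(event_log)
```

Grouping only consecutive events with the same label is a deliberately simple stand-in for the context-aware sub-process detection the description mentions.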
Project carried out as part of the computer science curriculum for the "Project Management" course. It is a visual novel (a multiple-choice game) that can lead to several different endings depending on the decisions made during the game.