Explore projects
-
Thèse Guillaume Bernard / Développement / from events to documents / annotate_events_with_wikidata_identifiers
GNU General Public License v3.0 or laterAnnotation tool to check whether the annotation of a Corpus (document_tracking_resources) are correct and truthful.
Archived 0Updated -
Updated
-
-
Updated
-
Updated
-
Updated
-
ClimaBat experimental mockup - scaled-down street canyons in real-life climatic conditions
Updated -
galactic / public / src / core / closure
BSD 3-Clause "New" or "Revised" LicenseGALACTIC core closures library
Updated -
Thèse Guillaume Bernard / Jeux de données / dataset_manipulation_tools / compute_dense_vectors
GNU General Public License v3.0 or laterThis software is used to compute dense vectorisations (sentence embeddings) of sequences of sentences of natural text. It is able to handle multilingual documents until the model used is a multilingual one. This relies on the S-BERT architecture, software and models (https://www.sbert.net/). It computes dense vector representations for tokens, lemmas, entities, etc. of your datasets.
Archived 0Updated -
Thèse Guillaume Bernard / Jeux de données / dataset_manipulation_tools / compute_tf_idf_weights
GNU General Public License v3.0 or laterThis software is used to compute TF IDF weighting from texts that are based on the document_tracking_resources format. Vectors and weightings are computed thanks to a resource file that contains a representation of the language used in the same context as the text to weight (news features to weight texts published in the news).
Archived 0Updated -
galactic / public / src / core / concept
BSD 3-Clause "New" or "Revised" LicenseGALACTIC core formal concept analysis library
Updated -
galactic / public / src / io / data / core
BSD 3-Clause "New" or "Revised" LicenseUpdated -
Thèse Guillaume Bernard / Développement / from events to documents / database_infrastructure_text_mining
GNU General Public License v3.0 or laterTextual Search Engine Infrastructure based on ElasticSearch (https://www.elastic.co/fr/elasticsearch/) and Lucene (https://lucene.apache.org/). Includes the import scripts to load datasets into the index.
Archived 0Updated -
-