Explore projects
-
Updated
-
Thèse Guillaume Bernard / Développement / from documents to events / document_processing
GNU General Public License v3.0 or laterProcess documents in order to extract tokens, lemmas and named entities from texts. This software depends on spaCy (https://spacy.io/) in order to extract text features and recognise the inner elements.
Archived 0Updated -
Updated
-
Thèse Guillaume Bernard / Développement / from documents to events / documents_tracking_resources
GNU General Public License v3.0 or laterResources and Python API to manipulate datasets of news documents. It manipulates data in the .pickle format with the help of pandas and numpy. It can perform operations on the datasets.
Archived 0Updated -
Philippe Noah / my-arithmetic-nphili02
MIT LicenseUpdated -
Updated
-
Mariano Elric / my-arithmetic-emariano
MIT LicenseUpdated -
galactic / public / src / helpers / core
BSD 3-Clause "New" or "Revised" LicenseGALACTIC helpers core library
Updated -
Transcriptions of the French PARES dataset (originally coming from a unique .ods file, split in .csv files)
Updated -
-
galactic / public / src / io / data / text
BSD 3-Clause "New" or "Revised" LicenseA text data reader for GALACTIC.
Updated -
Visualisation du registre des traitements / Application web de visualisation du registre légal des traitements
CeCILL-B Free Software License AgreementProjet Open Source de visualisation interactive du registre des traitements de l'agglomération de La Rochelle.
Updated -
Updated
-
Thèse Guillaume Bernard / Jeux de données / dataset_manipulation_tools / compute_dense_vectors
GNU General Public License v3.0 or laterThis software is used to compute dense vectorisations (sentence embeddings) of sequences of sentences of natural text. It is able to handle multilingual documents until the model used is a multilingual one. This relies on the S-BERT architecture, software and models (https://www.sbert.net/). It computes dense vector representations for tokens, lemmas, entities, etc. of your datasets.
Archived 0Updated -
Updated