Explore projects
-
-
galactic / public / src / apps / cli / framework / io / data
BSD 3-Clause "New" or "Revised" LicenseGALACTIC framework io data plugin
Updated -
galactic / public / src / apps / cli / framework / core
BSD 3-Clause "New" or "Revised" LicenseGALACTIC core framework
Updated -
Updated
-
Updated
-
ClimaBat experimental mockup - scaled-down street canyons in real-life climatic conditions
Updated -
Updated
-
Thèse Guillaume Bernard / Jeux de données / dataset_manipulation_tools / synthesise_ocr_and_segmentation_errors_in_texts
GNU General Public License v3.0 or laterThis software enables to damage texts written in any natural language by applying OCR degradation (phantom characters, character degradation, etc.) and by over-segmenting texts (this means splitting regularly the texts in equal parts).
This is useful to reproduce common errors found in historical documents when historical data is missing.
Archived 0Updated -
Updated
-
galactic / public / src / apps / cli / main
BSD 3-Clause "New" or "Revised" LicenseUpdated -
galactic / public / src / io / data / toml
BSD 3-Clause "New" or "Revised" LicenseA toml data reader for GALACTIC.
Updated -
galactic / public / src / io / data / text
BSD 3-Clause "New" or "Revised" LicenseA text data reader for GALACTIC.
Updated -
-
galactic / public / src / io / data / ini
BSD 3-Clause "New" or "Revised" LicenseAn INI data reader for GALACTIC
Updated -
Updated
-
Thèse Guillaume Bernard / Jeux de données / dataset_manipulation_tools / compute_dense_vectors
GNU General Public License v3.0 or laterThis software is used to compute dense vectorisations (sentence embeddings) of sequences of sentences of natural text. It is able to handle multilingual documents until the model used is a multilingual one. This relies on the S-BERT architecture, software and models (https://www.sbert.net/). It computes dense vector representations for tokens, lemmas, entities, etc. of your datasets.
Archived 0Updated -
Updated
-
Thèse Guillaume Bernard / Développement / from documents to events / document_processing
GNU General Public License v3.0 or laterProcess documents in order to extract tokens, lemmas and named entities from texts. This software depends on spaCy (https://spacy.io/) in order to extract text features and recognise the inner elements.
Archived 0Updated -
Updated
-
Updated