Explore projects
galactic / public / src / apps / cli / framework / io / data
BSD 3-Clause "New" or "Revised" License
GALACTIC framework io data plugin
galactic / public / src / apps / cli / framework / core
BSD 3-Clause "New" or "Revised" License
GALACTIC core framework
Thèse Guillaume Bernard / Développement / from documents to events / document_tracking
GNU General Public License v3.0 or later
Implementation of algorithms to detect and track events reported in the news. It provides two approaches to tracking events in texts, one supervised and one unsupervised.
Archived
This competition aims to improve (denoise) OCR-ed texts, using a testbed of more than 20 million characters from English, French, German, Finnish, Spanish, Dutch, Czech, Bulgarian, Slovak and Polish.
Thèse Guillaume Bernard / Développement / from documents to events / news_tracking
GNU General Public License v3.0 or later
Command-line tools to manipulate the document_tracking architecture. They can train the Miranda algorithm, run it, and run the alternative K-Means implementation. A tool to evaluate the results is also provided.
Archived
Thèse Guillaume Bernard / Jeux de données / dataset_manipulation_tools / compute_tf_idf_weights
GNU General Public License v3.0 or later
This software computes TF-IDF weights for texts stored in the document_tracking_resources format. Vectors and weights are computed from a resource file that describes language use in the same context as the texts to weight (for example, features drawn from news to weight texts published in the news).
Archived
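As an illustration of the weighting idea described for compute_tf_idf_weights, here is a minimal sketch in plain Python. It is not the project's code: the function names `idf_from_reference` and `tf_idf`, the smoothed IDF formula, and the in-memory list of tokenised documents standing in for the resource file are all assumptions made for this example.

```python
import math
from collections import Counter


def idf_from_reference(reference_docs):
    """Smoothed inverse document frequency from a reference corpus.

    `reference_docs` is a list of token lists. It stands in for the
    resource file mentioned above (hypothetical representation).
    """
    n_docs = len(reference_docs)
    doc_freq = Counter()
    for tokens in reference_docs:
        # Count each term once per document.
        doc_freq.update(set(tokens))
    return {
        term: math.log((1 + n_docs) / (1 + df)) + 1.0
        for term, df in doc_freq.items()
    }


def tf_idf(tokens, idf, default_idf=None):
    """Weight the tokens of one text with IDF values from the reference corpus."""
    counts = Counter(tokens)
    total = sum(counts.values())
    if total == 0:
        return {}
    if default_idf is None:
        # Assumption: unseen terms get the largest (rarest-term) IDF.
        default_idf = max(idf.values(), default=1.0)
    return {
        term: (count / total) * idf.get(term, default_idf)
        for term, count in counts.items()
    }


reference = [["flood", "river"], ["election", "vote"], ["flood", "vote"]]
idf = idf_from_reference(reference)
weights = tf_idf(["flood", "flood", "dam"], idf)
```

Computing the IDF values from a separate reference corpus, rather than from the texts being weighted, keeps the weights comparable across documents that arrive one at a time, which matches the news-tracking setting described above.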
Thèse Guillaume Bernard / Développement / from documents to events / document_processing
GNU General Public License v3.0 or later
Processes documents to extract tokens, lemmas and named entities from texts. This software depends on spaCy (https://spacy.io/) to extract these text features.
Archived
Python tool to analyse and compare tide gauge data from online portals.
Thèse Guillaume Bernard / Développement / from events to documents / wikivents-projects / wikivents
GNU General Public License v3.0 or later
A Python package to process and represent events from ontologies and semi-structured databases such as Wikidata and Wikipedia.
Archived
Thèse Guillaume Bernard / Jeux de données / dataset_manipulation_tools / compute_dense_vectors
GNU General Public License v3.0 or later
This software computes dense vector representations (sentence embeddings) of sequences of natural-language sentences. It can handle multilingual documents as long as the model used is multilingual. It relies on the S-BERT architecture, software and models (https://www.sbert.net/). It computes dense vector representations for the tokens, lemmas, entities, etc. of your datasets.
Archived