GuaranIA
- 5 followers
- Paraguay
Popular repositories Loading
-
corpus
corpus PublicThe central repository with the main codebase for data ingestion, preprocessing, and model training pipelines.
Jupyter Notebook
-
-
fineweb2-exploration
fineweb2-exploration PublicCode used to explore the dataset fineweb2
Jupyter Notebook
-
-
guardrails
guardrails PublicCode that implements guard-rail features that identify and get rid off of inappropriate content
Python
-
existing-guarani-corpora
existing-guarani-corpora PublicCode used to explore the existing guarani corpora
Python
Repositories
- corpus Public
The central repository with the main codebase for data ingestion, preprocessing, and model training pipelines.
guaran-ia/corpus’s past year of commit activity - guardrails Public
Code that implements guard-rail features that identify and get rid off of inappropriate content
guaran-ia/guardrails’s past year of commit activity - madlad400-exploration Public
Code used to explore the madlad-400 dataset https://huggingface.co/datasets/allenai/MADLAD-400
guaran-ia/madlad400-exploration’s past year of commit activity - audio-transcription Public
Code used to explore the performance of LLMs to transcribe Guarani audios
guaran-ia/audio-transcription’s past year of commit activity - orembae-exploration Public
Code used to explore the text extracted from the book `Che ñe'e, che purahei` https://www.orembae.org.py/book/1
guaran-ia/orembae-exploration’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…