Skip to content

Pinned

  1. OLMo OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 4.1k 385

  2. dolma dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 822 80

  3. scispacy scispacy Public

    A full spaCy pipeline and models for scientific/biomedical documents.

    Python 1.6k 222

  4. ai2thor ai2thor Public

    An open-source platform for Visual AI.

    C# 1.1k 211

Repositories

Showing 10 of 458 repositories
  • OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 4,134 Apache-2.0 385 26 42 Updated Jun 15, 2024
  • dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 822 Apache-2.0 80 19 7 Updated Jun 14, 2024
  • discoveryworld Public

    A virtual environment for developing and evaluating automated scientific discovery agents.

    Python 6 Apache-2.0 0 0 0 Updated Jun 14, 2024
  • beaker-gantry Public

    Gantry streamlines running Python experiments in Beaker by managing containers and boilerplate for you

    Python 15 Apache-2.0 0 1 2 Updated Jun 14, 2024
  • beaker-py Public

    A pure-Python Beaker client

    Python 8 Apache-2.0 2 1 5 Updated Jun 14, 2024
  • discoverybench Public

    Discovering Data-driven Hypotheses in the Wild

    Python 4 1 0 3 Updated Jun 14, 2024
  • WildBench Public

    Benchmarking LLMs with Challenging Tasks from Real Users

    Python 110 Apache-2.0 10 0 0 Updated Jun 14, 2024
  • OLMo-Eval Public

    Evaluation suite for LLMs

    Python 271 Apache-2.0 29 3 10 Updated Jun 13, 2024
  • SciRIFF Public

    Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.

    Python 8 Apache-2.0 1 0 0 Updated Jun 14, 2024
  • ai2thor Public

    An open-source platform for Visual AI.

    C# 1,070 Apache-2.0 211 225 3 Updated Jun 13, 2024