Skip to content

Conversation

@doctorbanu
Copy link
Collaborator

@doctorbanu doctorbanu commented Oct 28, 2025


name: Apollo 11 test dataset and supporting files
about: This PR sets up the foundational dataset structure and documentation for further AI testing and comparison

Summary

This pull request introduces the Apollo 11 Test Dataset for evaluating AI models (commercial, distilled, SLM, and RAG systems) within the ELO2 - Green AI project.

Included files:

  • README.md – Overview and instructions for using the dataset
  • source_text.txt – Excerpted Apollo 11 text (~1,400 words) used for testing
  • test_prompts.md – 15 standardized test prompts covering summarization, reasoning, and retrieval
  • test_data.json – Structured JSON version of the dataset for automated workflows
  • RATIONALE.md – Detailed explanation of selection methodology and decisions

This dataset is designed for fair, consistent evaluation across multiple model types, including both recall and reasoning tasks. All files are properly linked and documented.

General Checks

  • the branch is up to date with main
  • the code works when pulled and run locally
  • All CI checks pass (or at least discussed)
  • all conflicts are resolved (if any)
  • PR has a descriptive title
  • PR has appropriate labels and milestones for easy identification
  • PR it is assigned to the owner
  • reviewers are assigned
  • the PR contributes only one focused change
  • It is in the appropriate column in the project board (if necessary)
  • has short and clear description
  • is linked to an issue (if it is related)
  • feedback is addressed (if any and if it is appropriate feedback.)

Markdown

  • the markdown source is formatted
  • spelling and grammar is correct in all text
  • The markdown looks correct when you preview the file
  • all links and images work

@doctorbanu doctorbanu added this to the milestone 2 milestone Oct 28, 2025
@doctorbanu doctorbanu self-assigned this Oct 28, 2025
@doctorbanu doctorbanu added the documentation Improvements or additions to documentation label Oct 28, 2025
@doctorbanu doctorbanu linked an issue Oct 28, 2025 that may be closed by this pull request
@doctorbanu doctorbanu changed the title Milestone 2- Apollo11 test prompts Milestone 2: Apollo11 test prompts Oct 28, 2025
Copy link
Collaborator

@CaesarGhazi CaesarGhazi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

INCREDIBLE WOOORRRK!!!

@CaesarGhazi CaesarGhazi merged commit 7007445 into main Oct 28, 2025
12 checks passed
AseelOmer pushed a commit that referenced this pull request Nov 13, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Milestone 2: Unified Test Prompts for Model Evaluation

3 participants