[#Script #Coding] Can you guess the output here?

Spread the love

Can you guess the output here?

By freeCodeCamp.org
Published: Jan 12, 2026


freeCodeCamp.org Learn how to benchmark embedding models on your own data in this course for beginners.

In this course, you will learn:
– The limitations of extracting text from PDF files with Python libraries and to solve that with the help of VLMs (Vision Language Models).
– How to divide the extracted text into chunks that preserve context.
– Generation questions for each chunk using LLMs (Large Language Models).
– Use embedding models to create vector representations of the chunks and questions.
– Use both open source and proprietary embedding models.
– Use llama.cpp to run models in the GGUF format locally on your machine.
– Perform the benchmarking of different embedding models using various metrics and statistical tests with the help of ranx.
– Plot the vector representations to visualize if clusters are being formed.
– Understand how to interpret the p-value that a statistical test provides.
– And much more!

You can find the slides, notebook, and scripts in this GitHub repository:
https://github.com/ImadSaddik/Benchmark_Embedding_Models

The dataset is available here:
https://huggingface.co/datasets/ImadSaddik/BenchmarkEmbeddingModelsCourse

To connect with Imad Saddik, check out his social accounts:
LinkedIn: https://www.linkedin.com/in/imadsaddik/
YouTube: https://www.youtube.com/@3CodeCampers
Website: https://imadsaddik.com/

?? Course Contents ??
(0:00:00) About the course
(0:06:05) Introduction
(0:17:58) Extracting text from PDF documents
(1:01:08) Divide text into coherent chunks
(1:23:10) Generate question-answer pairs from text chunks
(1:38:48) Embed text chunks and questions
(2:17:06) Statistical tests and metrics
(3:12:01) Expanding the dataset and adding more languages
(3:45:24) Conclusion

This one can be super tricky. Can you find the output? Share your solution so others can check!


Spread the love
Proudly powered by WordPress
Creative Commons License
EricBrooks.Com® is licensed under a Creative Commons License.

Disclaimer: The views expressed herein are solely those of Eric Brooks. They do not necessarily reflect those of his employers, friends, contacts, family, or even his pets (though my cat, Puddy, seems to agree with me on many key issues.). In accordance to my terms of use, you hereby acknowledge my right to psychoanalyze you, practice accupuncture, and mock you incessantly with every visit. As the user, you also acknowledge that the author has been legally declared a "Problem Adult" by the Commonwealth of Pennsylvania, and is therefore not responsible for any of his actions. ALSO, the political views and products advertised on this site may/may not reflect the views of Puddy or myself, so please don't take them as an endorsement. We just need to eat.


Connect