...

  • Data: “Garbage in, garbage out.” Training any model hinges on accurate, high-quality data in a suitable format. If a department lacks this, it can hinder development and, even worse, lead to unreliable model output. Data can be broken down into the following aspects:

    • Accuracy: The data for a process is accurate and up-to-date

    • Volume: There’s an adequate amount of data to train, per model requirements

    • Format: Data is in an adequate format for training

  • Hallucinations: If a model lacks adequate data on a particular topic, it can provide false information. Tuning model parameters (e.g., temperature) can help minimize hallucinations.

  • Infrastructure Costs
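
To illustrate the temperature parameter mentioned under Hallucinations, here is a minimal sketch of how temperature reshapes the probabilities a model assigns to candidate next tokens. The function name and the example scores are hypothetical and are not taken from Jumbo Bot's code. Lower temperature concentrates probability on the top-scoring token, which reduces randomness in the output but does not by itself guarantee factual accuracy.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Turn raw token scores into probabilities, scaled by temperature (> 0)."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical scores for three candidate next tokens (illustrative only)
logits = [2.0, 1.0, 0.5]

low = softmax_with_temperature(logits, 0.2)   # sharply favors the top token
high = softmax_with_temperature(logits, 1.5)  # spreads probability more evenly

print([round(p, 3) for p in low])
print([round(p, 3) for p in high])
```

With a low temperature the model almost always picks its highest-scoring token; with a higher temperature, lower-scoring tokens are sampled more often, which increases variety and the chance of unsupported output.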

Design

Data

Jumbo Bot’s responses are based on data scraped from the following websites on a weekly basis:

...

Technical details such as design, code, testing, deployment and data ingestion can be found here: https://github.com/Tufts-Technology-Services/jumbo-bot/blob/main/README.md

...