Agents
There are 3 types of agents that can be created: Chat
, Content Creator
and Business Analyst
Menu Location
Agents can be created and edited from the following menu:
Settings > Agent > Create Agent
Required Details
- Name
- Mode (select one):
- Retriever
- Coder
- Casual
- Planner
Mode Descriptions
- Retriever: Chat agent enhanced by your documents, internet search, and functions
- Coder: Coding assistant for developers and data scientists, enhanced by programming documentation and code environments
- Casual: Everyday agent for general-purpose activities, image generation, and file analysis
- Planner: Agent that can plan and execute tasks leveraging other agents
Mode-Specific Settings
Retriever Mode
- Topics (Under Knowledge settings) - required
- Optional features:
- Advanced reasoning: Forces the agent to explain its reasoning step by step
- Knowledge Settings: Allows the agent to use documentation to answer questions
- Internet search: Enables the agent to search the internet for answers
Coder Mode
- Optional features:
- Knowledge Settings: Allows the agent to use documentation to answer questions
- Code execution: Enables the agent to execute code
- Code upload: Allows the agent to analyze source code
Casual Mode
- Optional features:
- Image generation: Allows the agent to generate images based on user requests
- Docs upload: Enables the agent to analyze files, bypassing the Knowledge Hub
Planner Mode
- Agent selection: Choose agents to execute tasks
- Planner tools: A new section appears to perform necessary operations
Knowledge Settings
In the Knowledge Settings, you assign the topics to the agent and set the configuration of the agent and the knowledge hub. By default, new agents will not have any topics assigned to them. You will need to assign topics to the agent in order for it to be able to answer questions.
- Topics: The topics that the agent will be able to answer questions about.
- Max history (1-10): How many of the previous conversation will the agent remember to use for context.
- Keywords for knowledge base query: Search through the documentation with the keywords of the question or the full question. ToothFairyAI recommends to use keywords for larger documents sets and full question for smaller documents sets.
- Max keywords (1-10): Max number of keywords to use for the knowledge base query.
- TopK (1-20): Max number of results to return from the knowledge base query.
- Doc TopK (1-20): Max number of documents to extract from the initial knowledge base query by document id. Use this setting to reduce document bias in case especially of conflicting information.
- Min retrieval accuracy: Minimum confidence level for the knowledge base query to return results.
- Recency importance (0-5): How important is the recency of the document in the knowledge base query. This setting is used to give more weight to the most recent documents.
- Keywords importance (0-5): How important are the keywords in the knowledge base query. This setting is used to give more weight to the documents that contain the keywords.
ToothFairyAI's AI Inner Knowledge validation
ToothFairyAI's validation system, will only allow answers from AI Inner Knowledge that are verified. This feature is not how the other Aprovided bys operate. Other systems will use their own reasoning model which does not have these strict fact checking measures in place like what ToothFairyAI does.
The AI Inner Knowledge answers generated by ToothFairyAI are checked by an algorithmic mathematical logic pattern. This pattern references answers provided by what knowledge / documentation is available for the questions asked to it. This is done by converting question texts into 1024 dimension metrics which are then used to compared against an agents selected topics.
The recommended percentage of confidence level used by ToothFairyAI for providing an answer is 60%. This ensures that the output from an agent is accurate to the users questions. However the user can set the minimum retrieval accuracy to a higher percentage if they require a higher level of confidence in the answers provided by the agent.
Functions settings
Only available for Chat
agents
Functions
allow agents to consume APIs and DBs as a data source. Moreover, depending on the type, the function can provide suggestions or generate static responses for greater controllability of the agent's responses including via referencing entire web pages within the chat message.
Functions
are automatically enabled for retriever
agents while for casual
and coder
agents, the functionality is not available.
Functions can be created in the Functions
section of the Settings
page however to take effect they need to be associated to the agent.
Chat
and HTML
functions, when invoked, override API
, DB
functions and Knowledge hub
documentation.
See Settings > Functions
for more details.
Functions context
Functions context allows the agent to inject additional information while calling functions and tools using the customer, case and the overall conversation data. Customer and Case data will be present only if the necessary customer and case info is present at the time of calling the function while the chat data will always be present regardless of when the function gets called.
This setting allows the agent to complete a function call using data that is not available in the conversation and/or in the most recent messages.
Three types of context are available
- Customer: The entire JSON object containing the customer information
- Case: The entire JSON object containing the case information
- Chat: Chat metadata such as summary, phone number / email associated to channel, agent id, chat id etc.
Planner tools
Only available for Planner
agents.
The section comprises the following fields:
- Available agents for planning: The agents that will be used to execute the tasks. If none are selected, ToothFairyAI will prevent you from saving the agent unless
Dynamic agents generation
is enabled. - Planning instructions: Additional custom instructions that the planner can use to better understand the domain in which tasks will be executed and how to plan accordingly.
- Review instructions: Additional custom instructions that the planner can use to better understand how to review the plan and how to correct it if necessary.
- Max execution steps: The maximum amount of steps the planner can execute.
- Max re-attempts: The maximum amount of times the planner can re-attempt a failed task. This applies to all agents including
coder
agents withcode execution
enabled - Approval before execution: When enabled, the planner will require approval before executing a task. This is useful when you want to review the plan before it is executed or when you want to review the execution of a step which requires approval. By default the field set to
off
- Allow plan adjustment: When enabled, the planner will be able to adjust the plan if it detects that a task is not executable. This is useful when you want to allow the planner to correct the plan if it detects that a task is not executable. By default the field set to
off
- Dynamic agents generation: When enabled, the planner will be able to generate agents on the fly if it detects that a task requires a specific agent to be executed not alredy available in the
Available agents for planning
field. - Email on approval: When enabled, the planner will send an email to the user when a task requires approval. This is useful when you want to notify the user that a task requires approval. By default the field set to
off
- Email on completion: When enabled, the planner will send an email to the user when a task is completed. This is useful when you want to notify the user that a task is completed. By default the field set to
off
- Email on failure: When enabled, the planner will send an email to the user when a task fails. This is useful when you want to notify the user that a task has failed. By default the field set to
off
Planner
agents can execute autonomously for no longer than 60 minutes as per ToothFairyAI security policies. If your organisation requires more time for the executions, please contact us.
Planner
agents for now can only use mystica
, sorcerer 1.5
and Llama 3.1 Nemotron
models due to the complex reasoning required. Our team plans to release more models in the future.
Planners cannot perform the following operations:
- Complex powerpoints creation including custom styles and templates.
- PDF conversion to other file types
- Images extraction from docx, ppt and pdf files.
- Creation of videos, animations and 3rd graphics.
- Audio files creation
Lastly, documents can be only processed by Sorcerer and Mystica models, therefore 3rd party models will not be able to process documents and the plan execution might fail if the agent executing the step is configured to use a 3rd party model.
Execution environments and code upload settings
Only available for Coder
agents with code execution
enabled
Code environments
allow agents to leverage predefined docker images and code snippets to execute code. Once assigned to an agent, ToothFairyAI agents will automatically detect and use the most suitable environment for the task at hand.
Code upload
allows agents to receive up to 10 source code files at the time. To review which file types are supported see here
Internet search
Only available for Chat
agents of type Retriever
.
- Allow internet search: When enabled, the agent will be able to search the internet for answers.
- Max search results: The maximum number of search results to return from the internet search for any given mode - therefore if you have both
search
andnews
enabled and the max search results parameter is set to 10, the agent will return a maximum of 20 results. This is done to prevent an arbitrary result prioritisation from our side besides the standard SEO ranking. - Search location: The location to use for the internet search. This will be used to return results more relevant to the selected location.
- Search mode: The search mode to use for the internet search. The available options are
search
,news
,videos
,images
andshopping
. All options can be combined however if this field is left empty the agent will default tosearch
mode only. - Excluded domains from search: Domains to exclude from the internet search separated by a comma.
- Allow deep search: When enabled, the agent will be able to search the internet for answers in depth while when disabled the agent will only search the summaries of the results. This feature is only available for
search
andnews
mode while for the other results no deep search will be conducted. Regardless of how many results are returned, the agent will only be able to perform a deep search on the top three results.
When a Retriever
agent has only internet search
enabled while having no functions associated and no knowledge hub topics connected, the agent will default to searching the internet for answers.
The exact search queries used to retrieve the results are shared along with the actual websites in the details section of the answer to provide maximum transparency.
The internet search leverages an internal reranking model to return the most relevant results based on the user query and the max search results parameter. The reranking model runs after the webpages have been retrieved from the internet search engine, therefore the list of websites provided are not all considered during the answer generation process.
Agent Instructions
Only available for Chat
agents
The Agent Character are the input fields about the agent purpose, goals and what it should and should not talk about.
- Agent role and instructions: This is to set the purpose of the agent. Any additional istruction for the agent should be added here.
- Agent tooling guidelines: This is to set the tooling guidelines for the agent. This is useful when you want to force the agent to use a specific tool or when you want to prevent the agent from using a specific tool.
- Default answer: If a response cannot be found, this will be the response that is provided.
- Goals: This is how the agent will fullfil its role.
- Inhibition passage: Subjects that you do not want the agent to talk about.
- Pertinance passage: Subjects to force the agent to focus on.
Agent feedback
Only available for Chat
agents
When the agent receives feedback from any of the available chats the feedback will be displayed in this section. For performance reasons, only the most recent 10 feedback will be displayed. The context of the conversation and the user feedback is automatically included in the instructions of the agent.
Agent tools
Only available for Chat
agents
- Summarised memory: When enabled, the agent will only reference a summary of the conversation rather than the whole set of messages.
- Multilingual: Enables the multi-language capability for the agent to reply in over 120 languages, however this can impact performance so turn on only if necessary.
- Generate charts: If requested the agent can generate charts, tables and mind maps. This feature is available only when using
Sorcerer
andMystica
model families. When possible, ToothFairyAI will also generate a downloadable .png file for each chart and graph generated in the message. - Enable images upload: This setting will allow the user to upload images to the agent. - this feature is available only in
retriever
andcasual
mode and only one image can be uploaded for each message. The images must be in .png, .jpg or .jpeg format. - Enable audio upload: This setting will allow the user to upload audio files to the agent. - this feature is available only in
retriever
andcasual
mode and only one audio file can be uploaded for each message. The audio files must be in .wav or .mp4 format. - Enable video upload: This setting will allow the user to upload video files to the agent. - this feature is available only in
retriever
andcasual
mode and only one video file can be uploaded for each message. The video files must be in .mp4 format. - Enable audio generation: This setting allows agents output to be converted to speech. When the feature is enabled, a small speaker icon will appear next to the copy to clipboard icon. Clicking on the icon will play the audio. Depending on the internet connection, the audio may take a few seconds to load. To guarantee good output quality, ToothFairyAI agents preprocess the text in the message to ensure the audio is generated correctly and it is listeners friendly therefore the text displayed in some cases might not fully match the audio version. This feature is not available for
Starter
subscription users. - Enable speech to speech: This setting allows agents to convert speech to text and text to speech dynamically. The user will always need to press the microphone button to start the conversation while the response will be converted to speech automatically. This feature is only for
Enterprise
subscription users. - Voice selection: Choose the voice for audio generation, with female British English as the default; you can upload a custom MP3 or WAV file (under 10MB) to personalize your agent's voice. This feature effectively enables deep voice cloning and should be used responsibly and with proper consent. The user can also record 30 seconds of audio to use as the voice for the agent. Currently we natively support only English for
Starter
andPro
subscriptions while we support the following languages forEnterprises
: French, German, Spanish, Italian, Portuguese, Czech, Polish, Russian, Dutch, Turksih, Arabic, Mandarin Chinese - Enable docs upload: This setting will allow the user to upload documents to the agent. - this feature is available only in
casual
mode with up to 5 files uploaded at each turn. The docs must be of one of the following formats: .pdf, .csv, .doc, .docx, .xls, .xlsx, .html, .txt, .md . - Enhance answers with NER: This setting will allow the agent to used named entities in the response. - this feature is available only in
retriever
mode. - Retrieve images from docs: This setting will allow the agent to retrieve images and display them in the response. - this feature is available only in
retriever
mode. This feature is available only when usingSorcerer
andMystica
model families.
The following tools are not available for Planner
agents:
- Short term memory
- Multilingual
- Enhance ansers with NER
- Retrieve images from docs
Planner agents must allow for image, docs, video and audio uploads to be enabled in case the user wants to start a plan from one or more files.
Images generation
Only available for Chat
agents with casual
mode
Allow images generation: This setting allows the agents to generate new images based on the user input. ToothFairyAI agents generate by default four images at each interaction allowing users to further refine the output.
Image generation model: This setting allows the user to select the model to be used for image generation.
Mystica SD
is the default model for general purpose image creation;Mystica SD realism
is a fine-tuned image generation model for photographic generation tasks.Flux Pro
is an image generation model provided by Black Forest Labs for commercial use cases (not available forStarter
andPro
plan).Flux Pro 1.1
is the latest image generation model provided by Black Forest Labs for commercial use cases (not available forStarter
andPro
plan).Flux Pro 1.1 Ultra Realistic
is the latest image generation model provided by Black Forest Labs for commercial use cases finetuned for ultra realistic images (not available forStarter
andPro
plan).Max images generated: This setting allows the user to select the number of images to be generated at each interaction. The default value is 4. When the agent is invoked by the planner agent by default the image generation will produce only one at each step
When a Flux Pro
model is selected ToothFairyAI defaults the Max images generated
to 1
When a Planner
agent invokes a Casual
agent with Image generation
enabled, only one image will be generated at any given step.
Agent channels
Only available for Chat
agents
This section allows agents to be connected to one or more communication channels. The agent can be connected to as many channels as you want however each chat can be connected to only one channel at a time.
- Assign channels: This is the list of channels the agent can be connected to based on the channels created in the workspace. The user can add or remove channels from the list.
- Phone number: This is the phone number that will be used to send SMS messages to the user. This is required if the agent is connected to a SMS channel.
- Whatsapp number: This is the phone number that will be used to send Whatsapp messages to the user. This is required if the agent is connected to a Whatsapp channel.
- Email address: This is the email address that will be used to send emails to the user. This is required if the agent is connected to an Email channel.
- Delivery delay: If the agent is connected to a SMS, Whatsapp or Email channel the user will receive the message after the delay. This is useful to simulate a human response. The delay can be between 0 and 120 seconds.
Moderation and Feedback
- Allow feedback: This option allows the user to thumb up or down a response to show if the generated response is correct or not.
- Content moderation: When enabled, the agent will filter out any profanity from the response by responding back to the user with the moderation rule applied when necessary.
- Moderation rules: The list of rules the agent will use to moderate the content. The user can add or remove rules from the list.
The moderation of the user message is performed via a combination of a moderation model fine-tuned by the ToothFairyAI team and the base model associated to the agent
Advanced Settings
Temperature (0.01-1): Determines how creative the response will be, higher meaning most creative. Reduce this number to the lowest value to reduce hallucinations
Max output tokens (50-4096): Determines the number of tokes (characters set) in the response. The larger the number the longer and more detailed the responsed can be. If the number is set too low some answers might get truncated.
Prompt enhancement: When enabled, it enhances the user prompt behind the scenes to improve the quality of the generated response. This feature might add a slight delay to the response therefore it is not recommended for public facing agents.
Show code blocks: When set to
false
it suppresses the display of any code snippet. This feature is available only forCoder
andPlanner
agents. By default this option is set totrue
.Plain text output: Removes any form of styling from the responses.
Show response time: Displays the time from the first word to the end of the last word in the response.
Show detected language: This setting will display the language that what used in the response.
Restrict access: When enabled, the agent will only be accessible to the users specified in the user access settings and the admins of the workspace. When this mode is enabled the agent will not be accessible to the public through the web-widget and all chats will be private by default.
When Multimodal
is enabled, the agent will be able to generate responses with images, charts, tables and mind maps. This feature is available only when using Sorcerer
and Mystica
model families.
Appearance and Web Widget
Only available for Chat
agents
- Splash Logo: A logo you would like displayed in the widget.
- First chat message: The initial message displayed when the chat is started.
- Show agent name: When ticked, the agent name is displayed in the chat message. This is very useful when agent hand-off is enabled.
- Show splash message: When ticked, the first chat message is displayed as splash message in the middle of the chat.
- Input placeholder: Text displayed in the user input field.
- Loading placeholder: Text displayed when the agent is retrieving the answer.
- Disclaimer: A statement that explains rules, limits, or warnings.
- Icons colour in light mode: The global icon colour in light mode. This setting will be applied also in the side menu of
Chat agent
. - Icons colour in dark mode: The global icon colour in dark mode. This setting will be applied also in the side menu of
Chat agent
. - Splash background colour in light mode: The background colour of the top bar where the logo is displayed in light mode.
- Splash background colour in dark mode: The background colour of the top bar where the logo is displayed in dark mode.
- Favicon url: The url of the favicon to be displayed in the browser tab. The favicon should be a square image and must be in .ico or .png format.
- Initial questions: A list of questions that the agent can answer when the chat is started. The user can create up to three questions separated by a semicolon.
- Go to widget: The url of the page where the user can directly interact with the agent. Available only when the agent is
Block external usage
is ticked off. - Block external usage: Blocks the widget from public access.
Chat agents by default are provided with both the public url of the agent if enabled and the code to embed the agent into any web-page as a chat widget.
When you provide a custom icon to the agent it will be displayed in the side menu of Chat agent
and in the web widget.
Hosting and Models
- LLM provider: The provider used to host the agent. Currently only ToothFairyAI is available as a provider for
Starter
andPro
plans. ForEnterprise
plans, the user can choose between ToothFairyAI and their own hosting. - Base model: The model used by the agent to provide answers and generate outputs for the
Chat agent
,Business analyst
andContent Creator
agents.Sorcerer
: The default model used by any agent fine-tuned by ToothFairyAI.Sorcerer 1.5 (experimental)
: A quantum leap in the capabilities of the Sorcerer model, offering enhanced performance and accuracy. Request access to this model from the support team.Mystica
A larger model more powerful and more accurate however slower fine-tuned by ToothFairyAI.Mistral 7b - FP16
: A small but very capable model based created by Mistral team.Qwen 2.5 7b - FP16
: The smallest version of the latest open-sourced models provided by the Qwen team.Qwen 2.5 Coder 32b - FP8
: The latest open-sourced model designed for programming tasks provided by the Qwen team.Mistral 8x7b - FP16
: A larger model provided by Mistral team.Llama 3 8b
: The legacy model created by Meta AI in its smallest version. (dismissed on 15/08/2024)Llama 3.1 8b - FP8
: The latest model created by Meta AI in its smallest version and larger context window.Gemma 2 9b - FP16
: The latest open source model provided by Google.Gemma 2 27b - FP16
: The largest open source model provided by Google.Llama 3 70b
: The legacy model created by Meta AI in its larger version. (dismissed on 15/08/2024)Mistral 8x22b - FP16
: The largest open-sourced model provided by Mistral team (not available forStarter
plan).Qwen 2 72b - FP16
: The largest open-sourced model provided by the Qwen team (not available forStarter
plan).Qwen 2.5 72b - FP16
: The largest and most modern version of the latest open-sourced models provided by the Qwen team (not available forStarter
plan).Mistral Large
: A larger model closed-sourced provided by Mistral team (not available forStarter
plan).Mistral Large 2
: The largest and most capable closed-sourced model provided by Mistral team (not available forStarter
plan).Llama 3.1 70b - FP8
: The latest model created by Meta AI in its largest version and larger context window (not available forStarter
plan).Llama 3.1 Nemotron 70b - FP16
: The latest fine-tuned model created by Nvidia using 70b Llama 3.1 model as base model (not available forStarter
plan).Llama 3.1 405b - FP8
: The largest open-sourced and state of the art model provided by Meta (not available forStarter
plan).
The FP16
models are optimized for speed and lower memory usage with 16-bit floating point precision.
The FP8
models are optimized for faster speed and lower memory usage with 8-bit floating point precision.
- Functions provider: The provider used to host the functions. Currently only ToothFairyAI is available as a provider for
Starter
andPro
plans. ForEnterprise
plans, the user can choose between ToothFairyAI and their own hosting. - Functions model: The model used by ToothFairyAI to select the functions to be used by the agent.
Sorcerer
: The default model used by any agent fine-tuned by ToothFairyAI.Sorcerer 1.5 (experimental)
: A quantum leap in the capabilities of the Sorcerer model, offering enhanced performance and accuracy. Request access to this model from the support team.Mystica
A larger model more powerful and more accurate however slower.Llama 3.1 8b - FP8
: The latest model created by Meta AI in its smallest version and larger context window.Llama 3.1 70b - FP8
: The latest model created by Meta AI in its largest version and larger context window.Llama 3.1 405b - FP8
: The largest open-sourced and state of the art model provided by Meta.
The models list can be expanded by 3rd party providers. Contact us for more information.
For Enterprise
plans, the user can choose to host the agent and functions locally on their own servers or on a cloud provider of their choice.
Contact us for more information.