🥷 Assistant mode

Assistant agents are designed for everyday, general-purpose activities. They offer a versatile set of capabilities that make them suitable for a wide range of tasks and interactions.

Key Features

General-Purpose Interaction: Assistant agents are equipped to handle a variety of everyday queries and tasks, making them ideal for general conversation and assistance.
Image Generation: One of the standout features of Assistant agents is their ability to generate images based on user requests. This can be particularly useful for creative tasks or visual explanations.
Video Generation: Assistant agents can create videos up to 6 seconds long. Behind the scenes, ToothFairyAI generates two videos, one with audio and one without, to give the user maximum flexibility.
Document Analysis: Assistant agents can analyse uploaded files, bypassing the Knowledge Hub. This feature allows for quick, on-the-spot document processing and information extraction.
Multimodal Capabilities: When enabled, Assistant agents can generate responses that include not just text, but also images, charts, tables, and mind maps, enhancing the richness of their output.
File Upload Support: Users can upload various types of files including images, audio, video, and documents for the agent to process and analyse.
- File size limits:
  - Images: Maximum 5MB per image (all subscriptions)
  - Audio: Maximum 10MB per audio file (all subscriptions)
  - Video: Maximum 50MB per video file (all subscriptions)
  - Documents: Subscription-based limits
    - Individual: Maximum 20MB per document
    - Business: Maximum 20MB per document
    - Enterprise: Maximum 20MB per document
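
The limits above can be expressed as a small pre-upload check. This is only a sketch: `check_upload`, `LIMITS_MB`, and the category names are hypothetical and not part of any ToothFairyAI API. It also assumes that 1MB means 1024 * 1024 bytes, which the limits above do not specify.

```python
# Hypothetical pre-upload size check; not part of any ToothFairyAI API.
MB = 1024 * 1024  # Assumption: 1MB = 1024 * 1024 bytes.

# Per-file limits in MB, taken from the list above. Documents are capped at
# 20MB on the Individual, Business, and Enterprise plans alike.
LIMITS_MB = {"image": 5, "audio": 10, "video": 50, "document": 20}


def check_upload(kind: str, size_bytes: int) -> bool:
    """Return True if a file of the given kind fits within its size limit."""
    if kind not in LIMITS_MB:
        raise ValueError(f"unknown file kind: {kind!r}")
    return size_bytes <= LIMITS_MB[kind] * MB
```

Running a check like this on the client side lets an oversized file be rejected before it is sent to the agent.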

Image Generation Settings

Allow images generation: Enables the agent to create new images based on user input. By default, ToothFairyAI agents generate four images at each interaction, allowing users to further refine the output.
Image generation model: Users can select from different models:
- Mystica SD for general-purpose image creation
- Mystica SD realism for photographic generation tasks
- Flux Pro, Flux Pro 1.1, and Flux Pro 1.1 Ultra Realistic for commercial use cases (Enterprise and Business plans only)
Max images generated: Sets the number of images generated at each interaction. The default value is 4. When the agent is invoked by an Orchestrator agent, image generation produces only one image at each step by default.

Flux Pro

When a Flux Pro model is selected, ToothFairyAI defaults Max images generated to 1.

Orchestrator

When an Orchestrator agent invokes an Assistant agent with Image generation enabled, only one image will be generated at any given step.
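
The image count rules above can be summarised in a few lines. The sketch below is illustrative only: `effective_image_count` and its parameters are hypothetical names, not ToothFairyAI settings or API calls. It assumes that an explicitly chosen Max images generated value overrides the Flux Pro default, and that an Orchestrator invocation always yields one image regardless of that value. The text above does not spell out either precedence.

```python
from typing import Optional

# Model names as listed in the image generation model setting.
FLUX_PRO_MODELS = {"Flux Pro", "Flux Pro 1.1", "Flux Pro 1.1 Ultra Realistic"}
DEFAULT_MAX_IMAGES = 4


def effective_image_count(
    model: str,
    max_images: Optional[int] = None,
    invoked_by_orchestrator: bool = False,
) -> int:
    """Hypothetical helper: images produced per interaction or step."""
    if max_images is not None and max_images < 1:
        raise ValueError("max_images must be at least 1")
    # Orchestrator invocations generate one image per step (assumed hard cap).
    if invoked_by_orchestrator:
        return 1
    # Assumption: an explicit setting overrides the model-specific default.
    if max_images is not None:
        return max_images
    # Flux Pro models default to 1; other models default to 4.
    return 1 if model in FLUX_PRO_MODELS else DEFAULT_MAX_IMAGES
```

For example, `effective_image_count("Mystica SD")` returns 4, while `effective_image_count("Flux Pro 1.1")` returns 1.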

Video & 3D Model Generation (Enterprise Only)

Allow videos generation: Allows the agent to generate new videos based on user input. By default, ToothFairyAI agents generate two videos, one with audio and one without. The user can also supply the initial frame of the video by uploading an image file in the chat (maximum 5MB per image). If multiple images are uploaded, only the first one is used as the start of the video.
Video generation model: Mystica SD video is the default model for video creation.
Allow 3D model generation: Allows the agent to generate new 3D models based on user input. Agents with this setting enabled must receive an image in the chat to generate the 3D model (maximum 5MB per image). If multiple images are uploaded, only the first one is used for 3D model generation. The agent cannot generate a 3D model from the prompt alone.

Orchestrator

When an Orchestrator agent invokes an Assistant agent with Video generation enabled, only one video will be generated at any given step.
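
The input rules for video and 3D model generation can be sketched the same way. The helper names below (`video_request_plan`, `model_3d_source_image`) are hypothetical and only mirror the behaviour described in this section. They do not call or represent any ToothFairyAI API, and uploaded images are modelled as plain file names.

```python
from typing import Optional, Sequence, Tuple


def video_request_plan(
    images: Sequence[str], invoked_by_orchestrator: bool = False
) -> Tuple[Optional[str], int]:
    """Hypothetical helper: return (initial frame, number of videos)."""
    # Only the first uploaded image is used as the start of the video;
    # with no image, the video is generated from the prompt alone.
    initial_frame = images[0] if images else None
    # Two videos by default (with and without audio); one under an Orchestrator.
    video_count = 1 if invoked_by_orchestrator else 2
    return initial_frame, video_count


def model_3d_source_image(images: Sequence[str]) -> str:
    """Hypothetical helper: pick the image used for 3D model generation."""
    # A 3D model cannot be generated from the prompt alone.
    if not images:
        raise ValueError("3D model generation requires an uploaded image")
    # Only the first uploaded image is used.
    return images[0]
```

The asymmetry is the point of the sketch: an image is optional for video generation but mandatory for 3D model generation.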

Use Cases

Creative Assistance: Helping with brainstorming, idea generation, and visual concept creation.
Quick Document Analysis: Providing summaries or insights from uploaded documents without needing pre-loaded knowledge.
Visual Explanation: Creating images or diagrams to explain concepts or ideas.
General Q&A: Answering a wide range of questions on various topics.
Task Planning: Assisting with day-to-day task organisation and planning.
Content Creation: Helping generate ideas or drafts for various types of content.
Visual Content Generation: Creating custom images or short videos for presentations, social media, or personal projects.
Interactive Learning: Using multimodal capabilities to provide engaging explanations on various subjects.
Quick File Analysis: Offering rapid insights on uploaded documents, images, or audio files.
Everyday Problem Solving: Providing advice and solutions for common daily challenges.

Advanced Features

Multimodal Output: Ability to generate responses combining text, images, charts, and mind maps for comprehensive explanations.
File Processing: Capability to analyse and extract information from various file types including documents, images, and audio.
Contextual Understanding: Ability to maintain context throughout a conversation for more coherent and relevant responses.

Limitations

Assistant agents do not have access to the extensive knowledge base that Operator agents use.
They cannot execute code like Programmer agents.
They cannot accomplish complex, multi-step tasks like Orchestrator agents.
Image and video generation quality may vary based on the complexity of the request.
3D model generation requires an initial image input and may have limitations in detail and complexity.

Best Practices

Be Specific in Requests: Provide clear, detailed instructions when asking for image or video generation.
Utilise Multimodal Capabilities: Combine requests for text, images, and other visual elements for comprehensive responses.
Leverage File Upload: When seeking analysis or insights, upload relevant files for the agent to process.
Iterate on Creative Tasks: Use the agent's output as a starting point and refine through follow-up requests.
Combine with Other Agent Types: For complex tasks, consider using Assistant agents in conjunction with other specialised agents.

By leveraging these features and following best practices, Assistant agents can serve as versatile assistants for a wide range of everyday tasks and creative endeavours.
