Frequently Asked Questions About Cohere ↗
noOriginal Documentation
title: Frequently Asked Questions About Cohere slug: docs/cohere-faqs hidden: false description: >- Cohere is a powerful platform for using Large Language Models (LLMs). This page covers FAQs related to functionality, pricing, troubleshooting, and more. image: type: fileId value: ‘https://files.buildwithfern.com/cohere.docs.buildwithfern.com/346977d0e82f88b23da74f842e80460af9847cdcd7ac3c4eb2070df017b8824e/assets/images/e2602c8-meta_docs_image_cohere.jpg' keywords: ’natural language processing, generative AI, fine-tuning models’#
Here, we’ll walk through some common questions we get about how Cohere’s models work, what pricing options there are, and more!
Cohere Models#
You can check out this link to learn more about Aya models, datasets and related research papers.
To fine-tune models for tasks like data extraction, question answering, or content generation, it’s important to start by defining your goals and ensuring your data captures the task accurately.
For generative models, fine-tuning involves training on input-output pairs, where the model learns to generate specific outputs based on given inputs. This is ideal for tasks like customizing responses or enforcing a particular writing style.
For tasks like data extraction, fine-tuning helps the model identify relevant patterns and structure data as needed. High-quality, task-specific data is essential for achieving accurate results.
For more details, you can refer to Cohere’s fine-tuning guide for best practices.
Fine tuning is a powerful capability, but takes some effort to get right. You should first understand what you are trying to achieve and then determine if the data you are planning to train on effectively captures that task. The generative models specifically learn off of input/output pairs and therefore need to see examples of the expected input for your task and the ideal output. For more information, see our finetuning guide.
Our Command model family is our flagship series of generative models. These models excel at taking a user instruction (or command) and generating text following the instruction. They also have conversational capabilities which means that they are well-suited for chatbots and virtual assistants.
For representation tasks, we offer two key models:
- Embed: Embed models generate embeddings from text, allowing for tasks like classification, clustering, and semantic search.
- Rerank: Rerank models improve the output of search and ranking systems by re-organizing results according to specific parameters, improving the relevance and accuracy of search results.
Our models perform best when used end-to-end in their intended workflows. For a detailed breakdown of each model, including their latest versions, check our models page.
From there, you should associate each chunk to a page and a doc id which will allow you to have various levels of granularity for retrieval.
You can find further guides on chunking strategies and handling PDFs with mixed data.
Additionally, pre-training data has been included for the following 13 languages: Russian, Polish, Turkish, Vietnamese, Dutch, Czech, Indonesian, Ukrainian, Romanian, Greek, Hindi, Hebrew, Persian.
You can find a full list of languages that are supported by Cohere’s multilingual embedding model here.
Model Deployment#
Platform & API#
Trial API key usage is free, but limited. You can test different applications or build proofs of concept using all of Cohere’s models and APIs with a trial key by simply signing up for a Cohere account here.
Getting Started#
Visit the API docs for further details.
You can find the resources as follows:
For learning, we recommend our LLM University hub resources, which have been prepared by Cohere experts. These include a number of very high-quality, step-by-step guides to help you start building quickly.
For building, we recommend checking out our Github Notebooks, as well as the Get Started and Cookbooks sections in our documentation.
- Prompt Engineering Basics Guide
- Tips on Crafting Effective Prompts
- Techniques of Advanced Prompt Engineering.
For the most reliable results when working with external document sources, we recommend using a technique called Retrieval-Augmented Generation (RAG). You can learn about it here:
You can find a list of comprehensive tutorials and code examples in our LLM University hub and the Cookbook guides.
Troubleshooting Errors#
API Key Limitations#
Cohere’s API keys have certain limitations and permissions associated with them. If you are encountering errors related to API key limitations, it could be due to the following reasons:
- Rate Limits: Cohere’s API has rate limits in place to ensure fair usage. If you exceed the allowed number of requests within a specific time frame, you may receive an error. To resolve this, double check the rate limits for your API plan and ensure your usage is within the specified limits. You can also implement a rate-limiting mechanism in your code to control the frequency of API requests.
- API Key Expiration: API keys may have an expiration date. If your key has expired, it will no longer work.Check the validity period of your API key and renew it if necessary. Contact Cohere’s support team if you need assistance with key renewal.
Missing Artifacts#
Cohere’s dataset creation process involves generating artifacts, which are essential components for training models. If you receive errors about missing artifacts, consider the following:
- Incorrect Dataset Format: Ensure your dataset is in the correct format required by Cohere’s API. Different tasks (e.g., classification, generation) may have specific formatting requirements. Review the documentation for dataset formatting guidelines and ensure your data adheres to the specified structure.
- File Upload Issues: Artifacts are generated after successfully uploading your dataset files. Issues with file uploads can lead to missing artifacts. Verify that your dataset files are accessible and not corrupted. You should also check file size limits to ensure your files meet the requirements.
- Synchronization Delay: Sometimes, there might be a slight delay in generating artifacts after uploading the dataset. Wait for a few minutes and refresh the dataset status to see if the artifacts are generated.
General Troubleshooting Steps#
If your problem doesn’t fall into these buckets, here are a few other things you can try:
- Check API Documentation: Review the Cohere API documentation for dataset creation to ensure you are following the correct steps and parameters.
- Inspect API Responses: Carefully examine the error responses returned by the API. They often contain valuable information about the issue. Cohere uses conventional HTTP response codes to indicate the success or failure of an API request. In general:
- Codes in the 2xx range indicate success.
- Codes in the 4xx range indicate an error that failed given the information provided (e.g., a required parameter was omitted, a charge failed, etc.).
- Codes in the 5xx range indicate an error with Cohere’s servers (these are rare).
Review the Errors page to learn more about how to deal with non-2xx response code.
Reach Out to Cohere Support#
If the issue persists, contact Cohere’s support team. They can provide personalized assistance and help identify any specific problems with your API integration.
First, check our status page at status.cohere.com to see if any known issues or maintenance activities might impact your access.
If the status page doesn’t indicate any ongoing issues, the next step would be to reach out to our support teams. They’re always ready to assist and can be contacted at support@cohere.com. Our support team will be able to investigate further and provide you with the necessary guidance to resolve the login issue.
Check Your Credentials: Ensure you use the correct username and password. It’s easy to make a typo, so double-check your credentials before logging in again.
Clear Cache and Cookies: Sometimes, issues with logging in can be caused by cached data or cookies on your device. Try clearing your browser’s cache and cookies, then attempt to log in again.
Contact Support: If none of the above steps resolve the issue, it’s time to contact our support team. We are equipped to handle a wide range of login and authentication issues and can provide further assistance. You can contact us at support@cohere.com.
If you’re facing any technical challenges or need guidance, our support team is here to help. Contact us at support@cohere.com, and our technical support engineers will provide the necessary assistance and expertise to resolve your issues.
Billing, Pricing, Licensing, Account Management#
What model are you referring to?
Copy paste the error message
- Please note that this is our error message information:
- 400 - invalid combination of parameters
- 422 - request is malformed (eg: unsupported enum value, unknown param)
- 499 - request is canceled by the user
- 401 - invalid api token (not relevant on AWS)
- 404 - model not found (not relevant on AWS)
- 429 - rate limit reached (not relevant on AWS)
- Please note that this is our error message information:
What is the request seq length you are passing in?
What are the generation max tokens you are requesting?
Are all the requests of various input/output shapes failing?
Share any logs
Please refer to our dedicated pricing page for most up-to-date pricing.
Trial Key Limitations
Trial keys are rate-limited depending on the endpoint you want to use. For example, the Embed endpoint is limited to 5 calls per minute, while the Chat endpoint is limited to 20 calls per minute. All other endpoints on trail keys are 1,000 calls per month. If you want to use Cohere endpoints in a production application or require higher throughput, you can upgrade to a production key.
Production Key Specifications
Production keys for all endpoints are rate-limited at 1,000 calls per minute, with unlimited monthly use and are intended for serving Cohere in a public-facing application and testing purposes. Usage of production keys is metered at price points which can be found on the Cohere pricing page.
To get a production key, you’ll need to be the admin of your organization or ask your organization’s admin to create one. Please visit your API Keys > Dashboard, where the process should take less than three minutes and will generate a production key that you can use to serve Cohere APIs in production.
However, if you have a request that requires further assistance or if the changes you wish to make are not covered by the Dashboard, our support team is here to help. Please feel free to reach out directly at support@cohere.com or ask your question in our Discord community.
Check our pricing page for more information.
Check our free trial documentation for more information.
In terms of usage guidelines, we’ve compiled a comprehensive set of resources to ensure a smooth and compliant experience. You can access these guidelines here.
We’re excited to support your business and its unique needs. If you have any further questions or require additional assistance, please don’t hesitate to reach out to our team at sales@cohere.com or support@cohere.com for more details.
If you’re unable to find the specific feature or information regarding merging accounts, our support team is always eager to help.
Simply start a new chat with them using the chat bubble on our website or reach out via email to support@cohere.com.
It’s important to note that different models may have different token and document limits. To ensure you’re working within the appropriate parameters, we’ve provided detailed information about these limits for each model in this model overview section.
We understand that managing token limits can be a crucial aspect of your work, and we’re here to support you in navigating these considerations effectively. If you have any further questions or require additional assistance, please don’t hesitate to reach out to our team at support@cohere.com
Should you have any further questions please feel free to reach out to our sales team at sales@cohere.com or support@cohere.com for more details.
Legal, Security, Data Privacy#
1. Model Security and Safety#
This responsibility lies primarily with the model provider, and at Cohere, we are deeply committed to ensuring responsible AI development. Our team includes some of the top experts in AI security and safety. We lead through various initiatives, including governance and compliance frameworks, safety and security protocols, strict data controls for model training, and industry thought leadership.
2. Secure Application Development with Cohere Models:#
While Cohere ensures the model’s security, customers are responsible for building and deploying applications using these models securely. A strong focus on a Secure Product Lifecycle is essential, and our models integrate seamlessly into this process. Core security principles remain as relevant in the AI space as elsewhere. For example, robust authentication protocols should exist for all users, services, and micro-services. Secrets, tokens, and credentials must be tightly controlled and regularly monitored.
Our recommendations:#
- Implement responsible AI and governance policies in your AI development process, focusing on customer safety and security.
- Continuously monitor the performance of your applications and promptly address any issues that arise.
We also regularly share insights and best practices on AI security on our blog. Here are a few examples: 1, 2, 3.