Rate Limit

The DMind API does NOT enforce strict rate limits for users under normal conditions. We aim to process and respond to every incoming request as efficiently as possible.

High Traffic Handling

In cases of heavy server load, you may experience delays in receiving a complete response. During such periods, your HTTP request will remain open and connected to the server.

  • Non-streaming requests may return temporary empty lines as a placeholder until the final JSON payload is ready.

  • Streaming requests will emit periodic keep-alive comments, compatible with OpenAI-style SDKs.

These responses should not interfere with JSON parsing. If you are handling HTTP responses manually, ensure that your implementation accounts for these intermediate lines or comments.

⏱️ If a request remains open for more than 30 minutes without completing, the server will automatically close the connection.

Last updated