⏱️ The Essentials in 3 Minutes

• A high-performing document covers one topic, can be summarized in a short precise title, and contains only up-to-date information.
• Redundancies between documents compete with complementary information and degrade the relevance of responses.
• Access rights must match the actual scope of the users concerned, to avoid interference.
• Four questions to ask before each import: single topic? valid information? no duplicates? appropriate access?

🧠 Understand the Pedagogical Value of Document Structure

Didask AI does not "read" a document the way a human does: it compares each question against your entire knowledge base and selects the passages closest to the learner's need. A poorly structured, too long, too vague, or redundant document will be hard to retrieve, even if it contains the right answer.

Ensuring your documents are well-structured allows the AI to pinpoint exactly what each collaborator needs at the right moment. It also guarantees that your business knowledge is transmitted faithfully, without noise or ambiguity.

✅ Evaluate a Document Before Adding It to the Base

Before any import, ask yourself these four questions:

Question	If the answer is no...
Can I summarize this document in a short, precise title?	The document probably covers too many topics: split it.
Is all the information still valid?	Clean up the document or do not include it.
Are there other documents in the base that say the same thing?	Remove the overlapping parts or merge the documents.
Is this document useful to all users who have access to this base?	Restrict access or separate into distinct bases.

📌 Follow the Rule: One Document, One Topic

A document that covers multiple topics at once will be hard to associate with a specific question. The AI will struggle to identify it as the most relevant source, even if it contains the right information.

The tell-tale sign: you have trouble giving it a short, precise title.

Good example: "Client quote validation process" - explains only the validation steps, the stakeholders, and the timelines.
To avoid: "Sales team meeting - March 12" - covers quote validation, an HR update, Q3 strategy, and a bug post-mortem all at once.

What to do: split the document into as many documents as there are distinct topics. Each document should have a title that summarizes its content in one short sentence.

🧹 Remove Outdated Information

The AI cannot distinguish what is still valid from what is outdated. Old information can make the right information harder to find, or even cause incorrect responses.

Good example: an up-to-date expense reimbursement policy, containing only the rules currently in force.
To avoid: the same document with new rules added at the bottom, without removing the old ones. The AI cannot tell which ones are valid.

What to do: extract only the still-valid parts and create a clean new document. If a document is too cluttered to clean up easily, it is better not to include it. Fewer documents of higher quality always yields better performance.

🔁 Avoid Redundancies Between Documents

If multiple documents cover the same topic in different words, the AI will surface them simultaneously. These duplicates take up space that could be filled with complementary knowledge, and the overlapping parts mask the specific content of each document.

Good example: two documents on the same tool - one titled "Technical guide to tool X for developers", the other "Using tool X day-to-day for business teams". Each goes straight to what is specific to its audience, without a shared introduction.
To avoid: three versions of documentation on the same feature (marketing, support, developers) that all start with the same two paragraphs of general overview.

What to do: remove the overlapping parts, or isolate them in a dedicated document that you reference. Clearly distinguish documents in their title ("technical version" or "simplified version").

🔒 Adapt Access Rights to Users

Not all learners need all the knowledge. Highly technical documents intended for a small group can interfere with responses for other users whose questions use similar words.

Good example: technical documentation on the product architecture is accessible only to R&D teams. Sales teams have access to a separate base with product sheets and sales pitches.
To avoid: a base accessible to everyone that mixes client call transcripts, internal technical specifications, and onboarding guides.

What to do: define user groups and assign documents accordingly. Build the base progressively: add a document when a group expresses a real need, not in anticipation.

Keywords: document structure, knowledge base, Didask AI, document quality, performance, redundancies, access rights.

📚 Understand the Fundamentals of the Knowledge Base for High-Performing AI

🗣️ Analyze Response Quality and User Feedback

📢 Rate AI Responses

📘 Feed the Didask AI Knowledge Base

📄 Choose the Documents to Ingest into Your Knowledge Base

📄 Structure Your Documents for Better AI Responses