𝟱 𝗣𝗿𝗼𝗺𝗽𝘁 𝗙𝗿𝗮𝗺𝗲𝘄𝗼𝗿𝗸𝘀 𝗳𝗼𝗿 𝗠𝗲𝗱𝗶𝗰𝗮𝗹 𝗔𝗜

📅2 days ago⏱2 min read

Longer prompts can make an AI less accurate in medical testing.

I tested how different prompt styles affect AI accuracy in classifying genetic variants. I used 27 tests to ensure the results were reliable.

Here is what I found.

𝗧𝗵𝗲 𝗧𝗲𝘀𝘁𝗶𝗻𝗴 𝗦𝘁𝘆𝗹𝗲𝘀

Verbose: Detailed instructions. Tells the AI to act as an expert and list every clinical rule.
Concise: Short and direct. Tells the AI to classify the variant and stop.
Structured: Uses a JSON-like format with specific fields like Gene and Variant.
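
To make the three styles concrete, here is a minimal sketch of how each prompt might be built. The exact wording, the function names, and the placeholder gene and variant are assumptions for illustration, not the prompts used in the tests. Only the field names Gene and Variant come from the description above.

```python
import json


def verbose_prompt(gene: str, variant: str) -> str:
    # Hypothetical wording: expert persona plus an instruction to go through rules.
    return (
        "You are an expert clinical geneticist. Before answering, list every "
        "clinical rule that could link this variant to disease and apply each one.\n"
        f"Classify the variant {variant} in gene {gene}."
    )


def concise_prompt(gene: str, variant: str) -> str:
    # Hypothetical wording: one direct instruction, then stop.
    return f"Classify the variant {variant} in gene {gene}. Give the class only."


def structured_prompt(gene: str, variant: str) -> str:
    # JSON-like format with explicit fields; "Task" is an assumed extra field.
    return json.dumps({"Task": "classify_variant", "Gene": gene, "Variant": variant}, indent=2)


# Placeholder inputs, not real clinical data.
gene, variant = "GENE_X", "c.123A>G"
for build in (verbose_prompt, concise_prompt, structured_prompt):
    text = build(gene, variant)
    print(f"--- {build.__name__} ({len(text)} characters)")
    print(text)
```

Keeping the three builders side by side makes it easy to send the same variant through every style and compare the answers.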

𝗧𝗵𝗲 𝗥𝗲𝘀𝘂𝗹𝘁𝘀

The Verbose style had the lowest accuracy at 48.1%. The Concise style had the highest accuracy at 81.5%.
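
As a quick sanity check on these percentages, the sketch below searches for the number of correct answers that would produce each figure. It assumes that every style was scored on the same 27 tests, which the post implies but does not state outright. The function names are made up for this example.

```python
TOTAL_TESTS = 27  # from the post; assumed here to be the per-style test count


def accuracy_pct(correct: int, total: int = TOTAL_TESTS) -> float:
    """Accuracy as a percentage, rounded to one decimal place."""
    return round(100 * correct / total, 1)


def counts_matching(pct: float, total: int = TOTAL_TESTS) -> list[int]:
    """All correct-answer counts that round to the reported percentage."""
    return [k for k in range(total + 1) if accuracy_pct(k, total) == pct]


print(counts_matching(48.1))  # [13]
print(counts_matching(81.5))  # [22]
```

Under that assumption, the gap between the two styles is 9 of 27 tests.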

Why did Verbose fail?

When you tell an AI to look for specific disease markers, you bias it. In one test, the AI saw a common benign variant. Because the prompt forced it to look for disease rules, the AI ignored the frequency data. It tried too hard to find a problem that was not there.

The Concise style worked better because it did not force a bias. It allowed the AI to evaluate all data equally.

𝗧𝗵𝗲 𝗧𝗵𝗶𝗻𝗸𝗶𝗻𝗴 𝗧𝗼𝗸𝗲𝗻 𝗧𝗮𝘅

Adding more words does not make the AI think harder.

In my tests, moving from a medium task to a complex task made the prompt 5 times longer. However, the AI's actual reasoning tokens grew only 1.6 times.
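
The arithmetic behind this "tax" can be sketched in a few lines. The function name is an assumption for this example. The two growth factors are the ones reported above.

```python
def reasoning_per_prompt_token(prompt_growth: float, reasoning_growth: float) -> float:
    """How reasoning tokens per prompt token change, relative to the shorter prompt."""
    return reasoning_growth / prompt_growth


ratio = reasoning_per_prompt_token(prompt_growth=5.0, reasoning_growth=1.6)
print(f"{ratio:.2f}")  # 0.32
```

In other words, each prompt token in the longer prompt bought roughly a third of the reasoning that a token in the shorter prompt did.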

If you want better reasoning, do not just write more. Instead, ask for "Step-by-step evaluation" within a structured format.

𝗞𝗲𝘆 𝗧𝗮𝗸𝗲𝗮𝘄𝗮𝘆𝘀

Do not lead with disease rules. Doing so causes the AI to miss benign results.
Short prompts often beat long prompts for accuracy.
Use structured formats for stability in large data batches.
Always run tests multiple times. A single test might just be a lucky cache hit.
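
The last takeaway can be sketched as a small harness that scores the same test cases over several runs and reports the spread. The classifier here is a deterministic stub that stands in for a real model call. Its name, its failure pattern, and the test cases are all hypothetical.

```python
from statistics import mean
from typing import Callable


def make_flaky_stub(truth: dict[str, str], fail_every: int = 4) -> Callable[[str], str]:
    """Stand-in for a model call: answers correctly except on every Nth call."""
    calls = 0

    def classify(variant: str) -> str:
        nonlocal calls
        calls += 1
        if calls % fail_every == 0:
            return "uncertain"  # simulated wrong answer
        return truth[variant]

    return classify


def repeated_accuracy(classify: Callable[[str], str],
                      cases: list[tuple[str, str]], runs: int = 4) -> list[float]:
    """Score the same cases once per run and return each run's accuracy."""
    accuracies = []
    for _ in range(runs):
        correct = sum(1 for variant, expected in cases if classify(variant) == expected)
        accuracies.append(correct / len(cases))
    return accuracies


# Placeholder cases, not real clinical data.
cases = [("variant_a", "benign"), ("variant_b", "pathogenic")]
scores = repeated_accuracy(make_flaky_stub(dict(cases)), cases, runs=4)
print(scores)        # [1.0, 0.5, 1.0, 0.5]
print(mean(scores))  # 0.75
```

Looking only at the first run would suggest a perfect score, while the average over four runs is 0.75.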

Source: https://dev.to/jh5_pulse/wu-ge-shi-yong-yu-yi-liao-chang-yu-de-ti-shi-ci-promptkuang-jia-yu-fan-li-4c66

Optional learning community: https://t.me/GyaanSetuAi
