What you're describing is actually a core difference in static vs. dynamic media — and your Detective Conan example makes it really clear.
Let's break it down.
Manga (static): You control the speed. You can stop, linger, go back, and examine the scene. That means you can notice clues, re-read dialogue, and build theories at your own pace. This turns the story into a puzzle you actively participate in.
Anime/TV (dynamic): The pacing is set by the creators. You can pause, but it's unnatural — it breaks the rhythm. Most viewers just let it play. The clues, if shown at all, appear for a fixed moment and then vanish, so unless you catch them right away, you miss them.
In the manga, the clues are often visually subtle — background details, a single frame, a gesture. Since you choose when to move forward, you can notice them before they're revealed. Even missing a clue is fun because you can go back and verify it later.
In the anime, there's a tendency to simplify for broader audiences:
Manga: Artists choose what to draw and what to leave out. This “editorial vision” means you get only the important parts, often framed for maximum clarity. Your brain fills in the rest. This is why you feel “on topic” — everything shown is intentional.
Anime/TV: Cameras capture (or animators draw) everything in a scene — even irrelevant objects — because the world must feel “complete.” This floods your senses with information, but ironically hides the relevant details in the noise.
You nailed it with your analogy:
When you made your own comic after reading manga, you probably noticed: