Many languages use docstrings rather than doc comments, which certainly makes extracting them easier: you just parse the code. Could you go further and require all comments to be string literals? I'm struggling to think of many downsides.
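
To make the "just parse the code" point concrete, here is a rough sketch in Python using the standard library's `ast` module. The `SOURCE` snippet and the `extract_docstrings` helper are made up for illustration; this is one way to do it, not how any particular documentation tool works.

```python
import ast

# Hypothetical source text to inspect; any valid Python module would do.
SOURCE = '''
"""Module docstring."""

def area(radius):
    """Return the area of a circle."""
    # A regular comment: the parser discards it.
    return 3.14159 * radius ** 2

class Shape:
    """Base class for shapes."""
'''

DOCUMENTABLE = (ast.Module, ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)


def extract_docstrings(source):
    """Map each documented module, class, or function name to its docstring."""
    tree = ast.parse(source)
    docs = {}
    for node in ast.walk(tree):
        if isinstance(node, DOCUMENTABLE):
            doc = ast.get_docstring(node)
            if doc is not None:
                # Module nodes have no name, so use a placeholder key.
                docs[getattr(node, "name", "<module>")] = doc
    return docs


docs = extract_docstrings(SOURCE)
for name, doc in docs.items():
    print(f"{name}: {doc}")
```

Note that the `#` comment inside `area` never reaches the syntax tree, so a tool that wanted ordinary comments as well would need the tokenizer or a custom parser. If every comment were a string literal, the same tree walk could in principle find them all.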