Just found this thread by Ludwig and would like to +1 it.
I am desperately waiting for this new OCR feature. I would very much like to have the option to remove existing text layers (sometimes the layers of given files are wrong and I would like to replace them), but not by turning the file into a mere image file that is much larger than the original. Is there any word on when such a new (but very essential) feature will be implemented?
I have also "desperately" been looking for ways to remove so-called "renderable" text (layers) from PDF files.
For example, I often have 500-page scanned PDFs that are only around 10 MB INCLUDING an OCR text layer, which I'd like to remove for certain reasons. By re-printing the file to PDF, however, I always seem to end up with something that is 5 to 10 times bigger (and that is without the text layer).
So has there been any progress on this (i.e. removing text layers from entire documents)?
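To make the request concrete: in a PDF page's content stream, text is drawn inside text objects delimited by the `BT` and `ET` operators, while the scanned image is drawn separately. So in principle the text layer can be dropped without touching (or re-encoding) the image, which is why the file should not need to grow. Below is only a rough Python sketch of that idea, not how any particular product does it. It assumes the content stream has already been decompressed into bytes, and that no string inside a text object contains a standalone `ET` token; the function name `strip_text_objects` and the sample stream are made up for illustration.

```python
import re

# One text object: a standalone BT operator, then everything up to the
# next standalone ET operator.
# Assumptions (not guaranteed for real files): the stream is already
# decompressed, and no string literal inside the block contains " ET ".
TEXT_OBJECT = re.compile(rb"(?<!\S)BT(?=\s).*?(?<=\s)ET(?!\S)", re.DOTALL)


def strip_text_objects(content: bytes) -> bytes:
    """Return the content stream with every BT ... ET block removed."""
    return TEXT_OBJECT.sub(b"", content)


# Hypothetical page content: a full-page scan image followed by an
# invisible OCR text run (text rendering mode 3 means "invisible").
sample = (
    b"q 612 0 0 792 0 0 cm /Im0 Do Q\n"
    b"BT /F1 12 Tf 3 Tr 72 700 Td (hidden ocr text) Tj ET\n"
)

cleaned = strip_text_objects(sample)
print(cleaned)
```

A real tool would need a proper content-stream parser instead of a regular expression, and would also have to handle text placed inside form XObjects, but the image data itself would stay exactly as it is in the original file.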