The situation is like this. According to the introduction of the python-docx library, it can operate Microsoft Word 2007+ (.docx) files. I use python-docx to parse a docx format document created by ...
I am using python-docx to process Word documents in bulk and it has been reported that python-docx skip some text fragment (see here: echr-od/ECHR-OD_process#152). I am unable so far to trace the root ...