There hasn’t been any work done specifically on BEeS for over a year, so we decided to release the source code. Much of it is incomplete (and could use a code quality cleanup), but there is a packet parser that can read doc and docx files and convert them to formats like HTML and JSON. The success rate depends on which parser is used (structured or unstructured, which differ in how closely a packet must match the expected format), but it can parse 68-78% of the 400+ packets in the test set.
You can get to the repo here. If you have any questions, feel free to message me.
The scariest thing of all is Protobowl