Parser is now stable!

We did it!

The newest version of the parser (the parsing library) works with 100% accuracy across the PDFs: not just Physics, Math, or Computer Science, but every A-level PDF. There is still some room for improvement, which I will work on tomorrow. But I guess I will now gradually start shifting my focus from OpenPastPaper towards the hackathon I am currently attending.

Anyway, it can classify all the questions, along with their sub-questions, in any PDF. Sub-questions matter if we are going to separate them, since each sub-question can belong to a different chapter, which will be interesting to see. I am still skeptical about the reliability of sub-question extraction and about its use case for now, and I may just get rid of it in the future if it proves too time-consuming.

Bug — what is this stupid bug.

The problem involves a dictionary that stores some metadata about the characters. When I append it to an array, every element of the array ends up holding the same copy of the metadata. This is weird and not supposed to happen, and right now I cannot even pinpoint the bug.

Update — The issue has been resolved!
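
For anyone who runs into the same symptom: in Python, a list whose elements all show identical metadata is very often caused by appending the same dictionary object on every iteration, so each element is a reference to one shared dict. The sketch below illustrates that general pattern and one way around it. It is an assumption about this kind of bug, not the actual parser code, and the names `collect_shared`, `collect_copied`, and the `char`/`index` keys are hypothetical.

```python
import copy


def collect_shared(chars):
    """Buggy pattern: one dict is reused, so every list entry aliases it."""
    meta = {}  # hypothetical metadata dict, created once outside the loop
    rows = []
    for index, ch in enumerate(chars):
        meta["char"] = ch
        meta["index"] = index
        rows.append(meta)  # appends a reference to the same dict each time
    return rows


def collect_copied(chars):
    """Fixed pattern: append an independent copy of the dict each time."""
    meta = {}
    rows = []
    for index, ch in enumerate(chars):
        meta["char"] = ch
        meta["index"] = index
        rows.append(copy.deepcopy(meta))  # snapshot of the current values
    return rows


shared = collect_shared("abc")
copied = collect_copied("abc")

print([row["char"] for row in shared])  # ['c', 'c', 'c']
print([row["char"] for row in copied])  # ['a', 'b', 'c']
```

Creating a fresh dictionary inside the loop would work just as well as copying. `copy.deepcopy` is used here only because it also protects nested values, such as a list stored inside the metadata.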

Originally published at on September 1, 2020.



