I have a pretty specific piece of information I need to extract from the proxy statements of public companies. In these statements, companies will disclose their quorum requirement (i.e., the required proportion of shareholders that must be present to validly convene a shareholder meeting). The primary challenge in extracting this piece of information is that companies do not disclose this information in a standardized way. So, companies can disclose it however they want (e.g., “quorum for our meeting requires the majority of outstanding shares to be present”, “our quorum constitutes 50% of outstanding shares”). For the most part, it is disclosed in pretty straightforward way, and the vast majority of firms have a 50% quorum requirement, but some have either a 33% or a 66% requirement. To make things a bit more difficult, companies often mention the word “quorum” when discussing certain proposals that are also discussed in the proxy statement (e.g., “subject to achieving quorum at our meeting, shareholders will be able to vote on the following proposals…”). I am looking for a Python script to extract the quorum requirements from the proxy statements. I need this to be extremely accurate, so utilizing an LLMs sounds like the best option. I already have the proxy statements stored as text files.
Keyword: Python
Price: $300.0
Python Data Scraping Data Extraction Large Language Model