this is currently me getting the base cpp implementation up, for those who have used my python parser, this is has less features (in certain areas) but is much faster ...
LLMs frequently return JSON in unexpected formats. Models without response_format support often wrap JSON in explanatory text or produce malformed syntax. Even models with structured output support ...