Python Regex to Capture Proceeding Text – mixing case insensitivity in group
Question:

Example Link

RegEx group returning issue:

(?P<qa_type>(Q|A|Mr[.|:]? [a-z]+|Mrs[.|:]? [a-z]+|Ms[.|:]? [a-z]+|Miss[.|:]? [a-z]+|Dr[.|:]? [a-z]+))?([.|:|s]+)?

Objective: To extract text from proceeding transcript PDFs for each question/answer/speaker type.

Using Python: iterate through the pages of the PDF's extracted text and group Question/Answer text.

Desired Results = qa_type, page_start, …
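One possible way to mix case sensitivity inside a single group is a scoped inline flag, `(?i:...)`, available in Python 3.6 and later. The sketch below is not the pattern from the question. It assumes that the `Q`/`A` markers and the titles must match case-sensitively, that only the surname may be in any case, and that a speaker label is always followed by `.` or `:`. The names `SPEAKER_RE` and `split_speaker` are hypothetical. Note also that inside a character class `|` is a literal pipe, so the sketch writes `[.:]` rather than `[.|:]`.

```python
import re

# Assumption: "Q", "A" and the titles are case-sensitive; only the surname
# is matched case-insensitively via the scoped flag (?i:...) (Python 3.6+).
# Assumption: a label is always followed by at least one "." or ":".
SPEAKER_RE = re.compile(
    r"^(?P<qa_type>Q|A|(?:Mr|Mrs|Ms|Miss|Dr)[.:]? (?i:[a-z]+))"
    r"[.:]+\s*(?P<text>.*)$"
)


def split_speaker(line):
    """Return (qa_type, text) if the line starts with a speaker label, else None."""
    match = SPEAKER_RE.match(line)
    if match is None:
        return None
    return match.group("qa_type"), match.group("text")


if __name__ == "__main__":
    for sample in ["Q. Where were you?", "Mr. SMITH: Objection.", "A dog barked."]:
        print(repr(sample), "->", split_speaker(sample))
```

Passing `re.IGNORECASE` to the whole pattern would instead let a lowercase `q` or `a` at the start of an ordinary sentence count as a speaker label, which is why the flag is scoped to the surname only in this sketch.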