🔍 Code Extractor

Search Components

Full-Text: Fast keyword matching | Semantic: AI-powered understanding of intent (finds similar concepts)

Search Results for "processing"

Found 26 matching components

  • function test_template_with_data

    Tests a template file by replacing placeholders with test data and validates that all required placeholders have been filled, excluding known conditional placeholders.

    File: /tf/active/vicechatdev/test_comprehensive_templates.py

    template-testing validation placeholder-replacement jinja2 file-processing
  • function clean_text

    Cleans and normalizes text content by removing HTML tags, normalizing whitespace, and stripping markdown formatting elements.

    File: /tf/active/vicechatdev/improved_convert_disclosures_to_table.py

    text-processing text-cleaning normalization html-removal markdown-removal
  • function clean_text_for_xml

    Sanitizes text by removing or replacing XML-incompatible characters to ensure compatibility with Word document XML structure.

    File: /tf/active/vicechatdev/improved_convert_disclosures_to_table.py

    text-processing xml sanitization word-documents character-encoding
  • function extract_warranty_data_improved

    Parses markdown-formatted warranty documentation to extract structured warranty data including IDs, titles, sections, disclosure text, and reference citations.

    File: /tf/active/vicechatdev/improved_convert_disclosures_to_table.py

    markdown-parsing text-extraction warranty-processing document-parsing regex
  • function parse_references_section

    Parses a formatted references section string and extracts structured data including reference numbers, sources, and content previews using regular expressions.

    File: /tf/active/vicechatdev/improved_convert_disclosures_to_table.py

    parsing text-processing references citations regex
  • function create_word_report_improved

    Generates a formatted Microsoft Word document report containing warranty disclosures with table of contents, structured sections, and references.

    File: /tf/active/vicechatdev/improved_convert_disclosures_to_table.py

    document-generation word-processing report-generation docx warranty-management
  • function main_v1

    Orchestrates the conversion of an improved markdown file containing warranty disclosures into multiple tabular formats (CSV, Excel, Word) with timestamp-based file naming.

    File: /tf/active/vicechatdev/improved_convert_disclosures_to_table.py

    file-conversion markdown-processing warranty-data csv-export excel-export
  • class MetadataCatalog

    Helper class for managing FileCloud metadata sets and attributes through a more user-friendly interface on top of the raw API.

    File: /tf/active/vicechatdev/metadata_catalog.py

    class metadatacatalog
  • function validate_and_alternatives

    Validates whether a given keyword is a valid chemical compound, biochemical concept, or drug-related term using GPT-4, and returns alternative names/synonyms if valid.

    File: /tf/active/vicechatdev/offline_parser_docstore.py

    validation chemistry biochemistry drug-research llm
  • class RegulatoryExtractor

    A class for extracting structured metadata from regulatory guideline PDF documents using LLM-based analysis and storing the results in an Excel tracking spreadsheet.

    File: /tf/active/vicechatdev/reg_extractor.py

    pdf-extraction regulatory-documents llm-extraction ocr data-extraction
  • function test_markdown_link_parsing

    Tests markdown link parsing, specifically the extraction and URL encoding of complex URLs containing special characters in Quill editor format.

    File: /tf/active/vicechatdev/test_complex_hyperlink.py

    testing markdown url-parsing regex url-encoding
  • function extract_warranty_data

    Parses markdown-formatted warranty documentation to extract structured warranty information including IDs, titles, sections, source document counts, warranty text, and disclosure content.

    File: /tf/active/vicechatdev/convert_disclosures_to_table.py

    markdown-parsing data-extraction warranty-processing text-processing regex
  • function create_csv_report

    Creates two CSV reports (summary and detailed) from warranty data, writing warranty information to files with different levels of detail.

    File: /tf/active/vicechatdev/convert_disclosures_to_table.py

    csv reporting data-export file-io warranty
  • function main_v2

    Converts a markdown file containing warranty disclosure data into multiple tabular formats (CSV, Excel, Word) with timestamped output files.

    File: /tf/active/vicechatdev/convert_disclosures_to_table.py

    markdown-conversion data-extraction report-generation csv-export excel-export
  • function create_enhanced_word_document

    Converts markdown-formatted warranty disclosure content into a formatted Microsoft Word document with hierarchical headings, styled text, lists, and special formatting for block references.

    File: /tf/active/vicechatdev/improved_word_converter.py

    document-generation markdown-to-word docx warranty-processing legal-documents
  • function main_v3

    Main entry point function that reads a markdown file, converts it to an enhanced Word document with preserved heading structure, and saves it with a timestamped filename.

    File: /tf/active/vicechatdev/improved_word_converter.py

    document-conversion markdown-to-word file-processing docx main-entry-point
  • function clean_text_for_xml_v1

    Sanitizes text strings to ensure XML 1.0 compatibility by removing or replacing invalid control characters and ensuring all characters meet XML specification requirements for Word document generation.

    File: /tf/active/vicechatdev/enhanced_word_converter_fixed.py

    text-processing xml sanitization data-cleaning word-documents
  • function extract_warranty_sections

    Parses markdown content to extract warranty section headers, returning a list of dictionaries containing section IDs and titles for table of contents generation.

    File: /tf/active/vicechatdev/enhanced_word_converter_fixed.py

    markdown-parsing text-processing warranty-documents table-of-contents document-structure
  • function extract_total_references

    Extracts the total count of references from markdown-formatted content by first checking for a header line with the total, then falling back to manually counting reference entries.

    File: /tf/active/vicechatdev/enhanced_word_converter_fixed.py

    markdown parsing text-processing references bibliography
  • function create_enhanced_word_document_v1

    Converts markdown content into a formatted Microsoft Word document with proper styling, table of contents, warranty sections, and reference handling for Project Victoria warranty disclosures.

    File: /tf/active/vicechatdev/enhanced_word_converter_fixed.py

    document-generation word-processing markdown-conversion docx formatting
  • function format_inline_references

    Formats inline citation references (e.g., [1], [2]) in a Word document paragraph by applying italic styling to them while preserving the rest of the text.

    File: /tf/active/vicechatdev/enhanced_word_converter_fixed.py

    document-formatting word-processing python-docx text-formatting citations
  • function main

    Main orchestration function that reads an improved markdown file and converts it to an enhanced Word document with comprehensive formatting, including table of contents, warranty sections, disclosures, and bibliography.

    File: /tf/active/vicechatdev/enhanced_word_converter_fixed.py

    document-generation word-processing markdown-conversion docx file-processing
  • class MyEmbeddingFunction

    A custom embedding function class that generates embeddings for documents using OpenAI's API, with built-in text summarization for long documents and token management.

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG copy.py

    embeddings openai chromadb vector-database text-summarization
  • class SimpleDataHandle

    A data handler class that manages multiple data sources with different types (dataframes, vector stores, databases) and their associated processing configurations.

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG copy.py

    data-management registry vector-store RAG dataframe
  • class OneCo_hybrid_RAG

    A class named OneCo_hybrid_RAG

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG copy.py

    class oneco_hybrid_rag
  • class FixedProjectVictoriaGenerator

    Fixed Project Victoria Disclosure Generator that properly handles all warranty sections.

    File: /tf/active/vicechatdev/fixed_project_victoria_generator.py

    class fixedprojectvictoriagenerator
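
Two of the results above (clean_text_for_xml and clean_text_for_xml_v1) are described as sanitizing text for XML 1.0 compatibility before it is written into a Word document. The sketch below shows one common way to do that. It is not the code from those files: the function name sanitize_for_xml and its replacement parameter are hypothetical, and the only thing taken as given is the set of characters the XML 1.0 specification allows.

```python
import re

# Matches any character outside the XML 1.0 "Char" production, which allows:
#   #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
_INVALID_XML_CHARS = re.compile(
    "[^\u0009\u000A\u000D\u0020-\uD7FF\uE000-\uFFFD\U00010000-\U0010FFFF]"
)


def sanitize_for_xml(text: str, replacement: str = "") -> str:
    """Hypothetical helper: drop (or replace) characters XML 1.0 forbids.

    Tab, newline, and carriage return are kept; other control characters
    such as NUL or vertical tab are removed or swapped for `replacement`.
    """
    return _INVALID_XML_CHARS.sub(replacement, text)


if __name__ == "__main__":
    raw = "Clause 4.1\x00 applies\x0b here.\tSee [1]."
    print(repr(sanitize_for_xml(raw)))
```

Removing the characters is the simplest policy. Passing a visible replacement such as "?" may be preferable when it matters that the reader can see something was stripped.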

Search Examples