Supporting the Text Preprocessing by Means a Web Tool

Ruth Reátegui, Janneth Chicaiza, Edgar Guamo, Henry N. Roa, Daniel Pulla-Sánchez

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    Unstructured data contains important information that can offer insights in a variety of fields, but the lack of a predefined format makes it difficult to analyze, manage, and use effectively. This paper investigates the crucial role that textual data processing plays and presents a web tool that was developed in Python and Django. This tool includes standard procedures for processing data, which include eliminating stop words, lemmatization, stemming, and generating bigrams and trigrams. Additionally, the tool was tested with citizen security-related data taken from X social media.

    Original languageEnglish
    Title of host publicationInformation Technology and Systems, ICITS 2025
    EditorsAlvaro Rocha, Carlos Ferrás, Hiram Calvo
    PublisherSpringer Science and Business Media Deutschland GmbH
    Pages303-312
    Number of pages10
    ISBN (Print)9783031931086
    DOIs
    StatePublished - 2025
    EventInternational Conference on Information Technology and Systems, ICITS 2025 - Mexico City, Mexico
    Duration: Jan 22 2025Jan 25 2025

    Publication series

    NameLecture Notes in Networks and Systems
    Volume1447 LNNS
    ISSN (Print)2367-3370
    ISSN (Electronic)2367-3389

    Conference

    ConferenceInternational Conference on Information Technology and Systems, ICITS 2025
    Country/TerritoryMexico
    CityMexico City
    Period01/22/2501/25/25

    Keywords

    • artificial intelligence
    • citizen security
    • natural language processing
    • text preprocessing

    Fingerprint

    Dive into the research topics of 'Supporting the Text Preprocessing by Means a Web Tool'. Together they form a unique fingerprint.

    Cite this