Dr Vemula Dinesh Reddy, Assistant Professor, Department of Computer Science and Engineering, has been granted a patent for his invention “A System And A Method for Fog-Based Animal Intrusion Detection” with the Application No: 202341026013, in the Indian Patent Official Journal. The invention acts as a groundbreaking fog computing-based system designed for real-time detection of animal intrusions in sensitive areas using smart sensors for instant alerts.
Abstract
This research introduces an intelligent system using fog computing to detect animal intrusions in sensitive or protected zones such as farmlands, highways, and forest borders. The system enables real-time data processing closer to the site of intrusion, offering faster detection and reduced dependency on centralised cloud systems. Furthermore, we proposed the Quantum-Inspired optimisation technique called Quantum Evolutionary Algorithm.
Practical Implementation/ Social Implications of the Research
Through this invention, we can:
Future Research Plans
The digital age is flooded with multimedia content ranging from articles and podcasts to videos and images, spanning multiple languages. The challenge isn’t just accessing information but understanding and summarising it efficiently. Addressing this need, a pioneering patent titled “A System and Method for Multimodal Multilingual Input Summarization Using Quantum Motivated Processors” (Application Number 202341005519) has been granted to Dr Ashu Abdul, Assistant Professor in the Department of Computer Science and Engineering, and Mr Phanidra Kumar S, PhD Scholar, as published in the Indian Patent Office Journal. This innovative system converts all kinds of media like text, images, audio, and video into descriptive text, then leverages quantum-inspired algorithms to extract and stitch together the most relevant sentences and visuals, thereby crafting a perfect summary.
Abstract
This research details a system and method for summarizing multimodal and multilingual input data by leveraging quantum-motivated processors. The system is designed to handle input documents comprising text, audio, image, and video data, potentially in multiple languages. A pre-processing engine extracts textual descriptions from all these modalities (using deep learning, CNN, VAF, Python), merging them into a unified text corpus. A quantum enabler module assigns initial probabilities and encodes sentences from this corpus into binary states (0s or 1s), reflecting a quantum measurement concept (using randint). A selection module, utilizing an objective/fitness function incorporating factors like term frequency, sentence length, pronoun presence, coverage (QCSS-based similarity), and title relevance (Sentence-to-Title QCSS), calculates a fitness score for each encoded sentence and shortlists relevant ones using a “radiant function”. This module also handles duplicate removal based on QCSS. A rearrangement module scores and reorders the shortlisted sentences based on metadata (like publishing date) and scores (like ROUGE). A summary generation module produces a textual summary. Concurrently, an image selector engine selects a relevant image from the input data, primarily based on the image’s textual description and the generated summary, often using QCSS (Quantum Cosine Similarity Score). Finally, an output engine merges the textual summary with the selected image to create a multioutput (MO) summary. The system may also include a machine translation engine to translate non-English extracted descriptions into English before summarization, if needed. The approach employs quantum measurement and adaptive quantum rotation gates within an evolutionary framework (suggesting a Quantum Genetic Algorithm approach, referred to as MSQMGA) to find optimal summary sentences, demonstrating improved performance and efficiency compared to traditional Genetic Algorithms.
Practical Implementation of the Research
The system’s design outlines a modular architecture suitable for software or hardware implementation, involving distinct processing engines (Input, Pre-processing, Quantum Enabler, Selection, Rearrangement, Summary Generation, Image Selector, Output, and potentially Machine Translation). Key technical details include:
These specifics suggest practical implementation could involve developing software modules that utilize libraries for deep learning (e.g., TensorFlow, PyTorch with CNN, LSTM components), natural language processing (e.g., NLTK, spacy for tokenization, POS, lemmatization), and potentially frameworks for simulating or interfacing with quantum-inspired algorithms. The “real-time applications” aspect implies design considerations for efficiency and processing speed. Potential deployments include news aggregation platforms, content management systems, competitive intelligence dashboards, cross-cultural communication tools, or applications for analysing vast archives of mixed-media data.
Social Impact
Beyond basic information access, this technology has the potential to foster greater understanding and reduce bias by providing summarized content across linguistic and cultural divides. It could empower individuals and organisations to consume and analyse global information landscapes more effectively. For educators, it could facilitate the creation of multimodal learning materials from diverse sources. For researchers, it could accelerate literature review across different fields and languages. However, it also raises potential implications related to the source and neutrality of the summarisation models themselves – whose perspective is encoded, and how might summaries differ based on training data or algorithmic biases? Ethical considerations around information representation and potential manipulation of summaries would be important as such technologies become more widely adopted.
Future Research Plans
Although the patent doesn’t explicitly list a roadmap, the detailed description and stated advantages imply several potential future research directions and refinements based on the current work: