Recent News

School of Engineering and Sciences

Department of Computer Science and Engineering

Recent News

Enhancing Visual Saliency in Group Photographs: A Novel Approach for Improved Security and Healthcare Applications September 5, 2024

Dr Ravi Kant Kumar, an Assistant Professor at the Department of Computer Science and Engineering, and his research scholar, Ms Gayatri Dhara, have come up with a patent titled “A System and Method for Enhancement Of Visual Saliency Of Intended Face In Group Photography.” The patent, with Application Number 202441040020, employs pathbreaking technology to enhance security and healthcare applications, with real-time face recognition and remote diagnostics.

Abstract:

Visual saliency is a way of figuring out which parts of a scene draw our attention the most. When looking at a crowd or a group of faces, our eyes naturally focus more on certain faces than others. This happens because some faces have dominant features that stand out more. For faces that don’t naturally catch our attention, there is a need to make them more noticeable. This new method and system are designed to do just that. The system calculates scores based on various factors like skin tone, colour, contrast, position, and other visual details. These scores help identify which face needs enhancement, making it more prominent in a group of faces. The primary advantage of this invention is its potential to improve user experience in various applications, such as photo editing, social media, security systems, and more. By giving users, the control to select and enhance a specific face, it allows for a more personalised and targeted approach to face recognition and enhancement. This could be particularly beneficial in scenarios where the user wants to highlight a specific individual in a group photo or in a crowd. Overall, this invention represents a significant advancement in the field of face recognition and image enhancement, offering a novel and user-centric approach to visual saliency. It opens up new possibilities for user interaction and control in image editing and face recognition technology.

Practical and Social Implications

The practical implementation of this research lies in its ability to identify and enhance faces within a group or crowd that do not naturally draw attention. This innovative method and system address this issue by calculating saliency scores based on factors such as skin tone, colour, contrast, position, and other visual details. These scores are then used to identify faces that need enhancement to become more prominent in a group. The system’s ability to enhance specific faces has significant practical applications in several fields.

Photo Editing: Users can easily enhance specific individuals in group photos, ensuring that everyone stands out as desired. This is particularly useful for personal photos, event photography, and professional photo editing.

Social media: Enhanced face recognition and saliency can improve user experience by allowing users to highlight specific people in their posts, making photos more engaging and personalised.

Security Systems: In surveillance and security applications, the ability to enhance less prominent faces can improve the accuracy of face recognition systems, aiding in the identification of individuals in crowded or low-visibility conditions.

Collaborations:

SRM University-AP,
Dr Ravi Kant Kumar,
Mrs Gayathri Dhara.

Future Research Plans:

Future plans for this visual saliency-based face enhancement system include refining algorithms for greater accuracy and efficiency, and integrating with popular photo editing software and social media platforms for seamless user experience. The technology will be expanded into security and healthcare applications, enhancing real-time face recognition and remote diagnostics. Emphasis will be placed on reducing biases, ensuring privacy protection, and enabling user customisation. Collaborations with academic institutions will drive further research, while commercialisation efforts will focus on launching products globally.
Continue reading →
Dr Mahesh Kumar Secures Patent for Advanced Data Generation Method August 28, 2024
In a significant achievement for the Department of Computer Science and Engineering, Dr Mahesh Kumar Morampudi, Assistant Professor, along with B.Tech. student Ms. Nunna Lakshmi Manasa, has been granted a patent for their groundbreaking invention titled “System and method for generating synthetic data based on variational autoencoder.” The patent, with Application Number: 202241049545, was officially recognised in the Indian Patent Office.

This innovative system leverages the capabilities of variational autoencoders to generate synthetic data, which has vast applications in various fields, including machine learning, data privacy, and simulation. The ability to create high-quality synthetic datasets can significantly enhance research and development processes, providing researchers and practitioners with valuable tools for analysis and experimentation.

The recognition of this patent not only highlights the innovative spirit within the department but also underscores the collaborative efforts between faculty and students in advancing technology and contributing to the field of computer science.

Abstract of the Research

Diabetic retinopathy (DR) is a diabetes-related eye condition that occurs when high blood sugar levels cause damage to the blood vessels in the retina, the light-sensitive tissue at the back of the eye. Over time, these damaged vessels can leak blood or other fluids, leading to vision impairment.

DR typically progresses through stages, starting with mild non-proliferative retinopathy, where small bulges form in the blood vessels, to proliferative retinopathy, the most severe stage, where new abnormal blood vessels grow on the retina and in the vitreous humor, potentially leading to blindness. Early detection and management are crucial to prevent significant vision loss, often involving regular eye exams, blood sugar control, and treatments like laser therapy or surgery.

Synthetic data generation for DR is an emerging approach to augment limited clinical datasets, enhancing the training of machine learning models for diagnosis and prognosis. The present disclosure envisages a system for generating synthetic data based on a variational autoencoder (VAE). This work explores the use of a VAE combined with deep learning for the detection and classification of DR. VAEs, known for their ability to learn compact and meaningful representations of complex data, are employed to generate latent features from retinal images, effectively capturing the subtle variations and anomalies indicative of DR. These latent features are then fed into a deep learning classifier, which is trained to categorise the severity of DR into various stages, ranging from no DR to proliferative DR.

Research in Layperson’s Terms

Imagine your eye is like a camera, and the retina at the back of your eye is the film that captures the pictures you see. DR is a condition that affects this “film” when someone has diabetes for a long time. High blood sugar levels can damage the tiny blood vessels in the retina, leading to vision problems and, in severe cases, blindness.

Our research is about creating a computer program that can help doctors detect and classify this eye condition more accurately. We use a special kind of technology called a VAE, which is like a smart artist that learns to understand and recreate detailed images of the retina. This “artist” can pick up on the tiny changes and patterns in the retina that might be missed by the human eye. Once the VAE has learned these details, it passes them on to another program, which is really good at sorting things into categories. This second program, a deep learning classifier, uses the information from the VAE to decide how severe diabetic retinopathy is—whether it’s mild, moderate, or severe.

By combining these two technologies, our system can help doctors detect diabetic retinopathy earlier and more accurately, which is crucial for preventing vision loss in people with diabetes.

Title of the Patent in the Citation Format
Inventor Name(s):
Dr Mahesh Kumar Morampudi
Nunna Lakshmi Manasa
“System and method for generating synthetic data based on variational auto encoder” with Application Number: ” 202241049545.” Date of Patent Grant. 29/07/2024

Practical Implementation:
The practical implementation of our research involves integrating the VAE and deep learning classifier into a software tool that can be used by eye care professionals. Here’s how it might work in a real-world setting:
1. Data Collection: Retinal images are collected from patients during routine eye exams. These images are fed into the system, which has been trained using a large dataset of retinal images, including those showing different stages of DR.
2. Feature Extraction: The VAE processes these images, learning to capture and condense the important features that indicate the presence and severity of DR. This step is crucial because it allows the system to focus on the subtle details that might signify early stages of the disease.
3. Classification: The deep learning classifier then takes these features and classifies the severity of DR. It provides a diagnosis, categorising the condition as no DR, mild, moderate, or severe. The system could also flag cases that need urgent attention, helping prioritise patient care.
4. Clinical Decision Support: The results are presented to the healthcare provider through an intuitive interface. This tool could be used in clinics, especially in areas with limited access to specialised eye care, allowing general practitioners or technicians to screen for DR and refer patients for further treatment if needed.
Social Implications:
1. Increased Access to Early Detection: By automating the detection of diabetic retinopathy, this technology can be deployed in remote or underserved areas where access to specialists is limited. Early detection is key to preventing vision loss, and this tool can make screening more accessible, especially in communities with high rates of diabetes.
2. Reduced Healthcare Costs: Early detection and treatment of diabetic retinopathy can prevent the progression to more severe stages, reducing the need for expensive treatments like surgery or long-term care for blindness. This could lower healthcare costs for both patients and the healthcare system.
3. Empowering Healthcare Providers: The tool can support general healthcare providers in making more accurate diagnoses, reducing the burden on specialists and allowing them to focus on more complex cases. This democratizes eye care, making it possible for more people to get the care they need without long delays.
4. Improved Patient Outcomes: With more accurate and timely diagnosis, patients can receive treatment sooner, leading to better health outcomes and preserving vision. This can significantly enhance the quality of life for individuals with diabetes, who might otherwise suffer from preventable blindness.
5. Ethical Considerations: While technology offers many benefits, it also raises ethical questions about the use of AI in healthcare, such as ensuring that the system is fair, transparent, and does not introduce biases. Continuous monitoring and updates would be necessary to ensure the system remains accurate and equitable.
Future Research Plans
1. Improving Model Robustness and Generalization: Enhance the VAE and deep learning classifier’s ability to generalize across diverse populations and imaging conditions.
2. Incorporating Multimodal Data: Integrate additional data sources, such as patient medical history, blood sugar levels, and genetic information, to improve diagnostic accuracy.
3. Real-Time Implementation and Mobile Integration: Adapt the system for real-time analysis and deploy it on mobile devices.
4. Collaboration with Clinical Trials: Validate the AI system’s effectiveness through collaboration with clinical trials.
Continue reading →

...