VistronAI: Unlocking the Future of AI-Powered Multimedia Interaction

Sonali Bhanudas Mali; Akanksha Ramesh Solankurkar

doi:10.32628/IJSRSET25122200

Authors

Sonali Bhanudas Mali Department of CSE (AIML), D.Y Patil College of Engineering & Technology, Kolhapur, Maharashtra, India Author
Akanksha Ramesh Solankurkar Department of CSE (AIML), D.Y Patil College of Engineering & Technology, Kolhapur, Maharashtra, India Author

DOI:

https://doi.org/10.32628/IJSRSET25122200

Keywords:

Large Language Models, Content Analysis, Workflow Automation, Multi-modal Data Processing, Information Retrieval

Abstract

The integration of Large Language Models (LLMs) into various applications occurs because of increasing demands for intelligent automation and real-time decision-making. LLMs power extensive business potential which enables operational improvements along with workflow streamlining and productivity improvements for various industries. VistronAI aims to develop a suite of six LLM-based applications designed to tackle challenges in areas such as data analysis, content generation, and automation. VistronAI addresses these challenges by integrating applications for audio, video, document, URL analysis, OCR for images, and real-time Q&A. By using advanced LLM technologies, the suite combines natural language processing and machine learning techniques to solve complex real-world problems. The combination of improved information retrieval with workflow automation position VistronAI as a robust solution for AI-driven content analysis, multi-modal data processing across diverse use cases.

Downloads

Download data is not yet available.

References

Kromidha, Endrit, and Robert M. Davison. "Generative AI-augmented decision-making for business information systems." In IFIP International Conference on Human Choice and Computers, pp. 46-55. Cham: Springer Nature Switzerland, 2024.

Al Naqbi, Humaid, Zied Bahroun, and Vian Ahmed. "Enhancing work productivity through generative artificial intelligence: A comprehensive literature review." Sustainability 16, no. 3 (2024): 1166.

Gan, Wensheng, Zhenlian Qi, Jiayang Wu, and Jerry Chun-Wei Lin. "Large language models in education: Vision and opportunities." In 2023 IEEE international conference on big data (BigData), pp. 4776-4785. IEEE, 2023.

Shahriar, Sakib, Sonal Allana, Seyed Mehdi Hazratifard, and Rozita Dara. "A survey of privacy risks and mitigation strategies in the artificial intelligence life cycle." IEEE Access 11 (2023): 61829-61854.

Wang, Qi, Jindong Li, Shiqi Wang, Qianli Xing, Runliang Niu, He Kong, Rui Li, Guodong Long, Yi Chang, and Chengqi Zhang. "Towards next-generation llm-based recommender systems: A survey and beyond." arXiv preprint arXiv:2410.19744 (2024).

Kumar, Vimal, Priyam Srivastava, Ashay Dwivedi, Ishan Budhiraja, Debjani Ghosh, Vikas Goyal, and Ruchika Arora. "Large-language-models (llm)-based ai chatbots: Architecture, in-depth analysis and their performance evaluation." In International Conference on Recent Trends in Image Processing and Pattern Recognition, pp. 237-249. Cham: Springer Nature Switzerland, 2023.

Dam, Sumit Kumar, Choong Seon Hong, Yu Qiao, and Chaoning Zhang. "A complete survey on llm-based ai chatbots." arXiv preprint arXiv:2406.16937 (2024).

Lee, Taehong, Gayeon Kim, Hyungjin Ahn, Jaejeon Jeong, Mingyu Jeong, and Jiu Song. "Integrating OCR and LLMs for Enhanced Document Digitization in ERP Systems." In 2024 15th International Conference on Information and Communication Technology Convergence (ICTC), pp. 1650-1653. IEEE, 2024.

Song, Enxin, Wenhao Chai, Tian Ye, Jenq-Neng Hwang, Xi Li, and Gaoang Wang. "Moviechat+: Question-aware sparse memory for long video question answering." arXiv preprint arXiv:2404.17176 (2024).

Pontes, Felipe Arruda, Michael Schukat, and Edward Curry. "Real-Time Context-Aware Early Filtering for High-Definition Video Analytics on Commodity Edge Devices using GenAI for Data Augmentation." IEEE Access (2024).

Li, Dongting, Chenchong Tang, and Han Liu. "Audio-LLM: Activating the Capabilities of Large Language Models to Comprehend Audio Data." In International Symposium on Neural Networks, pp. 133-142. Singapore: Springer Nature Singapore, 2024.

Chu, Peng, Jiang Wang, and Andre Abrantes. "LLM-AD: Large language model based audio description system." arXiv preprint arXiv:2405.00983 (2024).

Sava, D. "Text-based classification of websites using self-hosted Large Language Models: An accuracy and efficiency analysis." Bachelor's thesis, University of Twente, 2024.

Tin, Ting Tin, Seow Yu Xuan, Wong Man Ee, Lee Kuok Tiung, and Ali Aitizaz. "Interactive ChatBot for PDF Content Conversation Using an LLM Language Model LLM-Based PDF ChatBot." International Journal of Advanced Computer Science & Applications 15, no. 9 (2024).

Jindal, Chirag, Satyam Gupta, Jyoti Mehra, Tushar Sharma, and Pulkit Aggrawal. "An AI-Driven PDF Query System Leveraging OpenAI LLM and LangChain for Enhanced Data Retrieval." International Journal of Informatics and Applied Mathematics 7, no. 2: 16-28.

Chkirbene, Zina, Ridha Hamila, Ala Gouissem, and Unal Devrim. "Large Language Models (LLM) in Industry: A Survey of Applications, Challenges, and Trends." In 2024 IEEE 21st International Conference on Smart Communities: Improving Quality of Life using AI, Robotics and IoT (HONET), pp. 229-234. IEEE, 2024.

Pasupuleti, Rajesh, Ravi Vadapalli, Christopher Mader, and Norris Timothy. "Popular LLM-Large Language Models in Enterprise Applications." In 2024 2nd International Conference on Foundation and Large Language Models (FLLM), pp. 125-131. IEEE, 2024.

Chew, Robert, John Bollenbacher, Michael Wenger, Jessica Speer, and Annice Kim. "LLM-assisted content analysis: Using large language models to support deductive coding." arXiv preprint arXiv:2306.14924 (2023).

Nejjar, Mohamed, Luca Zacharias, Fabian Stiehle, and Ingo Weber. "Llms for science: Usage for code generation and data analysis." Journal of Software: Evolution and Process 37, no. 1 (2025): e2723.

Zahran, Raghda, Jianfei Xu, Huizhi Liang, and Matthew Forshaw. "Data Science Students Perspectives on Learning Analytics: An Application of Human-Led and LLM Content Analysis." arXiv preprint arXiv:2502.10409 (2025).

VistronAI: Unlocking the Future of AI-Powered Multimedia Interaction

Authors

DOI:

Keywords:

Abstract

Downloads

References

Downloads

Published

Issue

Section

License

How to Cite

Similar Articles

IssueDate

RightSideBlock

Latest publications

Similar Articles

Modernizing Agricultural Infrastructure with Machine Learning-Based Remote Sensing Techniques

LLM for Retail Business (Optimizing Clothing Sales with AI)

Detecting Hate Speech in Tweets with Advanced Machine Learning Techniques

Strategic Integration of LangChain, Hugging Face Transformers, and OpenAI for Document Intelligence Systems

Extraction of Text Summarization

Sensor Performance Test of Laboratory Type Silage Production, Data Acquisition and Control System; Module-B

Web-Based Data Management System

Adapting YOLO11 for Classification of Unlabelled Data : A Semi-Supervised Approach

Advances in SLA Monitoring, Root Cause Analysis, and Vendor Compliance in Next-Generation Networks

A Conceptual Framework for AI-Enhanced Investment Decision-Making in Venture Capital: Unlocking Opportunities in Emerging Markets