Portrait of Md Motaleb Hossen Manik
Ph.D. Student · Computer Science · RPI
Multimodal AI · Medical Imaging · Trustworthy Systems

Md Motaleb Hossen Manik

I am currently a Ph.D. student in Computer Science at Rensselaer Polytechnic Institute, where my research centers on multimodal AI for safety-critical domains. My work focuses particularly on medical imaging education, workflow auditability, and synthetic agent societies, combining benchmark design, interactive system development, and risk-aware evaluation with emphasis on reproducibility, provenance, and accountability.

Ph.D. student in Computer Science at Rensselaer Polytechnic Institute, advised by Prof. Ge Wang, and Assistant Professor on study leave from KUET.

Current profile
Research directions Multimodal AI, medical imaging education, workflow auditability, synthetic agent societies, provenance, and trustworthy AI
Current institution Rensselaer Polytechnic Institute · Wang-AXIS Lab
Current citation profile 462+ Google Scholar citations · h-index 8
Professional direction Interested in faculty, research scientist, and collaboration opportunities in large language models, multimodal AI, medical imaging, and agentic systems
Research

Research Overview

My research centers on multimodal AI for safety-critical domains, especially medical imaging education, workflow auditability, and synthetic agent societies. Across these themes, I focus on structured, verifiable, and human-centered AI.

Multimodal AI, LLMs & Medical Imaging Education

2019 — Present
Ph.D. at RPI; earlier ML/NLP foundation

My recent research focuses on multimodal AI systems for medical imaging education, benchmark design, and interactive learning. I built MEDI-SLATE, a slide-lecture aligned dataset containing 1,117 high-resolution slides and 262,182 refined narration tokens from a full 23-lecture undergraduate medical imaging course.

I designed MILU, a benchmark for structured lecture understanding across four open-source VLMs, generating 15,000+ JSON artifacts in 9–11 GPU-hours on 4 × NVIDIA RTX A5000 GPUs. Across 1,117 slides, parsing coverage remained 92–99%, while semantic agreement stayed low, with pairwise concept Jaccard of 0.03–0.09 and triple-level F1 of 0.001–0.033.

I also contributed to ALIVE, a fully local avatar-lecture interaction engine integrating ASR, FAISS-based retrieval, local LLM reasoning, text-to-speech, and talking-head synthesis for real-time lecture-grounded interaction. Earlier work in ML and NLP included Bangla and phonetic Bangla text analysis, where a manually annotated dataset of 1,500 reviews achieved 75.58% accuracy with SVM, and later work on unified sentiment and emotion recognition.

AI Safety & Synthetic Agent Systems

2025 — Present
Ph.D., RPI

My current safety-oriented work studies risky instruction propagation, social regulation, and decentralized governance in synthetic AI societies. I developed OpenClaw on Moltbook, an agent-only social environment for analyzing emergent behavior among autonomous agents.

In an empirical study of 39,026 posts and 5,712 comments from 14,490 agents, I found that 18.4% of posts contained action-inducing language and that such posts were more likely to elicit norm-enforcing responses, while toxic responses remained rare.

I also developed ADAPT, an AI-driven decentralized publishing framework that models scholarly publishing as a closed-loop governance system with bounded policy adaptation under overload, disagreement, and collusion-related stress. This work led to U.S. Provisional Patent Application No. 63/975,609.

Blockchain Provenance, Secure Retrieval & Applied AI

2020 — Present
M.Sc. at KUET + continuing work

My M.Sc. and related work focus on blockchain-based trust, provenance, and secure information systems. I completed my M.Sc. thesis on a blockchain-based secure framework for user-centric multi-party skyline queries, introducing multi-party ElGamal, re-encryption and shuffling, targeted queries, and blockchain-based integrity with distinct blocks for each party.

I built SlideChain, a blockchain-backed semantic provenance framework for educational AI, using four VLMs over 1,117 lecture slides and achieving approximately one-slide-per-second registration throughput, 100% tamper detection, and deterministic reproducibility with Jaccard = 1.0.

I also developed secure data-sharing and applied AI systems including ShaEr, a privacy-preserving medical data sharing and monetisation framework, and contributed to a blockchain-aided heart disease detection system integrating seven datasets and achieving 89.2% accuracy, with 85.3% precision, 97.0% recall, and 90.8% F1 using a voting ensemble with private blockchain support.

Academic Profile

Education

Ph.D. in Computer Science
Aug 2024 – Present
Rensselaer Polytechnic Institute, Troy, NY
Advisor: Prof. Ge Wang · Wang-AXIS Lab
M.Sc. Eng. in Computer Science and Engineering
Sep 2020 – Mar 2023
Khulna University of Engineering & Technology
CGPA 4.00 / 4.00 · Thesis on blockchain-based secure multi-party skyline queries
B.Sc. Eng. in Computer Science and Engineering
Nov 2015 – Mar 2020
Khulna University of Engineering & Technology
CGPA 3.93 / 4.00 · Ranked 1st out of 134 graduates
Teaching

Teaching & Academic Positions

Teaching Assistant
Aug 2024 – Present
Department of Computer Science, RPI
Courses include Principles of Software, Programming Languages, Software Design and Documentation, and RCOS.
Assistant Professor (On Study Leave)
Jun 2024 – Present
Department of Computer Science and Engineering, KUET
On study leave while pursuing doctoral research at RPI.
Lecturer
Feb 2022 – May 2024
Department of Computer Science and Engineering, KUET
Taught microprocessors, robotics lab, mobile computing, technical writing, and digital system design.
Record

Publications

Complete publication record across journals, conferences, preprints, and manuscripts under review.

2026

MEDI-SLATE: A slide-lecture aligned text ensemble for medical imaging education

Manik, M. M. H., Islam, M. Z., and Wang, G. Visual Computing for Industry, Biomedicine, and Art, in press.

Journal
2026

MILU: A Consensus Ensemble Benchmark for Multimodal Medical Imaging Lecture Understanding

Manik, M. M. H., Islam, M. Z., and Wang, G. Journal of Medical Imaging, 13(6):062202.

Journal
2026

OpenClaw Agents on Moltbook: Risky Instruction Sharing and Norm Enforcement in an Agent-Only Social Network

Manik, M. M. H., and Wang, G. arXiv:2602.02625.

Preprint
Review

A Secure Framework for User-centric Multiparty Skyline Queries via Pruned and Prioritized Datasets

Manik, M. M. H., Alam, K. M. R., and Morimoto, Y. Under review.

Under Review
Review

Blockchain-Enabled Secure Land Record Management System with a New Lightweight Cryptosystem Based on Hybrid Chaotic Map

Habib, M. A., and Manik, M. M. H. First revision in process.

Under Review
Review

ADAPT: AI-Driven Decentralized Adaptive Publishing Transformer

Manik, M. M. H., and Wang, G. Under review.

Under Review
Review

Emergent Decentralized Regulation in a Purely Synthetic Society

Manik, M. M. H., and Wang, G. Under review.

Under Review
Review

Evaluating Risk and Auditability in Workflow Agents for Safety-Critical Domains

Manik, M. M. H., and Wang, G. Under review.

Under Review
2025

Development of an Optically Emulated CT (OECT) Scanner for College Education

Manik, M. M. H., Muldowney, W., Islam, M. Z., and Wang, G. Visual Computing for Industry, Biomedicine, and Art.

Journal
2025

Blockchain-aided comparative study of heart disease detection using machine learning-based approaches with an expanded dataset

Ahmed, M. R., Aziz, A., Manik, M. M. H., and Habib, M. A. Computers in Biology and Medicine.

Journal
2025

ShaEr: A Blockchain-Based Framework for Secure Medical Data Sharing and Monetisation

Habib, M. A., and Manik, M. M. H. IET Blockchain.

Journal
2025

Unifying Sentiment Analysis and Emotion Recognition for Bangla Text: A Hybrid Approach

Manik, M. M. H., Sagor, A. H., Mondal, F. A., Touhid, M. M., and Islam, M. Z. ECCE 2025.

Conference
2025

SlideChain: Semantic Provenance for Lecture Understanding via Blockchain Registration

Manik, M. M. H., Islam, M. Z., and Wang, G. arXiv:2512.21684.

Preprint
2025

ALIVE: An Avatar-Lecture Interactive Video Engine with Content-Aware Retrieval for Real-Time Interaction

Islam, M. Z., Manik, M. M. H., and Wang, G. arXiv:2512.20858.

Preprint
2025

N-ReLU: Zero-Mean Stochastic Extension of ReLU

Manik, M. M. H., Islam, M. Z., and Wang, G. arXiv:2511.07559.

Preprint
2025

ChatGPT vs. DeepSeek: A Comparative Study on AI-Based Code Generation

Manik, M. M. H. arXiv:2502.18467.

Preprint
2024

An Android application for healthcare management system with ML-driven solution

Atik, A. I., Rahman, A., Sami, S. A., and Manik, M. M. H. ICCIT 2024.

Conference
2024

Enhancing Hyperledger Fabric: A scalable framework for optimized blockchain performance

Saha, A., Majumder, S., Manik, M. M. H., and Hashem, M. A. ICCIT 2024.

Conference
2024

Decentralized GDPR compliance: A blockchain framework for personal data management

Islam, M. R., Alam, K. M. R., and Manik, M. M. H. ICCIT 2024.

Conference
2024

Analyzing the Dynamics of COVID-19 Lockdown Success: Insights from Regional Data and Public Health Measures

Manik, M. M. H., Habib, M. A., Islam, M. Z., Ahmed, T., and Haque, F. arXiv:2402.18594.

Preprint
2024

Question-Answering System for Bangla: Fine-tuning BERT-Bangla for a Closed Domain

Roy, S. C., and Manik, M. M. H. arXiv:2410.03923.

Preprint
2023

Enhancing Robustness of Machine Learning Algorithms for Bangla Text Classification: A Defensive Approach against Adversarial Attacks

Manik, M. M. H., Mahadi, J., Touhid, M. M., and Alam, K. M. R. EICT 2023.

Conference
2023

Redefining Crime Record Storage: An Advanced Architecture Harnessing the Power of Blockchain Technology

Manik, M. M. H., Sagor, A. H., Habib, M. A., Touhid, M. M., Ahmed, T., Islam, M. Z., and Haque, F. ICCIT 2023.

Conference
2023

A Blockchain Based Scalable Framework for Academic Document Verification

Majumder, S., Zaha, R., Manik, M. M. H., and Alam, K. M. R. ICCIT 2023.

Conference
2023

A Blockchain-based Technique to Prevent Grade Tampering: A University Perspective

Habib, M. A., Manik, M. M. H., and Zaman, S. ECCE 2023.

Conference
2023

A Technique to Avoid Blockchain Denial of Service (BDoS) and Selfish Mining Attack

Habib, M. A., and Manik, M. M. H. BCCA 2023.

Conference
2023

A Novel Approach in Determining Areas to Lockdown During a Pandemic: COVID-19 as a Case Study

Manik, M. M. H. International Journal of Information Engineering and Electronic Business, 15(2), 30.

Journal
2022

A Hybrid Framework for Sentiment Analysis from Bangla Texts

Manik, M. M. H., Haque, F., Hashem, M. M. A., Habib, M. A., Islam, M. Z., and Ahmed, T. ICCIT 2022.

Conference
2022

A Blockchain Based Secure Framework for User-centric Multi-party Skyline Queries

Manik, M. M. H., Alam, K. M. R., and Morimoto, Y. ICCIT 2022.

Conference
2022

Machine Learning Algorithms on COVID-19 Prediction Using CpG Island and AT-CG Feature on Human Genomic Data

Manik, M. M. H., Habib, M. A., and Ahmed, T. International Conference on Machine Intelligence and Emerging Technologies.

Conference
2022

Classification of DNA Sequence Using Machine Learning Techniques

Habib, M. A., Manik, M. M. H., and Khulna, B. EasyChair preprint.

Preprint
2021

Machine learning approaches for tackling novel coronavirus (COVID-19) pandemic

Rahman, M. M., Islam, M., Manik, M. M. H., Islam, M. R., and Al-Rakhami, M. S. SN Computer Science, 2, 1–10.

Journal
2020

An automated system to limit COVID-19 using facial mask detection in smart city network

Rahman, M. M., Manik, M. M. H., Islam, M. M., Mahmud, S., and Kim, J. H. IEMTRONICS 2020.

Conference
2019

Opinion Mining from Bangla and Phonetic Bangla Reviews Using Vectorization Methods

Haque, F., Manik, M. M. H., and Hashem, M. M. A. EICT 2019.

Conference
Recognition

Honors, Funding & Patent

Honors & Awards

  • Best Paper Award, ECCE 2023
  • Dean’s Award for three consecutive years at KUET
  • Merit Scholarship during undergraduate study

Grant Support

Graduate research collaborator on an NVIDIA Academic Grant Program project led by Prof. Ge Wang, focused on Iterative Avatar Teaching in the NVIDIA Omniverse.

Patent

Co-inventor on the U.S. provisional patent for ADAPT: AI-Driven Decentralized Adaptive Publishing Testbed.

Service

Professional Service & Mentorship

Journal Reviewer Technology in Society, Elsevier · Journal of Systems and Software, Elsevier
Conference Reviewer IEEE COMPAS 2025 · EICT 2025 · ICCAconf 2024 · EICT 2023
Mentorship & Supervision Supervised 3 undergraduate B.Sc. thesis groups and 2 undergraduate project groups at KUET across machine learning, blockchain systems, and LLM-related topics.
Invited Talk “All About AI: Avatarized Online Course,” ECSE Best Practices Series, Department of Electrical, Computer and Systems Engineering, RPI, August 7, 2025.
Presentation Context Invited by Prof. Shayla Sawyer. Presented the ALIVE avatarized lecture system to a cross-departmental faculty audience and received commendation from senior RPI faculty.
Skills

Technical Skills

Machine Learning, LLMs & Multimodal AI

Large Language Models Vision–Language Models Multimodal Learning Transformers Retrieval-Augmented Generation Prompt Engineering Benchmark Design Model Evaluation Medical Imaging AI

Frameworks & Libraries

PyTorch TensorFlow Hugging Face SentenceTransformers Scikit-learn NumPy Pandas Matplotlib FAISS Whisper ASR

Agentic Systems & AI Safety

Synthetic Agent Societies Workflow Agents Norm Enforcement Risk Modeling Auditability Decentralized Governance Trustworthy AI

Blockchain, Security & Provenance

Ethereum Solidity Hyperledger Fabric Hardhat IPFS Smart Contracts Cryptographic Hashing Keccak-256 Blockchain Provenance

Programming & Tools

Python C/C++ Java JavaScript SQL Git LaTeX Jupyter Linux/Bash REST APIs JSON Pipelines
Contact

Let’s Connect

Open to academic collaborations, invited talks, research discussions, and future opportunities in multimodal, trustworthy, and safety-critical AI.

Profile

Md Motaleb Hossen Manik
Ph.D. Student, Computer Science
Rensselaer Polytechnic Institute, Troy, NY, USA