Ph.D. Student · Computer Science · RPI

Multimodal AI · Medical Imaging · Trustworthy Systems

Md Motaleb Hossen Manik

I am currently a Ph.D. student in Computer Science at Rensselaer Polytechnic Institute, where my research centers on multimodal AI for safety-critical domains. My work focuses particularly on medical imaging education, workflow auditability, and synthetic agent societies, combining benchmark design, interactive system development, and risk-aware evaluation with emphasis on reproducibility, provenance, and accountability.

Google Scholar LinkedIn

Ph.D. student in Computer Science at Rensselaer Polytechnic Institute, advised by Prof. Ge Wang, and Assistant Professor on study leave from KUET.

Current profile

Research directions Multimodal AI, medical imaging education, workflow auditability, synthetic agent societies, provenance, and trustworthy AI

Current institution Rensselaer Polytechnic Institute · Wang-AXIS Lab

Current citation profile 462+ Google Scholar citations · h-index 8

Professional direction Interested in faculty, research scientist, and collaboration opportunities in large language models, multimodal AI, medical imaging, and agentic systems

Research

Research Overview

My research centers on multimodal AI for safety-critical domains, especially medical imaging education, workflow auditability, and synthetic agent societies. Across these themes, I focus on structured, verifiable, and human-centered AI.

Multimodal AI, LLMs & Medical Imaging Education

2019 — Present

Ph.D. at RPI; earlier ML/NLP foundation

My recent research focuses on multimodal AI systems for medical imaging education, benchmark design, and interactive learning. I built MEDI-SLATE, a slide-lecture aligned dataset containing 1,117 high-resolution slides and 262,182 refined narration tokens from a full 23-lecture undergraduate medical imaging course.

I designed MILU, a benchmark for structured lecture understanding across four open-source VLMs, generating 15,000+ JSON artifacts in 9–11 GPU-hours on 4 × NVIDIA RTX A5000 GPUs. Across 1,117 slides, parsing coverage remained 92–99%, while semantic agreement stayed low, with pairwise concept Jaccard of 0.03–0.09 and triple-level F1 of 0.001–0.033.

I also contributed to ALIVE, a fully local avatar-lecture interaction engine integrating ASR, FAISS-based retrieval, local LLM reasoning, text-to-speech, and talking-head synthesis for real-time lecture-grounded interaction. Earlier work in ML and NLP included Bangla and phonetic Bangla text analysis, where a manually annotated dataset of 1,500 reviews achieved 75.58% accuracy with SVM, and later work on unified sentiment and emotion recognition.

AI Safety & Synthetic Agent Systems

2025 — Present

Ph.D., RPI

My current safety-oriented work studies risky instruction propagation, social regulation, and decentralized governance in synthetic AI societies. I developed OpenClaw on Moltbook, an agent-only social environment for analyzing emergent behavior among autonomous agents.

In an empirical study of 39,026 posts and 5,712 comments from 14,490 agents, I found that 18.4% of posts contained action-inducing language and that such posts were more likely to elicit norm-enforcing responses, while toxic responses remained rare.

I also developed ADAPT, an AI-driven decentralized publishing framework that models scholarly publishing as a closed-loop governance system with bounded policy adaptation under overload, disagreement, and collusion-related stress. This work led to U.S. Provisional Patent Application No. 63/975,609.

Blockchain Provenance, Secure Retrieval & Applied AI

2020 — Present

M.Sc. at KUET + continuing work

My M.Sc. and related work focus on blockchain-based trust, provenance, and secure information systems. I completed my M.Sc. thesis on a blockchain-based secure framework for user-centric multi-party skyline queries, introducing multi-party ElGamal, re-encryption and shuffling, targeted queries, and blockchain-based integrity with distinct blocks for each party.

I built SlideChain, a blockchain-backed semantic provenance framework for educational AI, using four VLMs over 1,117 lecture slides and achieving approximately one-slide-per-second registration throughput, 100% tamper detection, and deterministic reproducibility with Jaccard = 1.0.

I also developed secure data-sharing and applied AI systems including ShaEr, a privacy-preserving medical data sharing and monetisation framework, and contributed to a blockchain-aided heart disease detection system integrating seven datasets and achieving 89.2% accuracy, with 85.3% precision, 97.0% recall, and 90.8% F1 using a voting ensemble with private blockchain support.

Academic Profile

Education

Ph.D. in Computer Science

Aug 2024 – Present

Rensselaer Polytechnic Institute, Troy, NY

Advisor: Prof. Ge Wang · Wang-AXIS Lab

M.Sc. Eng. in Computer Science and Engineering

Sep 2020 – Mar 2023

Khulna University of Engineering & Technology

CGPA 4.00 / 4.00 · Thesis on blockchain-based secure multi-party skyline queries

B.Sc. Eng. in Computer Science and Engineering

Nov 2015 – Mar 2020

Khulna University of Engineering & Technology

CGPA 3.93 / 4.00 · Ranked 1st out of 134 graduates

Teaching

Teaching & Academic Positions

Teaching Assistant

Aug 2024 – Present

Department of Computer Science, RPI

Courses include Principles of Software, Programming Languages, Software Design and Documentation, and RCOS.

Assistant Professor (On Study Leave)

Jun 2024 – Present

Department of Computer Science and Engineering, KUET

On study leave while pursuing doctoral research at RPI.

Lecturer

Feb 2022 – May 2024

Department of Computer Science and Engineering, KUET

Taught microprocessors, robotics lab, mobile computing, technical writing, and digital system design.

Record

Publications

Complete publication record across journals, conferences, preprints, and manuscripts under review.

2026

MEDI-SLATE: A slide-lecture aligned text ensemble for medical imaging education

Manik, M. M. H., Islam, M. Z., and Wang, G. Visual Computing for Industry, Biomedicine, and Art, in press.

Journal

2026

MILU: A Consensus Ensemble Benchmark for Multimodal Medical Imaging Lecture Understanding

Manik, M. M. H., Islam, M. Z., and Wang, G. Journal of Medical Imaging, 13(6):062202.

Paper

Journal

2026

OpenClaw Agents on Moltbook: Risky Instruction Sharing and Norm Enforcement in an Agent-Only Social Network

Manik, M. M. H., and Wang, G. arXiv:2602.02625.

Manik, M. M. H., Habib, M. A., and Ahmed, T. International Conference on Machine Intelligence and Emerging Technologies.

Paper

Conference

2022

Classification of DNA Sequence Using Machine Learning Techniques

Habib, M. A., Manik, M. M. H., and Khulna, B. EasyChair preprint.

Preprint

2021

Machine learning approaches for tackling novel coronavirus (COVID-19) pandemic

Rahman, M. M., Islam, M., Manik, M. M. H., Islam, M. R., and Al-Rakhami, M. S. SN Computer Science, 2, 1–10.

Paper

Journal

2020

An automated system to limit COVID-19 using facial mask detection in smart city network

Rahman, M. M., Manik, M. M. H., Islam, M. M., Mahmud, S., and Kim, J. H. IEMTRONICS 2020.

Paper

Conference

2019

Opinion Mining from Bangla and Phonetic Bangla Reviews Using Vectorization Methods

Haque, F., Manik, M. M. H., and Hashem, M. M. A. EICT 2019.

Paper

Conference

Recognition

Honors, Funding & Patent

Honors & Awards

Best Paper Award, ECCE 2023
Dean’s Award for three consecutive years at KUET
Merit Scholarship during undergraduate study

Grant Support

Graduate research collaborator on an NVIDIA Academic Grant Program project led by Prof. Ge Wang, focused on Iterative Avatar Teaching in the NVIDIA Omniverse.

Patent

Co-inventor on the U.S. provisional patent for ADAPT: AI-Driven Decentralized Adaptive Publishing Testbed.

Service

Professional Service & Mentorship

Journal Reviewer Technology in Society, Elsevier · Journal of Systems and Software, Elsevier

Conference Reviewer IEEE COMPAS 2025 · EICT 2025 · ICCAconf 2024 · EICT 2023

Mentorship & Supervision Supervised 3 undergraduate B.Sc. thesis groups and 2 undergraduate project groups at KUET across machine learning, blockchain systems, and LLM-related topics.

Invited Talk “All About AI: Avatarized Online Course,” ECSE Best Practices Series, Department of Electrical, Computer and Systems Engineering, RPI, August 7, 2025.

Presentation Context Invited by Prof. Shayla Sawyer. Presented the ALIVE avatarized lecture system to a cross-departmental faculty audience and received commendation from senior RPI faculty.

Skills

Technical Skills

Machine Learning, LLMs & Multimodal AI

Large Language Models Vision–Language Models Multimodal Learning Transformers Retrieval-Augmented Generation Prompt Engineering Benchmark Design Model Evaluation Medical Imaging AI

Frameworks & Libraries

PyTorch TensorFlow Hugging Face SentenceTransformers Scikit-learn NumPy Pandas Matplotlib FAISS Whisper ASR

Agentic Systems & AI Safety

Synthetic Agent Societies Workflow Agents Norm Enforcement Risk Modeling Auditability Decentralized Governance Trustworthy AI

Blockchain, Security & Provenance

Ethereum Solidity Hyperledger Fabric Hardhat IPFS Smart Contracts Cryptographic Hashing Keccak-256 Blockchain Provenance

Programming & Tools

Python C/C++ Java JavaScript SQL Git LaTeX Jupyter Linux/Bash REST APIs JSON Pipelines

Contact

Let’s Connect

Open to academic collaborations, invited talks, research discussions, and future opportunities in multimodal, trustworthy, and safety-critical AI.

Profile

Md Motaleb Hossen Manik
Ph.D. Student, Computer Science
Rensselaer Polytechnic Institute, Troy, NY, USA

Email manikm@rpi.edu

↗

Google Scholar Publications and citation profile

↗

LinkedIn mh-manik

↗

CV Download current PDF

↗