Personal profile
Biography
Dr. Abdelkader Baggag is a Machine Learning Senior Scientist at the Qatar Computing Research Institute. He is a member of the Qatar Center for Artificial Intelligence, with a joint appointment as Associate Professor in the Information and Computing Technology Division, Hamad Bin Khalifa University, where he teaches a graduate course on Generative AI Foundations. Dr. Baggag holds a Ph.D. in Computer Science from the Department of Computer Science and Engineering, University of Minnesota, USA, with a concentration on Machine Learning and Scalable Numerical Linear Algebra.
Prior to joining the Qatar Computing Research Institute, Dr. Baggag was an academic at McGill University, and then a tenured Associate Professor at Laval University in Canada.
Dr. Baggag has extensive experience in, and in-depth knowledge of, applied machine learning and numerical methods for large-scale systems arising in engineering applications. He gained this expertise while working at leading High-Performance Computing research centers, namely the Computing Research Institute at Purdue University; the Institute for Computer Applications in Science and Engineering (ICASE) at NASA Langley Research Center in Hampton, VA; and the Army High Performance Computing Research Center and the Minnesota Supercomputer Institute, both in Minnesota, USA.
Dr. Baggag's research is broad and spans all aspects of machine learning. Particular strengths are in Bayesian and numerical linear algebra approaches to modeling and inference in multimodal large language models (LLMs). The work ranges from studying fundamental concepts, e.g., reducing the Transformer's quadratic complexity in memory and computation with respect to sequence length to linear, all the way to getting the algorithms to perform competitively against the state of the art in big-data applications.
Dr. Baggag's research interests span Generative AI, Representation Learning, and multimodal Large Language Models (LLMs). He is also interested in optimal transport and matrix completion, exploiting techniques from matrix functions and Random Matrix Theory for machine learning. He has worked on AI and ML applications that include AI for Wearable Data Analytics, Traffic Prediction and Missing Data Imputation, and AI for Resilient Smart Cities.
Currently, Dr. Baggag is working on Multimodal Large Language Models and Application-Driven Offline Reinforcement Learning, i.e., offline reinforcement learning research and prescriptive learning, with applications in LLMs, e.g., RLHF and implicit reward methods for alignment such as Direct Preference Optimization (DPO).
Current projects include:
- The Linear Algebra of Large Language Models.
- Mass fact editing in LLMs.
- Watermarking of LLMs.
Vision: What are the main open questions in LLMs? There are many topics, e.g., going beyond Transformer architectures (Mamba, for example), building small models that are highly efficient, making LLMs talk to each other (AI agents), grounding LLMs in reality with sensing capabilities, and doing large-scale distributed training. We are just starting to do the real research in LLMs. Everything before was based on engineering craft, which is already hard.
Experience
- PhD in Computer Science (Summa Cum Laude), major in Machine Learning, Scalable Numerical Linear Algebra, HPC, Random Matrix Theory for Machine Learning.
- Capacity Building in Artificial Intelligence in Qatar:
- Mentoring HBKU doctoral and master's students | Recruiting and training Postdocs, Scientists, Research Assistants, Research Associates and Summer Interns.
- A team leader experienced in guiding engineers and scientists.
- Expertise in Machine Learning and Artificial Intelligence, Generative AI, High-Performance Computing, Reinforcement Learning, Large Language Models.
- Data Analytics | Design of data-driven tools for real-world applications.
- Penalized models such as Lasso, ElasticNet, GroupLasso, Ridge, etc.
- Data representation and reduction: Nonnegative matrix factorization, data representation in domain transfer learning problems, multi-manifolds.
Education/Academic qualification
Computer Science, PhD, Linear System Solvers in Particulate Flows, University of Minnesota Twin Cities
Award Date: 15 Feb 2002
Applied Mathematics, Master, Finite Element Method in Turbulent Flows, Ecole Polytechnique of Montreal
Award Date: 15 Jul 1993
Ingénieur d’État, Bachelor, A Finite Difference Method for Diphasic Flows, Ecole Nationale Polytechnique d'Alger
Award Date: 15 Jun 1990
Keywords
- QA75 Electronic computers. Computer science
- Artificial Intelligence, Numerical Linear Algebra, Iterative Solvers and Preconditioners for Large Linear Systems, HPC, Random Matrix Theory for ML, Optimization
- Q Science (General)
Projects
QCRI-CORE-000019: Qatar Diabetes Prevention Program (QDPP)
Saad, M. (Lead Principal Investigator), Aupetit, M.J.-M. (Principal Investigator), Baggag, A. (Principal Investigator) & Al Thani, D. A. (Principal Investigator)
1/01/18 → 31/12/25
Project: Basic Research
ClustML: A measure of cluster pattern complexity in scatterplots learnt from human-labeled groupings
Hamza, M. M., Ullah, E., Baggag, A., Bensmail, H., Sedlmair, M. & Aupetit, M., Apr 2024, In: Information Visualization. 23, 2, p. 105-122, 18 p. Research output: Contribution to journal › Article › peer-review
Open Access, 2 Citations (Scopus)
Exploring the Applications of Explainability in Wearable Data Analytics: Systematic Literature Review
Abdelaal, Y., Aupetit, M., Baggag, A. & Al-Thani, D., 2024, In: Journal of Medical Internet Research. 26, e53863. Research output: Contribution to journal › Article › peer-review
How Much Wearable Data is Enough for the Utility and Trust of Augmented Artificial Intelligence Systems? A Scenario-Based Interview with Medical Professionals
Abdelaal, Y., Aupetit, M., Baggag, A., Bashir, M. & Al-Thani, D., 21 Sept 2024, In: International Journal of Human-Computer Interaction. Research output: Contribution to journal › Article › peer-review
Open Access
ACTIVE LEARNING FOR SCALABLE ARRANGEMENT AND GROUPING OF DATA
Aupetit, M.J.-M. (Inventor), Baggag, A. (Inventor) & Abuthawabeh, A. (Inventor), 23 Nov 2023, IPC No. G06F 3/04817 A I, Patent No. US2023376164, Priority date 18 May 2023, Priority No. US202318199175. Research output: Patent
OutSingle: a novel method of detecting and injecting outliers in RNA-Seq count data using the optimal hard threshold for singular values
Salkovic, E., Baggag, A., Salem, A. G. R., Bensmail, H. & Sadeghi, M. A., 1 Apr 2023, In: Bioinformatics. 39, 4, btad142. Research output: Contribution to journal › Article › peer-review
Open Access, 1 Citation (Scopus)