Northeastern University: Foundations of Large Language Models
Summary: The content explores foundational methods and advanced techniques in large language model development, including pre-training, generative architectures like Transformers, scaling strategies, alignment through reinforcement learning and instruction fine-tuning, and various prompting methods.
Read Full Report: https://arxiv.org/pdf/2501.09223
The report details foundational concepts and advanced techniques in large language model (LLM) development. It covers pre-training methods, including masked language modeling and discriminative training, and explores generative model architectures such as the Transformer.
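The masked language modeling objective mentioned above can be sketched in a few lines: a fraction of tokens is hidden behind a `[MASK]` symbol, and the model is trained to recover the originals at those positions. This is a toy illustration only; real systems operate on subword token IDs, and the token list, masking rate, and `mask_tokens` helper here are assumptions for demonstration.

```python
import random

def mask_tokens(tokens, mask_rate=0.15, seed=1):
    """Replace roughly `mask_rate` of tokens with [MASK]; return the
    corrupted sequence and a map of masked positions to their labels."""
    rng = random.Random(seed)
    masked, targets = [], {}
    for i, tok in enumerate(tokens):
        if rng.random() < mask_rate:
            targets[i] = tok          # label the model must predict
            masked.append("[MASK]")
        else:
            masked.append(tok)
    return masked, targets

tokens = "large language models learn from text".split()
masked, targets = mask_tokens(tokens)
print(masked)   # with seed=1, exactly one token happens to be masked
print(targets)  # {position: original token} — the training targets
```

The training loss is then the cross-entropy of the model's predictions at the masked positions only, so the model learns to use bidirectional context to fill in the blanks.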
The text also examines scaling LLMs for size and context length, along with alignment strategies such as reinforcement learning from human feedback (RLHF) and instruction fine-tuning.
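At the heart of RLHF is a reward model trained on human preference pairs. A minimal sketch of the standard Bradley-Terry preference loss, assuming scalar reward scores for a preferred ("chosen") and a dispreferred ("rejected") response (the scores below are illustrative numbers, not outputs of a real model):

```python
import math

def preference_loss(r_chosen: float, r_rejected: float) -> float:
    """Bradley-Terry preference loss: -log sigmoid(r_chosen - r_rejected).
    Small when the chosen response scores higher than the rejected one."""
    return -math.log(1.0 / (1.0 + math.exp(-(r_chosen - r_rejected))))

print(preference_loss(2.0, 0.5))  # ranking respected -> low loss
print(preference_loss(0.5, 2.0))  # ranking violated -> high loss
```

Minimizing this loss pushes the reward model to score human-preferred responses higher; the resulting reward signal then guides policy optimization of the LLM itself.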
Finally, it discusses prompting techniques, including chain-of-thought prompting and prompt optimization methods to improve LLM performance and alignment with human preferences.
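Chain-of-thought prompting works by including intermediate reasoning steps in the prompt, nudging the model to reason step by step before answering. A hypothetical sketch of assembling such a prompt (the exemplar text and `build_cot_prompt` helper are assumptions for illustration, not tied to any particular API):

```python
def build_cot_prompt(question: str) -> str:
    """Prepend a worked few-shot exemplar, then cue step-by-step reasoning."""
    exemplar = (
        "Q: A pen costs $2 and a notebook costs $3. How much do 2 pens "
        "and 1 notebook cost?\n"
        "A: Two pens cost 2 * $2 = $4. Adding one notebook, $4 + $3 = $7. "
        "The answer is 7.\n\n"
    )
    return exemplar + f"Q: {question}\nA: Let's think step by step."

print(build_cot_prompt("If 3 apples cost $6, how much do 5 apples cost?"))
```

Because the exemplar demonstrates explicit intermediate arithmetic rather than just a final answer, the model is more likely to produce its own reasoning trace before committing to an answer.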