IBM: Granite 3.0 LLM Enterprise Development Considerations
Summary of https://github.com/ibm-granite/granite-3.0-language-models/blob/main/paper.pdf
This paper describes the development and release of Granite 3.0, a family of open-source, lightweight foundation language models from IBM. It gives an overview of the models' design, including their architecture, training data, and post-training techniques.
It also reports the models' performance across a range of benchmarks, focusing on general knowledge, instruction following, function calling, retrieval-augmented generation, and cybersecurity.
The paper concludes by discussing the socio-technical harms and risks associated with LLMs and outlines IBM's efforts to mitigate these concerns through responsible AI practices.