Abstract:
Representative large models and their related applications, such as Bidirectional encoder representations from transformers (BERT), Generative pretrained transformer (GPT), Segment anything model (SAM), ChatGPT, DALL-E, Wenxin, and Pangu, have made astounding strides and exerted considerable influence across various fields domestically and abroad. They constantly attract the attention and follow-up of diverse societal sectors, including enterprises, universities, and research institutions. Large model applications have been successfully applied in scenarios such as biology, medicine, law, and social governance. Designing, modifying, and constructing domain-specific large models are crucial for truly harnessing their application value. Therefore, this paper provides inspiration for the application of large models in materials science. First, it provides an overview of large models, introducing their basic concepts, developmental process, technical classification, and features. Second, from the perspectives of the general domain and specific large models, this paper summarizes the applications of large models and analyzes the application scenarios and functions of various types of large models. Subsequently, considering the specific needs and current state of research in the field of materials science, this paper reviews the application of large language models, large visual models, and large multimodal models. It integrates engineering strategies and zero-shot knowledge transfer learning from specific tasks in natural language processing and computer vision and referencing typical application cases, clarifying current research paradigms and limiting factors for applying large models to materials science. To verify the effectiveness and potential of the visual large model, basal experiments of image segmentation and key structure extraction are performed on the microscopic image data of four types of materials using improved SAM, including Ni-superalloy, superalloy, polycrystalline pure iron grain, and Inconel 939. The experimental results reveal that the zero-shot segmentation capability of SAM has enormous potential for accurate and efficient representation of material microstructures. With the help of tailored prompt engineering, precise masks of the precipitated phase, grain boundaries, and cracks can be outputted without any label. Finally, this paper proposes future research opportunities for technologies and methods related to large models in materials science. This paper assesses the feasibility and technical challenges for the development and tuning of unimodal to comprehensive multimodal large models. With continuous innovations and collaborations, the horizon for large models in materials science seems boundlessly promising. The integration of these models can produce a new era of advanced research, leading to advancements that were previously considered unattainable. The symbiosis between materials science and large models can pave the way for unforeseen discoveries, enriching our scientific prowess.