Accelerating material discovery for CdTe solar cells using knowledge intense word embeddings
Thin film CdTe is the most successful second-generation solar photovoltaic technology, and further development will significantly contribute to net zero emission targets. Natural language processing technologies are applied to accelerate research on CdTe solar cells towards new material discoveries. In this work, various language models are used to extract the most frequently used words from the CdTe literature. The performance of these language models is tested and compared using a customised evaluation dataset. The optimised GloVe language model is exploited to construct a knowledge diagram in the vector space and track the material application timeline. The data-driven approach provides useful insights for future research and will accelerate material discoveries in CdTe solar cells.
Funding
Doped emitters to unlock lowest cost solar electricity
Engineering and Physical Sciences Research Council
Find out more...History
School
- Mechanical, Electrical and Manufacturing Engineering
Published in
2024 IEEE 52nd Photovoltaic Specialist Conference (PVSC)Source
2024 IEEE 52nd Photovoltaic Specialist Conference (PVSC)Publisher
IEEEVersion
- AM (Accepted Manuscript)
Rights holder
© IEEEPublisher statement
Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.Publication date
2024-11-15Copyright date
2024ISBN
9781665464260; 9781665475822ISSN
0160-8371eISSN
2995-1755Publisher version
Language
- en