
Ontario High School Science Word List (OHSWL)
Abstract
This research aims to explain the development of an Ontario High School Science Corpus and subsequently an Ontario High School Science Word List (OHSWL). The OHSWL is a list of the most frequent technical words in the Ontario high school science curriculum. The science corpus was compiled from Ontario science textbooks and public written lecture material. A total of 803 lemmas were identified as part of the OHSWL. The coverage of the OHSWL in the science corpus vs non-science corpus is 7.79% and 1.52% respectively. The high frequency vocabulary (top 3,000 words) of the Corpus of Contemporary American English (COCA) and OHSWL had a coverage of 85.44% and 75.67% in the science corpus compared to the non-science corpus. With an approximately 10% difference in coverage, the OHSWL proves to be a significant source of vocabulary for an Ontario science learner. While coverage of the first and second 1,000 words of the COCA were greater in the science corpus compared to the OHSWL, coverage of the third 1,000 words was only marginally greater. Therefore, past the top 3,000 words of the COCA, the greatest value for someone learning the Ontario science curriculum is achieved by knowing the OHSWL. This corpus-based study has the potential of helping students in Ontario, regardless of whether they speak English as their first language or not.