Scottish Corpus of Texts and Speech

From Wikipedia, the free encyclopedia

SCOTS logo

Contents

The Scottish Corpus of Texts & Speech (SCOTS) is an ongoing project to build a corpus of modern-day (post-1940) written and spoken texts in Scottish English and varieties of Scots. SCOTS has been available online since November 2004, and can be freely searched and browsed. By the end of the project, in mid-2007, SCOTS aims to increase the size of the text collection to 4 million words.

The project is a venture by the Department of English Language and STELLA project at the University of Glasgow. SCOTS is grant-funded by the Arts and Humanities Research Council.

SCOTS contains texts in Scottish English and varieties of broad Scots, including Doric, Lallans, urban varieties such as Glaswegian and Insular Scots. SCOTS contains a geographical spread of texts as well as a demographic spread. Each text is accompanied by extensive metadata, including such information as author’s decade of birth, gender, occupation, birthplace and place of residence, and details about the text such as publication information, audience, date and genre.

SCOTS is a multimedia corpus, containing written texts and spoken texts, available as orthographic transcriptions, accompanied by source audio or video files. SCOTS includes a large number of genres and text types, including prose fiction, poetry, business and personal correspondence, religious texts, parliamentary and administrative documents, emails, conversations and interviews.

SCOTS can be investigated in various ways, depending on the user’s interest. The corpus can be browsed, for example by the author’s name or date of the text, and all texts can be downloaded in plain text format.

Transcriptions are synchronised with audio / video files, which are streamed and may also be downloaded.

An Advanced Search facility allows the user to build up more complex queries, choosing from all the fields available in the metadata. Geographical results are plotted on an interactive map, so regional variation may be investigated.

Advanced Search results can also be viewed as a KWIC concordance, which can be reordered to highlight collocational patterns.

Advanced Search
Included Web Search Engines


Safe Search

close

Top Matching Results

Occasionally Search.com will highlight specialized results that are based on the context of your query. Examples of specialized results include specific links to news, images, or video.

Top Matching Results may highlight information from other Search.com pages, content from the CNET Network of sites, or third party content. The listings are based purely on relevance. Search.com does not receive payment for listings in this section but our partners that provide this data may get paid for listing these products.

Sponsored Links

This section contains paid listings which have been purchased by companies that want to have their sites appear for specific search terms and related content. These listings are administered, sorted and maintained by a third party and are not endorsed by Search.com.

Search Results

Search.com sends your search query to several search engines at one time and integrates the results into one list which has been sorted by relevance using Search.com's proprietary algorithm. You can customize the list of search engines included in your metasearch from the preferences.

The search engines that are used in your metasearch may allow companies to pay to have their Web sites included within the results. To view the Paid Inclusion policy for a specific search engine, please visit their Web site. Search.com does not accept payment or share revenue with any search engine partner for listings in this section.