()   

 

Now available! At Corpusdoportugues.org, we're introducing a new way to interact with corpus data. Using Large Language Models (LLMs) like GPT, Gemini, and Claude, users can now have collocates, phrases, and frequency data clustered, categorized, and explained automatically. The underlying corpus data remains unchanged — but AI provides an optional layer of analysis to help users spot patterns and connections more quickly.

Just click on any of the green links on a corpus results page to use these features.   [Sample searches | Get started]   [More]

The Corpus do Português NOW corpus (News on the Web) contains about 1.1 billion words of data from web-based newspapers and magazines in four Portuguese-speaking countries from 2012-2019.

Click on any of the links in the search form on the search page for context-sensitive help, and to see the range of queries that the corpus offers. You might pay special attention to the comparisons between dates and countries and virtual corpora, which allow you to create personalized collections of texts based on (sub-)register, website, and even words in the web pages .