To test how well Google assigned web pages to countries, we looked up a number of words and constructions from web pages that focused on dialects of Portuguese (example), where a particular word or construction was supposedly more common in a given country or region. The fact that the following words and phrases do appear much more frequently in the expected country suggests that Google's categorization is quite good.
Lexical
While the contrasts above focused on Brazil and Portugal, the
corpus can of course be used to compare Angola and Mozambique to
the other dialects. For example, the following are words that
are more common in Angola:
Syntactic and morphological
Of course the corpus can be used to look at syntactic and morphological
differences between dialects as well. The following are just a few examples of
differences between Brazil and Portugal:
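The kind of comparison described in this section, checking whether a word or construction is more frequent in one country than in another, can be sketched in a few lines of Python. This is only a minimal illustration and not the corpus interface itself: the function names `per_million` and `frequency_ratio` are hypothetical, and the counts and sub-corpus sizes are invented placeholders rather than real corpus figures. Raw counts are normalized to occurrences per million words, so that sub-corpora of different sizes can be compared fairly.

```python
def per_million(count, corpus_size):
    """Normalize a raw count to occurrences per million words."""
    return count * 1_000_000 / corpus_size


def frequency_ratio(count_a, size_a, count_b, size_b):
    """Ratio of the per-million frequency in dialect A to that in dialect B.

    Returns float('inf') when the form occurs in A but never in B,
    and 0.0 when it occurs in neither.
    """
    rate_a = per_million(count_a, size_a)
    rate_b = per_million(count_b, size_b)
    if rate_b == 0:
        return float("inf") if rate_a > 0 else 0.0
    return rate_a / rate_b


# Placeholder numbers for illustration only (NOT real corpus data):
# a form found 300 times in a 2,000,000-word sub-corpus for country A
# and 30 times in a 1,000,000-word sub-corpus for country B.
ratio = frequency_ratio(300, 2_000_000, 30, 1_000_000)
print(f"Per-million ratio, A to B: {ratio:.1f}")
```

A ratio well above 1 means the form is relatively more common in the first country, and a ratio well below 1 means it is more common in the second.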