Can you construct a linguistic tree by connecting (chunking) all the words and phrases (of a specific sentence) that have an entry in Wikipedia? And which semantic layer can you add by, furthermore, adding the categories as deployed by the Wikipedia Community for each phrase?
Have a look at the following two examples. The first layer in red signify all the phrases that have an entry in Wikipedia. The X means, that there are more than one entries in Wikipedia that start with the first chunk followed by one or more words.
Example 1: The people want the downfall of the regime. (One of the most used chants by the protesters in the Middle East and North-Africa: Ash-sha`b yurid isqat an-nizam. In Arabic: .الشعب يريد إسقاط النظام)
Example 2: The Blood of the martyrs does not go to waste. (From the Arabic chant used by the protesters in the Middle East and North Africa: dam ashuhada. ma yimsheesh haba. In Arabic: .دم الشهداء. ما يمشيش هباء). Click to see the tree bigger.