wrote a small python program, which investigates Zipf law. from this, we realize that for most of the words of a text, we don't have many examples and our data about words, based on their frequencies, are sparse.
Zipf law diagram for Quran
zipf law diagram for Bijankhan corpus