Take the DBLP database of papers in Computer Science, grab all the titles, do a frequency count for the words, keep only nouns. You get the list of the 20 most used nouns in CS paper titles:
- systems
- system
- data
- analysis
- networks
- model
- design
- algorithm
- approach
- information
- time
- software
- distributed
- learning
- network
- performance
- parallel
- web
- control
- algorithms
(Note that I did not conflate singular with plural usages.) Remembering Shannon's theory, next time when you see a title made out of almost only these words you should realize that it is a CS paper, but not much more is revealed by the title.
No comments:
Post a Comment
Note: (1) You need to have third-party cookies enabled in order to comment on Blogger. (2) Better to copy your comment before hitting publish/preview. Blogger sometimes eats comments on the first try, but the second works. Crazy Blogger.