Difference between revisions of "Category:Linguistics"

From Christoph's Personal Wiki
Jump to: navigation, search
(External links)
(Lists)
 
(39 intermediate revisions by the same user not shown)
Line 8: Line 8:
 
* Spanish (college level)
 
* Spanish (college level)
  
== Web phrases ==
+
==George Orwell's writing advice==
;hacks : ''ad hoc'' implementations
+
''Note: From George Orwell's "[http://www.orwell.ru/library/essays/politics/english/e_polit Politics and the English Language]" (see also: [[wikipedia:Politics and the English Language]] for background information).''
;Web 3.0 : semantic web
+
;screen scraping : a technique in which a computer program extracts text data from the display output of another program (see: "web scraping")
+
  
== External links ==
+
A scrupulous writer, in every sentence that he writes, will ask himself at least four questions, thus:
* [http://www.askoxford.com/?view=uk AskOxford] — a free online dictionary resource from OUP
+
# What am I trying to say?
* [http://www.ldc.upenn.edu/ Linguistic Data Consortium]
+
# What words will express it?
* [http://www.link.cs.cmu.edu/link/ The Link Grammar Parser] — a syntactic parser of English, based on link grammar, an original theory of English syntax.
+
# What image or idiom will make it clearer?
* [http://www.ahds.ac.uk/creating/guides/linguistic-corpora/index.htm Developing Linguistic Corpora: a Guide to Good Practice]
+
# Is this image fresh enough to have an effect?
* [http://corpora.informatik.uni-leipzig.de/ Wortschatz] — Search in 17 Corpus-Based Monolingual Dictionaries (by the Universität Leipzig)
+
* [http://home.unilang.org/wiki3/index.php/Main_Page UniLang Wiki] — a database of language- and linguistic-related information
+
* [http://www.webcorp.org.uk/ WebCorp] — The Web as a Corpus
+
=== Wikipedia articles on Linguistics ===
+
* [http://en.wikipedia.org/wiki/Linguistics Wikipedia article on '''Linguistics''']
+
* [http://en.wikipedia.org/wiki/Oxford_spelling Wikipedia article on '''Oxford spelling''']
+
* [http://en.wikipedia.org/wiki/N-gram N-gram]
+
* [http://en.wikipedia.org/wiki/List_of_Latin_abbreviations List of Latin abbreviations]
+
* [http://en.wikipedia.org/wiki/Latin_declension Latin declension]
+
* [http://en.wiktionary.org/wiki/Category:English_words_spelled_with_diacritics_or_ligatures Category:English words spelled with diacritics or ligatures]
+
  
=== UTF-8 ===
+
And he will probably ask himself two more:
* [http://www.unicode.org/charts/ The Unicode Character Code Charts By Script]
+
# Could I put it more shortly?
* [http://www.unics.uni-hannover.de/nhtcapri/multilingual1.html Unicode (UTF-8) test]
+
# Have I said anything that is avoidably ugly?
* [http://www.w3.org/2001/06/utf-8-test/UTF-8-demo.html UTF-8 encoded sample plain-text file] — original by Markus Kuhn, adapted for HTML by Martin Dürst.
+
* [http://www.slovo.info/testuni.htm test page for UNICODE UTF-8 encoding] — no longer maintained.
+
* [[wikipedia:UTF-8]]
+
  
[[Category:Science]]
+
One can often be in doubt about the effect of a word or a phrase, and one needs rules that one can rely on when instinct fails. I think the following rules will cover most cases:
 +
# Never use a metaphor, simile, or other figure of speech which you are used to seeing in print.
 +
# Never use a long word where a short one will do.
 +
# If it is possible to cut a word out, always cut it out.
 +
# Never use the passive where you can use the active.
 +
# Never use a foreign phrase, a scientific word, or a jargon word if you can think of an everyday English equivalent.
 +
# Break any of these rules sooner than say anything outright barbarous.
 +
 
 +
==Double meaning==
 +
*The bandage was '''wound''' around the '''wound'''.
 +
*The farm was used to '''produce''' '''produce'''.
 +
*The dump was so full that it had to '''refuse''' more '''refuse'''.
 +
*We must '''polish''' the '''Polish''' furniture.
 +
*He could '''lead''' if he would get the '''lead''' out.
 +
*The soldier decided to '''desert''' his '''dessert''' in the '''desert'''.
 +
*Since there is no time like the '''present''', he thought it was time to '''present''' the '''present'''.
 +
*A '''bass''' was painted on the head of the '''bass''' drum
 +
*When shot at, the '''dove''' '''dove''' into the bushes.
 +
*I did not '''object''' to the '''object'''.
 +
*The insurance was '''invalid''' for the '''invalid'''.
 +
*There was a '''row''' among the oarsmen about how to '''row'''.
 +
*They were too '''close''' to the door to '''close''' it.
 +
*The buck '''does''' funny things when the '''does''' are present.
 +
*A seamstress and a '''sewer''' fell down into a '''sewer''' line.
 +
*To help with planting, the farmer taught his '''sow''' to '''sow'''.
 +
*The '''wind''' was too strong to '''wind''' the sail.
 +
*Upon seeing the '''tear''' in the painting I shed a '''tear'''.
 +
*I had to '''subject''' the '''subject''' to a series of tests.
 +
*How can I '''intimate''' this to my most '''intimate''' friend?
 +
 
 +
==Useful language-learning sentences==
 +
see: [http://www.fourhourworkweek.com/blog/2007/11/07/how-to-learn-but-not-master-any-language-in-1-hour-plus-a-favor/ How to Learn (But Not Master) Any Language in 1 Hour]
 +
*The apple is red.
 +
*It is John's apple.
 +
*I give John the apple.
 +
*We give him the apple.
 +
*He gives it to John.
 +
*She gives it to him.
 +
*I must give it to him.
 +
*I want to give it to her.
 +
 
 +
==External links==
 +
*[[wikipedia:Fowler's Modern English Usage|''Fowler's Modern English Usage'']]
 +
*[[wikipedia:The King's English|''The King's English'']] — by H.W. Fowler[http://www.bartleby.com/116/]
 +
*[[wikipedia:The Elements of Style|''The Elements of Style'']] (aka "Strunk & White")
 +
*[[wikipedia:Plain Words|''Plain Words'']] — by Sir Ernest Gowers
 +
*[[wikipedia:Hart's Rules|''Hart's Rules for Compositors and Readers at the University Press, Oxford'']]
 +
*[http://www.oup.com/elt/catalogue/teachersites/oald7/oxford_3000/oxford_3000_list?cc=global The Oxford 3000 wordlist]
 +
*[http://dict.leo.org/?lang=en&lp=ende LEO]
 +
*[http://www.swif.it/foldop/ Free On-Line Dictionary of Philosophy] (aka "FOLDOP")
 +
*[http://foldoc.org/ Free On-Line Dictionary of Computing] (aka "FOLDOC")
 +
*[http://www.etymonline.com/ Online Etymology Dictionary]
 +
*[http://www.wordspy.com/index.asp Word Spy]
 +
*[http://www.wordorigins.org/ Wordorigins.org]
 +
*[http://www.westegg.com/etymology/ Etymologically Speaking]
 +
*[http://home.earthlink.net/~misaak/taxonomy/taxEtym.html Curiosities of Biological Nomenclature: Etymologies]
 +
*[http://www.askoxford.com/?view=uk AskOxford] — a free online dictionary resource from OUP
 +
*[http://www.ldc.upenn.edu/ Linguistic Data Consortium]
 +
*[http://www.link.cs.cmu.edu/link/ The Link Grammar Parser] — a syntactic parser of English, based on link grammar, an original theory of English syntax.
 +
*[http://www.thelatinlibrary.com/ The Latin Library]
 +
*[http://www.ahds.ac.uk/creating/guides/linguistic-corpora/index.htm Developing Linguistic Corpora: a Guide to Good Practice]
 +
*[http://corpora.informatik.uni-leipzig.de/ Wortschatz] — Search in 17 Corpus-Based Monolingual Dictionaries (by the Universität Leipzig)
 +
*[http://home.unilang.org/wiki3/index.php/Main_Page UniLang Wiki] — a database of language- and linguistic-related information
 +
*[http://www.webcorp.org.uk/ WebCorp] — The Web as a Corpus
 +
*[ftp://ftp.cpsc.ucalgary.ca/pub/projects/text.compression.corpus/ Calgary Corpus]
 +
*[http://www.ibiblio.org/pub/Linux/utils/text/ concordancer + utils] — by Ralph Meyer of Princeton
 +
*[http://www.thinkvitamin.com/features/biz/how-to-name-your-company How To Name Your Company] — (strong consonants: B, C, D, G, K, P, Q, T)
 +
*[[wikipedia:Game_of_Circ|Game of Circ]]
 +
*[http://www.clres.com/dict.html SIGLEX Resource Links] — by Special Interest Group on the Lexicon of the Association for Computational Linguistics
 +
*[http://www.lifehack.org/articles/communication/how-to-punctuate-a-sentence.html How to punctuate a sentence]
 +
*[http://www.coli.uni-saarland.de/projects/sfb378/negra-corpus/ NEGRA corpus] — a syntactically annotated corpus of German newspaper texts
 +
*[http://www.lexique.org/listes/liste_mots.txt liste_mots.txt] — from lexique.org (~130k words)
 +
*[http://deafandblind.com/word_frequency.htm Word / letter frequencies]
 +
*[http://www.mindpixel.com/ Mindpixel]
 +
*[http://www.kent.k12.wa.us/ksd/MA/resources/greek_and_latin_roots/transition.html Greek and Latin roots]
 +
*[http://www.dumblittleman.com/2007/07/50-tools-that-can-improve-your-writing.html 50 Tools that can Improve your Writing Skills]
 +
*[http://www.fallacyfiles.org/taxonomy.html The Taxonomy of Logical Fallacies]
 +
*[http://www.geocities.com/SoHo/Square/3472/balzacmining.html Balzac Text Data Mining]
 +
===Tools/Resources===
 +
*[http://www.juiciobrennan.com/hyphenator/ Lyric Hyphenator]
 +
*[http://www.wordscount.info/hw/syllable.jsp Syllable Counter]
 +
*[http://www.wordscount.info/hw/smog.jsp SMOG Calculator]
 +
===Software===
 +
*[http://linguistlist.org/sp/Software.html Linguist List]
 +
 
 +
===Wikipedia articles on Linguistics===
 +
*[[wikipedia:Linguistics|Linguistics]]
 +
*[[wikipedia:Category:English_grammar|Category:English_grammar]]
 +
*[[wikipedia:Oxford_spelling|Oxford spelling]]
 +
*[[wikipedia:N-gram|N-gram]]
 +
*[[wikipedia:Latin_declension|Latin declension]]
 +
*[[wikipedia:Concordancer|Concordancer]]
 +
*[[wikipedia:KWIC|KWIC]]
 +
*[[wikipedia:AntConc|AntConc]] — a freeware concordance program for Linux developed by Laurence Anthony.
 +
*[[wikipedia:Sapir–Whorf hypothesis|Sapir–Whorf hypothesis]]
 +
*[[wikipedia:Toponymy|Toponymy]]
 +
*[[wikipedia:Serial comma|Serial comma]]
 +
*[[wikipedia:Split infinitive|Split infinitive]]
 +
*[[wikipedia:German spelling reform of 1996|German spelling reform of 1996]]
 +
*[[wikipedia:Zipf's law|Zipf's law]]
 +
*[[wikipedia:Index of coincidence|Index of coincidence]]
 +
*[[wikipedia:Frequency analysis (cryptanalysis)|Frequency analysis (cryptanalysis)]]
 +
*[[wikipedia:Dependency grammar|Dependency grammar]]
 +
*[[wikipedia:Bigram|Bigram]]
 +
*[[wikipedia:Incremental encoding|Incremental encoding]]
 +
*[[wikipedia:Syllable coda|Syllable coda]]
 +
*[[wikipedia:Phonotactics|Phonotactics]]
 +
====Lists====
 +
*[[wikipedia:Category:English_words_spelled_with_diacritics_or_ligatures|Category:English words spelled with diacritics or ligatures]]
 +
*[[wikipedia:Old English language (list of prepositions)|Old English language (list of prepositions)]]
 +
*[[wikipedia:List of English prepositions|List of English prepositions]]
 +
*[[wikipedia:List of English irregular verbs|List of English irregular verbs]]
 +
*[[wikipedia:List of frequently misused English words|List of frequently misused English words]]
 +
*[[wikipedia:List of French phrases|List of French phrases]]
 +
*[[wikipedia:List of French phrases used by English speakers|List of French phrases used by English speakers]]
 +
*[[wikipedia:https://en.wikipedia.org/wiki/Glossary_of_French_expressions_in_English|Glossary of French expressions in English]]
 +
*[[wikiquote:French proverbs|French proverbs]]
 +
*[[wikipedia:List of German expressions in English|List of German expressions in English]]
 +
*[[wikipedia:List of German words and phrases|List of German words and phrases]]
 +
*[[wikipedia:List of Greek phrases|List of Greek phrases]]
 +
*[[wikipedia:Category:Latin_phrases|Category:Latin_phrases]]
 +
*[[wikipedia:List_of_Latin_abbreviations|List of Latin abbreviations]]
 +
*[[wikipedia:List of Latin phrases|List of Latin phrases]]
 +
*[[wikipedia:List of Latin words with English derivatives|List of Latin words with English derivatives]]
 +
*[http://www.orbilat.com/Languages/French/Vocabulary/French-International.html Internationally used French phrases]
 +
 
 +
===UTF-8===
 +
*[http://www.unicode.org/charts/ The Unicode Character Code Charts By Script]
 +
*[http://www.unics.uni-hannover.de/nhtcapri/multilingual1.html Unicode (UTF-8) test]
 +
*[http://www.w3.org/2001/06/utf-8-test/UTF-8-demo.html UTF-8 encoded sample plain-text file] — original by Markus Kuhn, adapted for HTML by Martin Dürst.
 +
*[http://www.slovo.info/testuni.htm test page for UNICODE UTF-8 encoding] — no longer maintained.
 +
*[[wikipedia:UTF-8]]
 +
 
 +
===Algorithms===
 +
*[http://svn.pugscode.org/pugs/t/examples/99problems/ 99 Problems]
 +
**'''P97 (**) Sudoku''': [http://www.norvig.com/sudoku.html Solving Every Sudoku Puzzle] — by Peter Norvig; in [[Python]]
 +
 
 +
[[Category:Hobbies]]

Latest revision as of 21:54, 17 November 2019

Linguistics is the scientific study of human language, and someone who engages in this study is called a linguist or linguistician.

I am very interested in this field, but more of as a hobby than as a career. In any case, I will document what I learn about this field in this category.

I also speak a few languages and am learning more. Below is a list of the languages I speak either fluently or with a working knowledge (and continually improving):

  • English (mother-tongue)
  • German (college level)
  • Spanish (college level)

George Orwell's writing advice

Note: From George Orwell's "Politics and the English Language" (see also: wikipedia:Politics and the English Language for background information).

A scrupulous writer, in every sentence that he writes, will ask himself at least four questions, thus:

  1. What am I trying to say?
  2. What words will express it?
  3. What image or idiom will make it clearer?
  4. Is this image fresh enough to have an effect?

And he will probably ask himself two more:

  1. Could I put it more shortly?
  2. Have I said anything that is avoidably ugly?

One can often be in doubt about the effect of a word or a phrase, and one needs rules that one can rely on when instinct fails. I think the following rules will cover most cases:

  1. Never use a metaphor, simile, or other figure of speech which you are used to seeing in print.
  2. Never use a long word where a short one will do.
  3. If it is possible to cut a word out, always cut it out.
  4. Never use the passive where you can use the active.
  5. Never use a foreign phrase, a scientific word, or a jargon word if you can think of an everyday English equivalent.
  6. Break any of these rules sooner than say anything outright barbarous.

Double meaning

  • The bandage was wound around the wound.
  • The farm was used to produce produce.
  • The dump was so full that it had to refuse more refuse.
  • We must polish the Polish furniture.
  • He could lead if he would get the lead out.
  • The soldier decided to desert his dessert in the desert.
  • Since there is no time like the present, he thought it was time to present the present.
  • A bass was painted on the head of the bass drum
  • When shot at, the dove dove into the bushes.
  • I did not object to the object.
  • The insurance was invalid for the invalid.
  • There was a row among the oarsmen about how to row.
  • They were too close to the door to close it.
  • The buck does funny things when the does are present.
  • A seamstress and a sewer fell down into a sewer line.
  • To help with planting, the farmer taught his sow to sow.
  • The wind was too strong to wind the sail.
  • Upon seeing the tear in the painting I shed a tear.
  • I had to subject the subject to a series of tests.
  • How can I intimate this to my most intimate friend?

Useful language-learning sentences

see: How to Learn (But Not Master) Any Language in 1 Hour
  • The apple is red.
  • It is John's apple.
  • I give John the apple.
  • We give him the apple.
  • He gives it to John.
  • She gives it to him.
  • I must give it to him.
  • I want to give it to her.

External links

Tools/Resources

Software

Wikipedia articles on Linguistics

Lists

UTF-8

Algorithms

Subcategories

This category has only the following subcategory.