Hostname: page-component-8448b6f56d-cfpbc Total loading time: 0 Render date: 2024-04-17T06:44:35.945Z Has data issue: false hasContentIssue false

Finnish Syntax in Text: Methodology and Some Results of a Quantitative Study1

Published online by Cambridge University Press:  22 December 2008

Auli Hakulinen
Affiliation:
Torpankuja 5 A 3, SF–40740 Jyväskylä 74, Finland.
Fred Karlsson
Affiliation:
Department of General Linguistics, University of Helsinki, Meritullinkatu 14 A 6, SF-00170 Helsinki 17, Finland.
Get access

Abstract

The Paper contains a report on the theory and methodology used in a project on finnish syntax as investigated in (some genres of written) text. 123 texts containing 10,149 clauses (5016 graphical sentences) were analyzed according to a syntactic coding frame containing 63 variables. The variables include sentences structure, clause structure, clause type, clause function, clause length, surface word order, constituent structure, the text semantic and referential properties of nominal constituents, and some movement and deletion transformations. The theoretical considerations behind the selection of the variables are discussed in detail, as are the principles for syntactic sampling. The usefulness of the computer for certain low-level tasks is demonstrated, such as finding correlations between variables. The relevance of quantitative data to ‘qualitative’ syntactic description is stressed.

Type
Research Article
Copyright
Copyright © Cambridge University Press 1980

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Aarts, J. 1978: Syntactic coding of a computer corpus. Paper read at the 5th AILA Congress, Montreal, August 2026.Google Scholar
Allén, , Sture, 1970: Nusvensk frekvensordbok. Almquist & Wiksell, Stockholm.Google Scholar
Andersson, , Erik, (ed.), 1978: Working Papers on Computer Processing of Syntactic Data. Publications of the Research Institute of the åbo Akademi Foundation, Nr. 37. åbo.Google Scholar
Anward, , Jan, 1976: Vem har ordet och vad innebär det? In Karlsson, , Fred, (ed.) 1976: Paper from the Third Scandinavian Conference of Linguistics. Turuk, pp. 1926.Google Scholar
Bergenholtz, H. and Schaeder, B. (eds.) 1978: Text-Corpora: Materialen für eine empirische Sprach-und Literaturwissenschaft. Kronberg, Taunus.Google Scholar
Cedergren, , Henrietta, J. and Sankoff, , David, 1974: Variable rules: performance as a statistical reflection of competence. Language 50 2, pp. 333355.Google Scholar
Einarsson, , 01 1978: Talad och skrivan svenska. Studentlitteratur, Lund.Google Scholar
Ellegard, , Alvar, 1978: The syntactic structure of Englsih texts. A computer-based study of four kinds of text in the Brown University Corpus. Acta Universitatis Gothoburgensis. Gothenburg Studies in English 43.Google Scholar
Hakulinen, , Auli, and Karlsson, , Fred, 1979: Nykysuomen lauseoppia. Suomalaisen Kirjallisuuden Seura, Helsinki.Google Scholar
Halliday, M. A. K. and Hasan, , Ruqaiya, 1976: Cohension in English. Longman, London.Google Scholar
Hansson, , åke, 1973: Eterspråkets grammatik 1. Textkarakteriserande aspekter. Lundastudier i nordisk språkvetenskap 6, pp. 443.Google Scholar
Hultman, , Tor, G. and Westman, , Margareta, 1977: Gymnasistsvenska. LiberLäromedel, Lund.Google Scholar
Ikola, , Osmo, 1975: Mechanical Treatment of Syntactic Material in Finnish Dialects. In Hallap, , Valmen, (ed.) 1975: Congressus Tertius Internationalis Fenno Ugristarum, Pars I, Acta Linguistica. Valgus, Tallinn, pp. 217220.Google Scholar
Ikola, , Osmo, and Karjalainen, , Yrjö, 1977: Syntactic Archives and Finnish Dialects for Computer Work. In Zampolli, and Calzolari, (eds.) 1977, pp. 405416.Google Scholar
Jörgensen, , Nils, 1970: On makrosyntagmer i informell och formell stil. Studentlitteratur, Lund.Google Scholar
Jörgensen, , Nils, . 1976. Meningsbyggnaden i talad svenska. Studentlitteratur, Lund.Google Scholar
Kiefer, , Ferenc, 1977: Review of Janos S. Petöfi and Hannes Rieser (eds.), Studies in Text Grammer. Journal of Pragmatics 1, pp. 177192.Google Scholar
Kohonsen, , Viljo, 1976: A Note on Factors Affecting the Position of Accusative Objects and Complements in Aelfric's Catholic Homilies I. In Enkvist, , Nils, Erik and Kohonen, , Viljo, (eds.) 1976: Reports on Text Linguistics: Approaches to Word Order. Publications of the Research Institute of the åbo Akademi Foundation, Nr. 8. åbo, pp. 175–76.Google Scholar
Kohonen, , Viljo, 1978: On the Development of English Word Order in Religious Prosearound 1000 and 1200 A.D. A Quantitative Study of Word Order in Context. Publications of the Research Institute of the åbo Akademi Foundation, Nr. 38. åbo.Google Scholar
Kohonen, , Viljo, and Salmela, , Jussi, 1978: Aineiston valinnan ja atutomaattisen tietojenkäsittelyn ongelmia kielitieteellisessä tutkimuksessa. In Andersson, , Erik, (ed.) 1978, pp. 144.Google Scholar
Kučera, , Henry, and Francis, , Nelson, W. 1967: Computational Analysis of Present Day Standard English. Brown University Press, Providence, Rhode Island.Google Scholar
Labov, , William, 1972: Sociolinguistic Patterns. Basil Blackwell, Oxford.Google Scholar
Loman, , Bengt, and Jörgensen, , Nils, 1971: Manual för analys och beskrivning av makrosyntagmer. Studentlitteratur, Lund.Google Scholar
Platzack, , Christer, 1974: Språket och läsbarheten. Gleerup, Lund.Google Scholar
Quirk, , Randolph, , Greenbaum, , Sidney, , Leech, , Geoffrey, and Svartvik, , 01 1972: A Grammer of Contemporary English. Longman, London.Google Scholar
Quirk, R. and Svartvik, , 01 1978: A Corpus of Modern English. In Bergenholtz, and Schaeder, (eds.) 1978.Google Scholar
Salmela, , Jussi, and Kohonen, , Viljo, 1977: CHITAB– a “Poor Man's” Shortcut to Computer Processing of Linguustic Data. In Gellerstam, , Martin, (ed.) 1977: Nordiska datalingvistikdagar 1977. Rapporter från Språkdata, Göteborgs universitet 1/1977, pp. 8286.Google Scholar
Saukkonen, , Pauli, 1976: Kielen mikrosysteemit. Oulun yliopiston suomen ja saamen kielen laitokson tutkimusraportteja 1. Oulu.Google Scholar
Saukkonen, , Pauli, 1977: Nykysuomen saneiston yleisyystilatoa saneenloppuisessa aakkojärjestyksessä.Oulun yliopiston suomen ja saamen kielen laitoksen tutkimustaportteja 9. Oulu.Google Scholar
Strand, , Hans, 1977: Tidningsspråk. Meddeland från Institution för nordiska språk vid Stockholms universitet 1, pp. 4051.Google Scholar
Svartvik, , 01 1980: Computer-aided grammatical tagging of spoken English. Survey of Spoken English, University of Lund.CrossRefGoogle Scholar
Teleman, , Ulf 1974: Manual för grammatisk beskrivning av talad och skriven svenska. Studentlitteratur, Lund.Google Scholar
Westman, , Margareta, 1974: Bruksprosa. En funktionell stilanalys med kvantitativ metod. LiberLäromedel, Lund.Google Scholar
Virkkunen, , Hilkka, 1977: Essiivi, translatiivi, abessiivi, komitatiivi ja instruktiivi 1960 luvvn suomen yleiskielessä. Oulun yliopiston suomen ja saamen kielen laitoksen tutkimusraportteja 11. Oulu.Google Scholar
Zampolli, A. and Calzolari, N. (eds.) 1977: Computational and Mathematical Linguistics. Proceedings of the International Conference on Computational Linguistica, Pisa 27.8−1.9. 1973. Firenze.Google Scholar