Cheap online TEFL courses | best cheapest online TEFL certificate

Assessment & Feedback: Lesson 19.2

Progress Tests Achievement Tests Proficiency Tests Standardized Testing IELTS / TOEFL / TOEIC

Progress tests serve multiple purposes beyond just assigning grades. They help students identify areas where they need more practice. They give teachers feedback about which topics might need re-teaching or additional reinforcement. They also provide motivation, as students can see concrete evidence of their improvement over time.

Achievement tests are broader than progress tests. While progress tests focus on recent material, achievement tests evaluate cumulative learning over longer periods, such as a semester or entire course. These are typically the big exams at the end of a course that determine whether students have achieved the overall learning outcomes. Achievement tests often combine content from multiple units and require students to integrate various skills and knowledge they've developed.

For instance, an end-of-semester achievement test might include reading passages that require application of various grammar structures learned throughout the term, a writing task that demonstrates organizational skills and range of vocabulary developed over months, and listening exercises that test comprehension strategies practiced repeatedly. Unlike progress tests that focus narrowly on recent content, achievement tests paint a broader picture of overall development.

Check Your Understanding

According to the text, what is the key distinction between progress tests and achievement tests in language teaching?

Progress tests are always written by external examining bodies and standardized across all teaching institutions, while achievement tests are created by individual classroom teachers to reflect the specific materials and methods used in their particular courses
While progress tests focus on recent material, achievement tests evaluate cumulative learning over longer periods such as a semester or course, combining content from multiple units and requiring integration of various skills and knowledge
Progress tests assess spoken language production only and are therefore more practical than achievement tests, which typically restrict themselves to written skills such as grammar, vocabulary, reading, and structured writing tasks
Achievement tests and progress tests share identical formats and assessment criteria but differ only in length, with achievement tests being significantly longer because they cover a greater volume of material than the shorter monthly progress checks

Likewise, proficiency tests measure general language ability against an external standard, regardless of what course or program the student has taken. These tests don't assess specific courses or teaching programs. Instead, they evaluate whether someone can use English for specific real-world purposes. Common proficiency tests include Cambridge examinations such as FCE and CAE, IELTS for immigration and university admissions, TOEFL for academic contexts, and TOEIC for business English.

Proficiency tests are standardized, meaning they're administered the same way everywhere, scored using consistent criteria, and produce results that can be compared across different test-takers in different locations. A score of 6.5 on IELTS means the same thing whether the test was taken in Tokyo, São Paulo, or Cairo. This standardization makes proficiency tests valuable for gatekeeping purposes: universities can set minimum IELTS scores for admission, immigration departments can require specific scores, and employers can use TOEIC results to evaluate job applicants.

The important distinction is that proficiency tests aren't tied to any particular curriculum. Two students might achieve the same IELTS score despite having studied completely different courses using different methods and materials. This makes proficiency tests quite different from achievement tests, which directly measure whether students learned what a specific course aimed to teach.

Check Your Understanding

According to the text, what specific feature of proficiency tests makes them valuable for "gatekeeping purposes"?

Proficiency tests are standardized - administered the same way everywhere, scored using consistent criteria, and producing results that can be compared across different test-takers in different locations, so that a score of 6.5 on IELTS means the same thing whether the test was taken in Tokyo, São Paulo, or Cairo
Proficiency tests are designed and updated regularly by teams of applied linguists who ensure the content reflects current research into communicative competence, giving employers and universities confidence that the tests accurately predict real-world language performance
Proficiency tests assess all four language skills simultaneously within a single examination, providing a fully integrated picture of communicative competence that achievement and progress tests are unable to capture through their separate, component-based assessment models
Proficiency tests are freely available and universally recognised across all sectors of education and employment, removing financial and administrative barriers that might otherwise prevent institutions from using language assessment to make fair admissions and hiring decisions

Page 1 of 6

Principles of Good Test Design

Validity Reliability Content Validity Construct Validity Face Validity

Of course, simply giving tests isn't enough. Tests must be well-designed to provide accurate, useful information. Three key principles guide good test design: validity, reliability, and washback.

Validity refers to whether a test actually measures what it claims to measure. A valid reading test actually assesses reading comprehension, not just vocabulary knowledge or background knowledge about the topic. A valid speaking test evaluates spoken language ability rather than just memorization of prepared speeches. Validity is perhaps the most important quality of any assessment because invalid tests mislead everyone: they give students inaccurate feedback about their abilities and teachers false information about learning.

There are several types of validity to consider. Content validity means the test covers a representative sample of the material it's supposed to assess. If your course covered ten grammar structures but your test only checks three of them, content validity is questionable. Construct validity means the test measures the underlying ability or skill you're interested in. If you want to test listening comprehension but your questions require complex inferences and background knowledge, you might actually be testing reasoning ability or prior knowledge rather than just listening.

Face validity is the extent to which a test appears appropriate and relevant to test-takers. While this might seem superficial, it matters for motivation and acceptance. If students take a speaking test in which they never actually speak, only answer written questions about speaking, the test lacks face validity, even if it somehow correlates with speaking ability. Students will rightfully question whether this is a fair assessment.

To ensure validity, test designers must be clear about what they're trying to measure and then create tasks that directly assess those specific skills or knowledge. This requires thoughtful consideration of task types, difficulty levels, and scoring criteria.

Check Your Understanding

The text describes a situation in which a listening test requires complex inferences and significant background knowledge. What type of validity problem does the text identify in this scenario?

This situation damages content validity because the test is not covering a representative sample of the listening material that was taught during the course, meaning some students will be assessed on topics that were never adequately introduced in class
This situation undermines face validity because the test appears to be assessing listening ability on its surface but, when test-takers examine the questions closely, it becomes apparent that background knowledge is more important than listening comprehension in determining the final score
This situation reduces reliability because students with different background knowledge will produce highly variable results even under identical test conditions, making it impossible to draw consistent conclusions about any individual learner's listening comprehension ability
This is a construct validity problem - the test may actually be measuring reasoning ability or prior knowledge rather than the underlying listening comprehension ability it is intended to assess

Reliability refers to the consistency of measurement. A reliable test produces similar results when taken multiple times by the same person under similar conditions, or when scored by different raters. If a student takes the same test twice with only a short time between attempts and gets wildly different scores despite no real change in ability, the test isn't reliable. Similarly, if two trained teachers score the same essay and assign completely different grades, reliability is compromised.

Several factors affect reliability. Test length is important - longer tests with more items tend to be more reliable because individual questions have less impact on the total score. Clear instructions and well-written questions improve reliability by reducing confusion. For productive skills like speaking and writing, detailed scoring rubrics help different raters evaluate consistently.

In contrast, reliability can be damaged by ambiguous questions, inadequate time limits that rush some students more than others, testing conditions that vary significantly, or subjective scoring without clear criteria. When reliability is low, test results become somewhat random, making them useless for making important decisions.

Check Your Understanding

According to the text, why do longer tests with more items tend to be more reliable than shorter ones?

Longer tests are more likely to include tasks from the full range of language skills, ensuring that strong performance in one skill cannot compensate for weakness in another and producing total scores that reflect a more balanced and complete picture of overall language competence
Longer tests reduce the cognitive pressure on students by providing more time per question, allowing them to read and consider each item more carefully than they would under the tighter time constraints of shorter test formats used at the end of individual lessons
Individual questions have less impact on the total score in longer tests, which means no single item disproportionately determines a student's result, and overall scores are therefore more stable and consistent
When tests are longer, trained raters who score open-ended tasks have more total responses to distribute their judgements across, which means any single scoring inconsistency has a proportionally smaller effect and leads to better inter-rater agreement at the end of the marking process

Page 2 of 6

Validity vs. Reliability Washback Positive Washback Negative Washback Assessment Alignment

Unfortunately, there's often tension between validity and reliability. Multiple-choice tests tend to be highly reliable because scoring is objective and consistent, but they may lack validity for testing productive skills. Open-ended speaking tasks may have high validity for assessing communication ability but lower reliability because different raters might score the same performance differently. Good test design requires balancing these concerns.

Washback refers to the effect that tests have on teaching and learning. When a test is known to be coming, both teachers and students naturally adjust their behavior based on what the test will assess and how it will be scored. This influence can be positive or negative, and thoughtful test design should aim to create positive washback.

Positive washback occurs when test preparation activities are themselves valuable for language learning. If students know they'll need to participate in an extended spoken conversation on the final exam, they'll practice speaking regularly, which is exactly what we want. If the test requires writing a well-organized essay with a clear introduction, body paragraphs, and conclusion, students will practice this important academic skill.

On the other hand, negative washback happens when test preparation undermines good teaching practices. If a test focuses exclusively on multiple-choice grammar questions, teachers might abandon communicative activities and speaking practice in favor of drilling grammar rules and test-taking strategies. Students might memorize decontextualized grammar patterns rather than developing real communication skills. This is the danger of misaligned testing - when tests don't reflect course objectives or real language use, preparation for those tests wastes time and narrows the curriculum.

Check Your Understanding

According to the text, what is the fundamental mechanism through which negative washback occurs in a language course?

Negative washback is primarily caused by teachers who allow test preparation to replace good teaching rather than forming part of it, reflecting an excessive focus on institutional accountability and insufficient commitment to communicative learning goals
When a test does not reflect course objectives or real language use, preparing for it causes teachers to abandon communicative activities in favour of drilling and causes students to memorize patterns rather than developing real communication skills, thereby wasting time and narrowing the curriculum
Negative washback occurs most severely in exam preparation contexts where students are extrinsically motivated by the need to obtain a qualification rather than by interest in learning English, causing both teachers and students to prioritise score-raising strategies over communicative development
Negative washback results primarily from over-reliance on standardized proficiency tests such as IELTS and TOEFL, which are not designed to align with any particular course curriculum and therefore inevitably distort the teaching that precedes them

Teachers should be aware of washback effects and design assessments that encourage truly beneficial learning activities. If we want students to read extensively, tests should include authentic reading passages. If we want students to write coherently, tests should require extended writing, not just gap-fill exercises. Aligning assessment with learning goals creates positive washback that supports rather than distorts the learning process.

Page 3 of 6

Assessing Receptive Skills: Reading and Listening

Reading Assessment Listening Assessment Sub-skills Authentic Texts Note-taking Phonological Awareness

Now let's explore how to assess different language skills effectively, starting with the receptive skills of reading and listening. These skills share common assessment approaches because both involve understanding the input, though each also has unique considerations.

Reading assessment must evaluate whether students can understand written texts at appropriate levels. Good reading tests present authentic or semi-authentic texts similar to what students would encounter in real life, followed by questions or tasks that check comprehension. The texts should be level-appropriate - not so easy that students can answer questions without really reading carefully, but not so difficult that vocabulary alone prevents comprehension regardless of reading skill.

Various question types can assess different reading sub-skills. Multiple-choice questions can test both general understanding and specific details, though they need to be carefully written to avoid confusing or ambiguous options. True/false statements check whether students can accurately extract information from the text. Matching exercises might ask students to match headings to paragraphs, testing their ability to identify main ideas. Short-answer questions require students to find specific information and express it in their own words.

More advanced reading assessments might include inference questions that test whether students can read between the lines and understand implied meanings. Questions about the author's purpose or tone evaluate deeper comprehension beyond literal understanding. Vocabulary-in-context questions assess whether students can determine word meanings from surrounding text rather than just testing isolated vocabulary knowledge.

When designing reading assessments, we need to consider what sub-skills we're targeting. Are we testing skimming for general meaning? Scanning for specific information? Detailed reading for complete understanding? Inferencing? Understanding text organization? Different question types and instructions guide students to use different reading strategies, so alignment between the skill being tested and the task type is essential.

It's important to avoid culturally-specific content that might give certain students an unfair advantage. If a reading passage assumes knowledge of American football and uses this as the basis for comprehension questions, students familiar with the sport have an advantage unrelated to reading ability. Similarly, avoid passages where questions can be answered through general knowledge without actually reading the text, as this doesn't test reading at all.

Listening assessment follows similar principles but with some unique challenges. Like reading tests, listening assessments should use realistic audio samples appropriate for students' levels. These might include conversations, announcements, short talks, interviews, or academic lectures, depending on students' needs and proficiency levels.

The biggest challenge in listening assessment is that students can't control the pace of input the way they can with reading. Once audio plays, it moves at a fixed speed, and students must process information in real time. This makes listening inherently more difficult for many students. Most listening tests address this by playing recordings two or three times, giving students multiple opportunities to catch information they might have missed initially.

Check Your Understanding

According to the text, how do most listening tests attempt to address the particular challenge that listening assessment presents?

Most listening tests provide students with a written transcript of the audio after the first listening, allowing them to check and refine their initial comprehension against the exact words spoken and thereby correct any decoding errors before submitting their final answers
Most listening tests simplify the audio input by using slow, carefully enunciated speech delivered by trained voice artists, ensuring that the pace and clarity of the recording removes the processing pressure that makes real-world listening more demanding than reading
Most listening tests allow students to pause and replay individual sections of the audio at their own pace, replicating the degree of learner control that reading assessment provides and thereby reducing the inherent difficulty gap between the two receptive skill modalities
Most listening tests play recordings two or three times, giving students multiple opportunities to catch information they might have missed initially

Just as with reading, different question types assess different listening sub-skills. Multiple-choice questions work well for testing main ideas and specific details. Gap-fill exercises, where students write missing words, require careful listening for exact information. True/false statements check accurate comprehension. Matching tasks might ask students to match speakers to opinions or match descriptions to pictures.

Listening assessment can also test students' ability to recognize pronunciation features. Can they distinguish between minimal pairs like "ship" and "sheep"? Can they identify whether a speaker used contractions like "don't" versus the full form "do not"? Can they recognize different question types by intonation patterns alone? These focused tasks assess phonological awareness, which supports overall listening comprehension.

Assessing note-taking skills provides valuable information about listening ability. When students listen to a short lecture, take notes, and then answer questions based on their notes, we're evaluating whether they can identify key information, recognize organizational patterns, and process extended listening input. This task type is highly authentic in academic contexts where students must listen to lectures and retain information.

When scoring listening assessments, be mindful of spelling. If a student clearly heard and understood the word but spelled it slightly wrong in their written answer, they've demonstrated listening comprehension even if their writing isn't perfect. Some listening tests specify that spelling doesn't count unless the misspelling creates a different word. This decision depends on whether you're testing listening only or combining listening and writing assessments.

Both reading and listening assessments benefit from incorporating a variety of text types and topics to maintain interest and reflect realistic language use. A reading test might include a news article, an advertisement, an email, and an excerpt from a story. A listening test might include a phone conversation, a weather forecast, part of a podcast, and someone giving directions. This variety keeps students engaged and assesses comprehension across different contexts.

Page 4 of 6

[ Section image ]

Assessing Productive Skills: Speaking and Writing

Speaking Assessment Oral Rubrics Assessment Formats Fluency Discourse Management

In contrast to receptive skills, assessing speaking and writing presents different challenges because we're evaluating students' ability to produce language rather than understand it. These assessments require clear criteria and often involve more subjective judgment. Speaking assessment is particularly challenging because speaking is ephemeral - it happens, and then it's gone unless recorded.

Additionally, speaking ability encompasses multiple components, including pronunciation, fluency, grammar, vocabulary, interactive communication, and discourse management. An effective speaking assessment must consider which of these elements to evaluate and how to score them fairly.

Common speaking assessment formats include oral interviews, presentations, role-plays, picture descriptions, and group discussions. Each format has strengths and limitations. One-on-one interviews allow for individualized questioning and can adapt to students' levels, but the teacher's presence may intimidate some students.

Presentations demonstrate extended speaking ability and organizational skills, but they might be memorized rather than spontaneous.

Role-plays create realistic communication contexts, but shy students may struggle to perform at their actual ability level.

Group discussions reveal interactive skills and turn-taking abilities, but assessment becomes complicated when multiple students speak simultaneously.

Check Your Understanding

According to the text, what specific limitation does it identify for group discussions as a speaking assessment format?

Assessment becomes complicated when multiple students speak simultaneously, making it difficult to evaluate individual performances fairly
Group discussions can only reveal whether learners manage conversational turn-taking and cannot produce reliable information about individual learners' accuracy, vocabulary range, or ability to sustain extended speaking across a range of topics
Group discussions are limited to classroom contexts and cannot be used in formal examination settings, making them unsuitable for summative assessment and therefore inappropriate for inclusion in end-of-course or end-of-semester testing programmes
The unequal distribution of speaking time in group discussions means that talkative students receive a disproportionate advantage over quieter students, and rubric scores therefore reflect personality and confidence rather than the range of speaking competences the assessment is designed to measure

Many speaking tests combine several formats to get a comprehensive picture of ability. For instance, a test might begin with simple personal questions to help students warm up, then move on to describing a picture or photo, and conclude with more abstract questions related to the picture topic. This progression from concrete to abstract tasks allows assessment across a range of difficulty levels.

Regardless of format, speaking assessment requires clear rubrics that define what's being evaluated and what performance at different levels looks like. A typical rubric might include separate criteria for fluency, pronunciation, grammar, vocabulary, and interactive communication. Each criterion would have descriptors for different score levels, like excellent, good, satisfactory, and needs improvement. For example, under fluency, an excellent rating might describe speech that flows naturally with minimal hesitation, while a need for improvement might describe speech with frequent long pauses and difficulty maintaining conversation.

Under pronunciation, excellent might indicate easily intelligible speech with appropriate word and sentence stress, while needs improvement might describe speech that's difficult to understand due to pronunciation issues. These descriptors help raters make consistent judgments.

Grading speaking tests feels subjective and messy compared to written work, where errors are visible on the page.

A simple rubric might list categories like fluency (10 points), accuracy (10 points), vocabulary (10 points), and pronunciation (10 points). The teacher listens to each student's presentation, gets a general impression, and assigns scores that feel about right.

But this approach creates problems.

When students ask, "Why did I get 7 for fluency instead of 8?" or "What do I need to do to improve my vocabulary score?" The teacher has difficulty providing clear answers. Vague responses like "speak more smoothly" or "use more varied words" frustrate both teacher and student.

Without clear criteria, the teacher second-guesses their own scoring, wondering if they're being fair and consistent. The transformation begins with a complete redesign of the rubric. Instead of just numbers, each category gets specific, observable descriptors at multiple levels. Before the next speaking test, students see the rubric and watch video examples of previous students (with permission) at each level.

The teacher pauses after each video and asks, "What did you notice about this speaker's fluency? Which level does this match?" During the actual test, scoring becomes straightforward - the teacher checks off which descriptors match each student's performance.

When students ask why they received a certain score, the teacher can point to specific descriptors: "You're at the developing level for accuracy because you made frequent basic grammar errors, like missing plural -s and incorrect verb tenses. To become proficient, you need to use basic structures accurately most of the time." Students stop arguing about grades because the criteria are transparent. Even better, they use the rubric to self-assess during practice presentations, identifying their own weak areas before the test. The detailed rubric becomes not just an assessment tool but a learning tool - students know exactly what "good" looks like.

Speaking assessment is time-consuming. Testing each student individually takes far more time than giving a written test to the whole class at once. This practical limitation means speaking assessment often happens less frequently than written assessment. Some programs conduct formal speaking tests once per term, supplemented by informal observation of speaking during regular classes. Recording students' performances allows for more careful assessment later and creates documentation for accountability or appeals.

Page 5 of 6

Writing Assessment Writing Rubrics Process Portfolios Focused Feedback Task Achievement

Writing assessment is more practical than speaking assessment in some ways: students can write simultaneously, and written work can be evaluated carefully afterward without time pressure. However, writing assessment still requires detailed rubrics and thoughtful consideration of what's being evaluated. Writing tasks should reflect the types of writing students have practiced and will need in real life. For general English students, this might include emails, informal letters, opinion essays, and descriptive paragraphs. For academic English students, tasks might include formal essays, research summaries, and academic reports. For business English students, appropriate tasks could include professional emails, proposals, and reports.

Whatever the task type, instructions must be crystal clear. Students need to know the required length, the specific purpose and audience for their writing, any organizational requirements, and how much time they have. Ambiguous prompts lead to confusion and invalid assessment because students might misunderstand what's expected.

Writing rubrics typically evaluate multiple dimensions of writing quality. Most include criteria for content and ideas, organization and structure, grammar and vocabulary, and mechanics like spelling and punctuation. Some rubrics also assess task achievement - did the student actually address the prompt fully? Under content and ideas, we're evaluating whether the writer develops the topic with sufficient detail and relevant examples. Does the essay actually answer the question asked? Are ideas explained clearly? Is there enough supporting evidence and elaboration?

Organization and structure examine whether the writing has a clear introduction, body, and conclusion, whether paragraphs are well-structured with topic sentences and supporting details, and whether transitions connect ideas smoothly. Grammar and vocabulary assessment considers range and accuracy. Does the writer use a variety of sentence structures and vocabulary, or is everything simple and repetitive? Are grammatical structures used accurately, or are there frequent errors that impede understanding?

Mechanics covers surface-level correctness, such as capitalization, punctuation, and spelling. While these matter, they're typically weighted less heavily than content and organization unless errors seriously interfere with meaning. It's important to address the perennial question of how much to weigh accuracy versus communication. A piece of writing might have numerous small grammatical errors but still communicate ideas effectively, while another piece might be grammatically perfect but boring and empty of real content. Most current approaches emphasize meaningful communication over perfect grammar, reflecting real-world writing where getting your message across matters most.

Another consideration in writing assessment is the authenticity of tasks and allowable resources. In real life, writers have time to plan, draft, revise, and edit. They can use dictionaries, grammar references, and spell-checkers. Timed, closed-book writing tests don't reflect this reality. On the other hand, take-home writing assignments might receive help from others that doesn't reflect the student's actual independent ability.

Process-writing portfolios offer one solution: students submit drafts that show their work over time, allowing assessment of both process and product while maintaining authenticity. Both speaking and writing assessments should provide students with clear rubrics beforehand so they know how they'll be evaluated. This transparency is fair and helps prepare effectively. When students understand that organization and idea development matter more than perfect grammar, they can focus their efforts appropriately.

Check Your Understanding

According to the text, what is the specific problem identified with take-home writing assignments as an assessment approach?

Take-home writing assignments are less practical than timed in-class tests because they require more complex prompts, place greater administrative demands on institutions in terms of submission tracking, and take significantly longer to mark due to the greater average length of student responses
Students completing take-home writing tasks tend to focus exclusively on surface-level accuracy such as grammar and spelling rather than developing ideas and organization, because access to reference tools allows them to eliminate the visible errors that would otherwise guide teachers toward important improvement priorities
Take-home writing assignments might receive help from others that doesn't reflect the student's actual independent ability
Take-home writing assignments produce inconsistent results because some students have access to better reference resources, quieter study environments, and longer preparation time than others, creating a fairness issue that in-class testing conditions remove by standardizing the circumstances under which all students complete the task

A common approach to marking is to be thorough: take essays home, spend hours meticulously marking every error in red ink, catch every grammar mistake, spelling error, awkward phrasing, and punctuation problem. Return essays that look like they've been through a battle, covered with corrections, circles, and notes in the margins. The intention is to be helpful - after all, isn't detailed feedback what students need to improve?

But the results tell a different story. Students glance at their heavily-marked papers, sigh, and stuff them into their bags. Few actually read the comments carefully. Worse, the same errors appear again in the next assignment. Hours of detailed corrections seem to make no difference. The problem is that comprehensive feedback is paralyzing. When a student sees thirty different errors marked on their essay, they don't know where to start. The feedback is so overwhelming that it actually prevents improvement rather than enabling it. What you should be doing is applying focused feedback on just two or three key areas - the most important issues holding back each student's writing.

When marking an essay, instead of correcting everything, the teacher writes a comment at the top: "Your ideas are clear and interesting. I want you to focus on two main areas: (1) Verb tense consistency - you're mixing past and present tense in the same paragraph, and (2) Using articles (a, an, the) correctly before nouns. I've marked examples of these in your essay. Let's work on these first." The teacher underlines examples of these two issues throughout the essay and ignores other errors for now. At the end, they write three specific things the student did well: "Excellent topic sentences," "Good use of transitions between paragraphs," and "Strong concluding statement."

The change in student response is dramatic. Students actually read the feedback carefully because it isn't overwhelming. They can focus their improvement efforts on two or three achievable goals. Students approach the teacher two weeks later: "I've been really watching my verb tenses. Can you check if I'm doing better?" They're engaged with their own learning in a way that comprehensive feedback has never achieved. A simple feedback form can structure this approach:

What you did well (2-3 specific strengths)
Focus areas for improvement (2-3 specific issues with examples)
Next steps (concrete actions: "In your next essay, try..." or "Review lesson notes on...")

The counterintuitive truth is that less feedback, when it's focused and actionable, is much more powerful than comprehensive feedback that paralyzes students. Students improve faster, and teachers spend less time marking. Everyone wins.

Page 6 of 6

Assessment & Feedback: Lesson 19.2

Principles of Good Test Design

Assessing Receptive Skills: Reading and Listening

Assessing Productive Skills: Speaking and Writing

Skip to Practice Quiz >

Teach English Abroad

Online TEFL Courses

Quick Links

Get In Touch