newtFire {dh}
Maintained by: Elisa E. Beshero-Bondar (eeb4 at Creative Commons License Last modified: Sunday, 09-Jan-2022 23:37:34 UTC. Powered by firebellies.

Spring 2021: Classes meet M W F 1:25 - 2:15pm over Zoom. Zoom attendance is required for all students.

Schedule: Spring 2021

DIGIT 210: Lionpath class number: 5773. This course fulfills a core Digital Humanities requirement for the Digital Media, Arts, and Technology (DIGIT) major and an elective toward the Data Visualization Minor at Penn State.


Dr. Elisa Beshero-Bondar (Dr. B), Professor of Digital Humanities and Program Chair of DIGIT.

Text Analysis: Course Description

This course orients you to text and document data formats, and engages you in hands-on activities to manipulate these: to translate unstructured texts into structured forms in order to mark, track, and explore complex data. In this course, you will learn methods for marking, extracting, and analyzing data from digital documents to produce infographics such as graphs, charts, diagrams, and maps, which you will design in the context of real projects. This course is meant to be complementary with DIGIT 110: Text Encoding, but where the emphasis in that course is on curating and preparing reading views of documents, this course concentrates on analyzing data. Neither course is meant to be a prerequisite for the other: you may take either one as a beginner. Returning students (in either semester) review and help mentor beginning students for overlapping units they have experienced in the other course.

Learning to Code: Our Context

You do not need any background at all with computer programming or web development to succeed in this course. We teach practical programming as a foundational skill (like reading, writing, and arithmetic) that all students should experience regardless of major or background. We also teach it in the writerly context of clear communication and documentation, which helps to build communities and connect projects over long periods of time.

Learning Objectives:

Optional Textbook and Other Class Resources

Other resources

More resources will be added as we work together this semester!

Explanatory Guides and Exercises: Complete List

Class Web Resources:


Homework Exercises (30%):

To keep up with this class, you must work on exercises regularly. Each day will involve some small assignment, to prepare you for the next class, and to help you to build your course project. 90% Rule: If students do not submit at least 90% of the regular homework assignments, the grade for the homework portion is based on the percentage of homework they completed. Students should therefore aim to submit at least 90% of the regular homework assignments, and complete at least 90% of the work in each component of the course.

About homework assignments: Coding and project review exercises in this course are about your active learning, and not (as in other courses) a way of testing whether you have already learned something we covered in class or in an assigned reading. You may often need to look up how to do something that you don’t already know how to do. Often, there will be multiple ways of accomplishing the task and we are not simply looking for you to do things perfectly in just one way. We are instead looking for a record of your learning process as you take on a challenge. Documenting problems is key to learning, and sometimes just writing out what you are trying to do helps lead you to a solution! There may be times when you don’t get the result you want in the homework, and that is to be expected! In those cases you can still get full credit for the assignment if you’ve made a serious attempt and if you submit, along with your code, a description of what else you tried, what results you expected, what results you got, and what you think went wrong. Getting stuck is part of the learning process. You will see me get stuck sometimes, and I will need your eyes to help me fix something! As long as you’ve described your understanding of the problem and your attempts to resolve it on your own, you will do well: documentation of how you get stuck is key. One of our goals is to form a supportive coding community in this class, so we are comfortable with unsticking each other.

I will read and evaluate all student homework, and will post assessments on Canvas. Coding assignments are assessed as check plus, check, check minus, or redo. Don’t think of these as grades, since, if you resubmit a redo to correct a serious problem, you will receive full credit for the assignment. My comments on homework are feedback for learning purposes. If you have not engaged with the assignment adequately (whether that means solving the tasks or discussing the coding obstacles you encountered and how you dealt with them), we will ask you to meet with us to review the issues and then complete a followup (redo) task in order to receive credit. For assignments with posted solutions, I will invite you to review the posted solution on GitHub and comment on it (we will show you how to do this) to address something you learned from the solution or did in a different way. For some assignments where we review posted solutions and line-comments together in person or in class, we will write back to you with individual comments only if your specific submission raises an issue that we don’t address elsewhere. If we don’t return your assignment, that means that we have nothing to add to our posted solution, but should you have any specific questions after you’ve read our posted solution, please ask the instructors. We will also go over assignments together in our regular class meetings to get the class unstuck.

Issue posts: Throughout the course, we’ll assign discussion posts on our class GitHub site in which you will respond to online readings or evaluate web resources. Your post should do more than summarize the article or site (which you could do just by skimming or reading the first paragraph); it should demonstrate thoughtful reflection on specific ideas and issues. When evaluating a web resource, don’t simply praise or condemn it without going into details about why a key component is effective or poorly designed. Good posts demonstrate care and reflection, and you may choose to respond to the overarching ideas of a piece, or to selected details of specific interest. These posts are scored as check plus, check, or check minus.

Participation: In Class and on GitHub (15%):

Coding and programming in real life is a social activity, and professionals in the real world aren’t “know-it-all” experts who work alone, but rather are tuned into discussion boards and regularly ask and answer questions to stay sharp and to learn from their community. In this class, we want you to work together and talk to each other and your instructors as your community resource, so we have built this into our course participation grade as a formal expectation. Beginning by week two, we’ll expect each student to post at least once per week on our course GitHub repo, and we strongly encourage you to do more than this minimum. Earn an A in participation by asking questions, making suggestions, and sharing helpful resources you’ve found. Help each other out by trying to answer questions on GitHub (and read the instructor posts too as we wade in to help). Your instructors will likely be dominating the class time as we model concepts and methods, so the GitHub Issues board gives the students a good space to form into a coding community to help each other and reflect together. Also, if you have a question about an assignment, always think of our GitHub Issues board as your first resource to check for helpful hints and to post your questions, because others may have the same question and answers are best shared! Of course you may e-mail us, but we really prefer that you go to the discussion board first, and doing so is, after all, worth course credit as your participation grade.

Tests (25%):

As scheduled throughout the course there will be a few (three or four) tests on the concepts and various kinds of markup technologies we are learning in the course. All will be take-home or taken online in between classes. They are open-book, open notes, but they must be completed individually and are designed to demonstrate that you have learned from the class material, coding assignments, and posted solutions. Tests may resemble homework assignments, but unlike homework exercises, these are given letter grades. These are given grades because they are evaluative and involve demonstrating what you have learned after we have finished a coding unit.

Projects (30%):

This course involves working on a team-based semester project. Project work will be scheduled with paced due dates throughout the semester, and will give you experience with team work to explore a research question and to document methods and discoveries using the coding and text analysis technologies addressed in our course.

Grading Scale:

Grades for the course are calculated and posted on Canvas, and follow this standard scale: A: 93-100%, A-: 90-92%, B+: 87-89%, B: 83-86%, B-: 80-82%, C+: 77-79%, C: 70-76%, D: 60-69%, F: 59% and below. In taking the course on a S / NC (pass-fail) basis, students must earn a C to receive Satisfactory credit.
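
For a rough sense of how the component weights and the scale above could combine, here is a minimal Python sketch. It is only an illustration, not the calculation Canvas performs: it assumes the course percentage is a plain weighted average of the four components (30% homework, 15% participation, 25% tests, 30% projects), and that a fractional score belongs to the band whose lower bound it reaches. The function names and the sample scores are made up for the example.

```python
# Component weights, taken from the section headings of this syllabus.
WEIGHTS = {"homework": 0.30, "participation": 0.15, "tests": 0.25, "projects": 0.30}

# Lower bound of each letter band, from the grading scale above.
# Assumption: a fractional score such as 92.5 falls into the lower band (A-).
SCALE = [(93, "A"), (90, "A-"), (87, "B+"), (83, "B"), (80, "B-"),
         (77, "C+"), (70, "C"), (60, "D")]


def course_percentage(scores):
    """Weighted average of the component scores, each given as 0-100."""
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)


def letter_grade(percentage):
    """Return the first letter whose lower bound the percentage reaches."""
    for lower_bound, letter in SCALE:
        if percentage >= lower_bound:
            return letter
    return "F"


# Hypothetical component scores for one student.
example = {"homework": 95, "participation": 90, "tests": 84, "projects": 88}
total = course_percentage(example)
print(round(total, 1), letter_grade(total))
```

With these sample scores the weighted total comes to about 89.4%, which lands in the B+ band.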

Course Policies:

Each day we are covering material that builds on earlier material and assignments, so your success depends upon regular attendance and completing each assignment on time.

Due dates and why we need them:

Your daily homework for this course is time-sensitive! Coding assignments, response posts, and other homework exercises must be uploaded to Canvas (or GitHub or our web server as specified), by the due date and time indicated on the class schedule. Homework assignments will be posted online to our class website and linked from our schedule, so students who miss class are nevertheless expected to consult the schedule and submit assignments on time. Because we post and share answers to homework exercises after submission deadlines, we will usually not accept late homework submissions.

Exam Policy:

All exams will be take-home, to do on your own time, with submissions due in Canvas or by web submission. Because I will be posting answers and sharing them in class, I do not allow people to write exams after the solutions are posted. However, I will drop your lowest exam score for the class, so that you may miss one exam without penalty.
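
To illustrate how dropping the lowest exam score works out in practice, here is a small Python sketch. It is a hedged example rather than the official computation: it assumes every exam counts equally and that a missed exam is entered as 0. The function name and the sample scores are hypothetical.

```python
def exam_average(scores):
    """Mean of exam scores (0-100) after dropping the single lowest one.

    Assumptions: all exams count equally, and a missed exam is entered
    as 0, so it becomes the score that is dropped.
    """
    if len(scores) < 2:
        raise ValueError("need at least two exam scores to drop one")
    kept = sorted(scores)[1:]  # discard the lowest score
    return sum(kept) / len(kept)


# Four exams, one of them missed (entered as 0).
print(exam_average([88, 0, 92, 81]))
```

In this example the missed exam is the one dropped, so the average is taken over 81, 88, and 92 only.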

Zoom Attendance and Classroom Courtesy:

Our class meets regularly over Zoom. I am not putting a grade on your attendance, but I will expect your active presence and interaction with me and your classmates this semester, as we need to rely on each other to learn and develop projects.

Our class can be fast-paced and requires that we all be making the best use we can of our Zoom class sessions. Minimize distractions around you so you can concentrate on our class when we are meeting. Also try to participate from a location where you can speak freely.

If you need to miss classes for health reasons, make arrangements with me and your peers to catch up. We will always be meeting online (asynchronously via chat and GitHub, and via Zoom for class meetings), and we will find ways to keep you looped in.

Student (and Faculty) Health and Wellness Services

If any of us, you students or I, are feeling sick with COVID, flu-like symptoms, or other serious ailments this semester, please contact Behrend Student Health & Wellness Services at 814-898-6217. None of us can be sure what will happen with the COVID pandemic, and we are taking on a great risk this semester. Reporting in when you do not feel well is not shameful; it is responsible and important to protect yourself and our community.

Also, this semester may be more stressful than usual with so much uncertainty! Many students at Penn State face personal challenges or have psychological needs that may interfere with their academic progress, social development, or emotional wellbeing. Seek help! The university offers a variety of confidential services to help you through difficult times, including individual and group counseling, crisis intervention, consultations, online chats, and mental health screenings. These services are provided by staff who welcome all students and embrace a philosophy respectful of clients’ cultural and religious backgrounds, and sensitive to differences in race, ability, gender identity and sexual orientation. Counseling and Psychological services are available through the Personal Counseling Office in Reed Union Bldg. Rm 1: 814-898-6504.


Penn State takes great pride in fostering a diverse and inclusive environment for students, faculty, and staff. Acts of intolerance, discrimination, or harassment due to age, ancestry, color, disability, gender, gender identity, national origin, race, religious belief, sexual orientation, or veteran status are not tolerated and can be reported through Educational Equity via the Report Bias webpage.


Each student is issued a University email address upon admission. This email address may be used by the University for official communication with students. Students are expected to read email sent to this account on a regular basis. Failure to read and react to University communications in a timely manner does not absolve the student from knowing and complying with the content of the communications. The University provides an email forwarding service that allows students to read their email via other service providers (e.g., Hotmail, AOL, Yahoo). Students who choose to forward their email from their address to another address do so at their own risk. If email is lost as a result of forwarding, it does not absolve the student from responding to official communications sent to their University email address. To forward email sent to your University account, go to, log into your account, click on Edit Forwarding Addresses, and follow the instructions on the page. Be sure to log out of your account when you have finished.

Academic Integrity

Penn State Erie, The Behrend College, puts a very high value on academic integrity, and violations are not tolerated. Academic integrity is the pursuit of scholarly activity in an open, honest and responsible manner. Academic integrity is a basic guiding principle for all academic activity at The Pennsylvania State University, and all members of the University community are expected to act in accordance with this principle. Consistent with this expectation, the University’s Code of Conduct states that all students should act with personal integrity; respect other students’ dignity, rights and property; and help create and maintain an environment in which all can succeed through the fruits of their efforts. Academic integrity includes a commitment by all members of the University community not to engage in or tolerate acts of falsification, misrepresentation or deception. Such acts of dishonesty violate the fundamental ethical principles of the University community and compromise the worth of work completed by others (Senate Policy 49-20 and G-9 Procedures). Any violation of academic integrity will receive academic and possibly disciplinary sanctions, including the possible awarding of an XF grade which is recorded on the transcript and states that failure of the course was due to an act of academic dishonesty. All acts of academic dishonesty are recorded so repeat offenders can be sanctioned accordingly. More information on academic integrity can be found at:

Source Citation and Plagiarism: One goal of our course is to reflect on how best to cite sources in digital contexts. We will consider how and why such citations differ from documenting printed texts. We will also consider the ease and frequency with which digital texts and graphics are plagiarized on the worldwide web, and discuss how the omission of source citations detracts from the authority of a digital information resource. We expect you to practice mindful source citation, and plagiarism on your part will have very serious consequences.

Representing the voice of another individual as your own voice constitutes plagiarism, however generous that person may be in “helping” you with an assignment. Turning in an assignment generated collectively under the name of a single individual is considered plagiarism. When instructed to collaborate on a project, project collaborators share collective authorship and should identify themselves directly as a team. To avoid plagiarism, cite your sources whenever you quote, paraphrase, or summarize material, or use digital images from any outside source (including websites, articles, books, course readings, Courseweb or GitHub postings, or someone else’s notes). When using the “copy” and “paste” features as you read and research, be sure to mark carefully that these passages are unprocessed from their source, so that you know to process them later. Forgetting to do so not only produces sloppy work but (whether you intended it or not) results in a false representation. As long as you make a clear, good-faith effort to cite your sources, you will not be faulted for plagiarism, but your work will be penalized if citations are inaccurate, unclear, or lack important information.

That said, the coding and digital development we do encourages collaboration, and for that reason we adopt our colleague David Birnbaum's Collaboration policy, since his course is very similar to ours. This policy specifies that students identify collaborators in a comment on submitted assignments and take care on projects that all students contribute equally (and that no student contributes excessively more than everyone else). When joining a group homework session, always work on the assignment by yourself first so you can be an equal participant, and write up the assignment by yourself after the session is over, so you take care not to copy from the other students. While we want you to consult with each other, you are responsible for doing all your writing and coding by yourself, using your own words.

Disability Services:

Penn State welcomes students with disabilities into the University’s educational programs. Every Penn State campus has an office for students with disabilities. The Student Disability Resources (SDR) website provides contact information for every Penn State campus. For further information, please visit the Student Disability Resources website. In order to receive consideration for reasonable accommodations, you must contact the appropriate disability services office at the campus where you are officially enrolled, participate in an intake interview, and provide documentation (see the documentation guidelines). If the documentation supports your request for reasonable accommodations, your campus disability services office will provide you with an accommodation letter. Please share this letter with your instructors and discuss the accommodations with them as early as possible. You must follow this process for every semester that you request accommodations. Penn State Behrend’s Disability Services Coordinator is Stacey Walbridge.


We gratefully acknowledge David Birnbaum’s Digital Humanities course at the University of Pittsburgh as our starting point and supporting resource for much of our development. Other inspirational resources include:

Projects that inspire us:

  • Obdurodon: where we learned what we can teach, and where we’re still learning.
  • Venice Time Machine: very ambitious, enormous project team of faculty and students to study and model a thousand years of Venice, digitizing "kilometers of archives."
  • Map of Early Modern London
  • Lord Byron and His Times: The very thoughtful stylistic design of this important project reproduces the style of nineteenth-century print and layout. The content makes many rare materials about Lord Byron’s social network searchable and connected to the web of linked open data.
  • The Shelley-Godwin Archive: digitizes the manuscripts of Percy and Mary Shelley, and Mary Shelley’s parents, William Godwin and Mary Wollstonecraft—manuscripts often written in multiple hands. Provides an important study of the Frankenstein notebooks to demonstrate how much of a role Percy Shelley played in the writing of Frankenstein. The archive provides a good model of the use of TEI for manuscript encoding and of complex and multiple visualizations of manuscript texts.
  • TokenX: a text visualization, analysis, and play tool
  • A Tour Through the Visualization Zoo
  • Clay Shirky on Love, Internet Style (9 minutes of YouTube inspiration: on what lasts, and why community matters in our digital worlds.)

Previous versions of this course