SUMMARY:Making Web Maps with Google Fusion Tables
Fusion Tables is an experimental data visualization web application (launched in 2009) for gathering, visualizing, and sharing data tables. You can create web maps (e.g., dot maps, choropleths, heat maps) in a matter of minutes. It is a relatively easy-to-use tool and ideal for sharing data and visualizations with collaborators. Fusion Tables also has its own API for developers that allows for more complex data visualizations. We'll also discuss how it compares with, and can be used in conjunction with, Google My Maps. No programming background is required for this workshop, although HTML or programming knowledge is beneficial.
SUMMARY:Regular Expressions
Regular expressions are perfectly suited for people who like puzzles. A regular expression is a sequence of characters that defines a search pattern. Regular expressions are commonly used for "find" and "find and replace" string operations, and to validate strings such as phone numbers and passwords during data entry. Regular expression capabilities can be found in a variety of programming languages and software, including ArcGIS, Java, JavaScript, MATLAB, Perl, PHP, Python, R, and Visual Basic, as well as some text editors. This workshop is part I of a two-part series and will start with the basics. It will consist of hands-on example problems. Learn to search beyond "*.txt". The tutorials will be conducted using Python. A basic programming background is helpful but not required for this workshop.
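As a rough illustration of the "find", "find and replace", and validation uses mentioned above, here is a minimal Python sketch using the standard `re` module. The sample text, the `.md` replacement, and the phone-number format are hypothetical choices made for this example only; they are not taken from the workshop materials.

```python
import re

# Hypothetical sample text for illustration.
text = "Reports: notes.txt, data.csv, readme.txt, draft.TXT"

# "Find": every file name ending in .txt, ignoring case.
txt_files = re.findall(r"\b\w+\.txt\b", text, flags=re.IGNORECASE)

# "Find and replace": swap the .txt extension for .md.
renamed = re.sub(r"\.txt\b", ".md", text, flags=re.IGNORECASE)

# Validation: an assumed phone format of the form 555-123-4567.
PHONE = re.compile(r"\d{3}-\d{3}-\d{4}")

def is_valid_phone(candidate):
    """Return True only if the whole string matches the assumed format."""
    return PHONE.fullmatch(candidate) is not None

print(txt_files)
print(renamed)
print(is_valid_phone("555-123-4567"), is_valid_phone("5551234567"))
```

Using `fullmatch` rather than `search` for validation means that extra characters before or after the number cause the check to fail.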
SUMMARY:Statistics: A Review
A one-day, intensive review of common statistical methods for the design, measurement, analysis, and presentation of scientific investigations. The workshop is designed for any scholar engaged in quantitative research. Statistics: A Review discusses answers to the following questions:

What should we measure?
What are the main design types; what are the comparative advantages of each?
How are the sample sizes determined?
What are the appropriate inference procedures?
What do standard error, p-value and confidence level mean?
What are some dangers we need to avoid?
How should we display our results?
What are the statistical software options?
SUMMARY:SPSS I: Introduction to SPSS
Note: Topic order is subject to change. 
This workshop is designed to introduce participants to SPSS. It will cover the fundamentals of SPSS, within-case transformations, data management with multiple files, and basic statistics and graphics. Useful for any scholar engaged in quantitative research. 
Fundamentals 
This portion introduces SPSS, the menu and the help systems, and the three main types of files used. It then addresses defining variables, attaching labels, defining missing values, and various ways to enter data into SPSS. Finally, it covers a brief introduction to obtaining frequency distributions, descriptive statistics, and cross tabulations of variables. 
Within-Case Transformations 
This portion introduces data management capabilities, including recoding variables (manual and automatic), computing new variables using formulas, and counting occurrences of values within subjects. Attention then turns to temporary transformations, conditional processing of transformations, and repetitive transformations. 
Data Management with Multiple Files 
This portion begins with a discussion of subsetting data files by drawing samples, selecting groups and excluding groups from analysis. Then, the two main methods of merging SPSS data files are covered: adding additional variables and adding additional cases. 
Basic Statistical Analysis 
This portion includes a brief demonstration of a statistical analysis in SPSS. While not delving deep into statistical theory, we will cover the basics of an analysis and discuss the graphing facilities in SPSS.
SUMMARY:Regular Expressions II
Regular expressions are perfectly suited for people who like puzzles. A regular expression is a sequence of characters that defines a search pattern. Regular expressions are commonly used for "find" and "find and replace" string operations, and to validate strings such as phone numbers and passwords during data entry. Regular expression capabilities can be found in a variety of programming languages and software, including ArcGIS, Java, JavaScript, MATLAB, Perl, PHP, Python, R, and Visual Basic, as well as some text editors. This workshop is part II of a two-part series and will cover more advanced topics such as captured groups, backreferences, and assertions. It will consist of hands-on example problems. A basic understanding of regular expressions is required. You should be able to understand expressions like "\w{3,}-\d{1,2}-\d{4}" and "des*ert?s?". The tutorials will be conducted using Python. A basic programming background is helpful but not required for this workshop.
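For a sense of the level involved, the sketch below runs the two prerequisite expressions in Python and then gives one small instance each of a captured group, a backreference, and a lookahead assertion. All sample strings are hypothetical and chosen only for illustration; the workshop's own exercises may differ.

```python
import re

# The two prerequisite patterns quoted in the description.
date_like = re.compile(r"\w{3,}-\d{1,2}-\d{4}")
desert_like = re.compile(r"des*ert?s?")

# Three or more word characters, 1-2 digits, then 4 digits, joined by hyphens.
date_ok = date_like.fullmatch("Feb-5-2018") is not None
date_bad = date_like.fullmatch("Fe-5-2018") is not None

# s* allows zero or more "s"; t? and s? are optional, so "deer" matches too.
words = ["desert", "dessert", "deserts", "deer", "dert"]
matched = [w for w in words if desert_like.fullmatch(w)]

# Captured groups: parentheses let you pull the pieces apart.
month, day, year = re.fullmatch(r"(\w{3,})-(\d{1,2})-(\d{4})", "Feb-5-2018").groups()

# Backreference: \1 must repeat whatever group 1 matched (a doubled word).
doubled = re.search(r"\b(\w+) \1\b", "it was the the best").group(1)

# Lookahead assertion: digits only when followed by " dollars".
amounts = re.findall(r"\d+(?= dollars)", "15 dollars, 20 euros, 7 dollars")

print(date_ok, date_bad, matched)
print(month, day, year, doubled, amounts)
```

The lookahead checks what follows the digits without consuming it, so " dollars" is not part of the returned matches.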
SUMMARY:Android Application Development: Why and How
Mobile app development is one of the biggest new industries of the last decade. In 2017, Google stated that over 2 billion devices around the world use the Android operating system, alongside Apple's 1 billion iOS mobile devices. The capabilities and widespread adoption of modern smartphones can provide an easy gateway to collect data or engage with participants for research purposes. It might seem a daunting task for newcomers to write a program for the computer in their pocket, but modern tools and software make it more accessible than ever. We will present a brief overview of the differences between Android and iOS development requirements, and demonstrate how to program an app for your Android phone using a Windows PC environment and a sample of free software packages.
A familiarity with programming languages and logic is assumed. Program code presented during the workshop may be written in languages including (but not limited to) Java and C#; familiarity with the syntax of these languages is recommended. Attendees are not expected to program during the workshop, but those who wish to do so should install the "Android Studio" IDE in advance.
SUMMARY:Stata 1: Introduction to Stata
Note: Topics are subject to change. 
Topics: 

Basics – Interfacing with Stata, Do-files, getting help.
Working with Data Sets – Importing, opening, and saving data files.
Data Management – Getting familiar with your data, adding informative labels, basic checks for issues.
Data Manipulation – Generating new variables, working with subsets of data, merging files, reshaping files.
Programming – A basic and gentle introduction to some of the more advanced Stata programming.
Note: This workshop does not cover any statistical modeling; see Stata 2: Statistical Modeling in Stata for those topics.
SUMMARY:Statistical Analysis with R
This is a two-day workshop (February 5 and 6) on R, a free and open-source environment for data analysis and statistical computing. While R contains many built-in statistical procedures, a powerful feature of R is the facility for users to extend these procedures to suit their own needs. Excellent graphing capability is another reason R is gaining wide popularity.
Topics:

How to Obtain R
Help Tools
Importing / Exporting Data
Data Management
Descriptive and Exploratory Statistics
Common Statistical Analyses (t-test, Regression Modeling, ANOVA, etc.)
Graphics
Creating Functions
SUMMARY:Stata 2: Statistical Modeling in Stata
This workshop is aimed at participants with some basic knowledge of Stata (e.g., having taken Stata 1), and covers statistical analyses and graphing tools within Stata. Topics include descriptive statistics, t-tests, regression, mixed models, multiple imputation, and survey data.
Topics: 
(Topics subject to change) 
We will cover the following statistical topics: 

Descriptive statistics
Data visualization
t-tests and chi-square tests
Regression (linear, logistic)
Mixed effects regression (a.k.a. repeated measures regression, hierarchical linear models, panel data regression)
Multiple Imputation
Analyzing survey data (using Stata's svy: prefix)

Each topic will begin with a short theoretical discussion of the method, followed by a practical lesson and several examples.
SUMMARY:Mixed Models with R
Mixed models are an extremely useful modeling tool for situations in which there is some dependency among observations in the data, where the correlation typically arises from the observations being clustered in some way. For example, it is quite common to have data in which we have repeated measurements for the units of observation, or in which the units of observation are otherwise clustered (e.g., students within schools, cities within geographic regions). While there are different ways to approach such a situation, mixed models are a very common and powerful tool for doing so. In addition, they have ties to other statistical approaches that further expand their applicability.
 
The goal of this workshop is primarily to provide a sense of when one would use mixed models and how to incorporate a variety of standard techniques. It is very applied in nature, and only assumes a basic understanding of standard regression models (and use of R for such models).
SUMMARY:SEM Series I: Graphical Models
This workshop will help participants develop skills in understanding graphical models, specifically from the perspective of structural equation modeling (SEM). After a general overview of concepts, regression approaches with observed variables (path analysis) will be demonstrated, with particular attention to mediation models. Alternative approaches and non-SEM settings will also be discussed. Prerequisites: One should have a firm understanding of basic regression estimation techniques. R will be the program of choice, but nothing beyond very basic skill is assumed.

 
While this workshop can serve as a standalone session, the content is required knowledge for SEM III: Structural Equation Modeling.
SUMMARY:Applied Survival Analysis (Event History, Reliability Analysis)
This 2-day workshop (February 26 and 27) covers basic concepts and common analytical approaches for time-to-event data, known variously as survival analysis (in biological and medical sciences), event history analysis (in social sciences), or reliability analysis (in engineering). The workshop will be held in a computer lab and methods will be illustrated with hands-on exercises in SAS, R, SPSS, and/or Stata, as needed. Topics include Kaplan-Meier estimation, two-sample comparisons, Cox proportional hazards regression, and discrete time models.
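To give a flavor of the first topic, here is a minimal pure-Python sketch of the Kaplan-Meier product-limit estimate. The workshop itself uses SAS, R, SPSS, and/or Stata; the function name `kaplan_meier` and the toy data below are hypothetical and serve only to illustrate the calculation, in which the survival estimate is multiplied by (1 - events / number at risk) at each time an event occurs.

```python
def kaplan_meier(times, events):
    """Return [(time, survival)] at each distinct time with at least one event.

    times  : observed durations
    events : 1 if the event occurred at that time, 0 if the case was censored
    """
    data = sorted(zip(times, events))
    n_at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        n_events = 0
        n_leaving = 0
        # Group all cases that share the same observed time.
        while i < len(data) and data[i][0] == t:
            n_events += data[i][1]
            n_leaving += 1
            i += 1
        if n_events > 0:
            survival *= 1 - n_events / n_at_risk
            curve.append((t, survival))
        # Events and censored cases both leave the risk set after time t.
        n_at_risk -= n_leaving
    return curve

# Toy data (hypothetical): five subjects, two of them censored.
curve = kaplan_meier([1, 2, 2, 3, 4], [1, 1, 0, 1, 0])
for t, s in curve:
    print(t, round(s, 3))
```

Censored cases contribute to the number at risk up to their censoring time but never trigger a drop in the curve, which is what distinguishes this estimate from a simple proportion of survivors.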
SUMMARY:SEM Series II: Latent Variables
This workshop will help participants develop skills in understanding and conducting latent variable models, particularly from the perspective of structural equation modeling. After a conceptual overview, a broad view of matrix factorization techniques will be provided along with specific examples (e.g., PCA, 'factor analysis'). In addition, measurement error issues, reliability, and scale development will be discussed (e.g., 'confirmatory' factor analysis). Prerequisites: One should have a firm understanding of basic regression. R will be the program of choice, but nothing beyond very basic skill is assumed (e.g., import data, run a regression).
While this workshop can serve as a standalone session, the content is required knowledge for SEM III: Structural Equation Modeling.
SUMMARY:SEM Series III: Structural Equation Models
This workshop will help participants develop skills in understanding and conducting structural equation models. After an initial review of path analytic and measurement model techniques with latent variables (covered in SEM Series I and II), focus will shift to putting both strategies into a single coherent structural equation modeling framework. As such, SEM I and II, or an understanding of their content, are *required*. If you have not taken them, you must have gone over the relevant content, available at m-clark.github.io/docs/sem.
SUMMARY:Machine Learning: Concepts and Application
Machine learning can be described as a form of data analysis, often utilizing well-known and familiar techniques, that has a somewhat different focus than traditional analytical practice in many disciplines. The key notion is that flexible, automatic approaches are used to detect patterns within the data, with a primary focus on making predictions on future data. Among other topics, we will look at the trade-offs between model interpretability and prediction accuracy, supervised versus unsupervised learning, and regression versus classification problems.
A familiarity with standard regression analysis as typically presented in applied disciplines is assumed. Regarding programming, demonstrations and exercises in R and Python will be provided, so one should have familiarity with either. This will definitely NOT be an introduction to a programming language, an introduction to statistics, or an introduction to statistical programming specifically. However, you do not need to be an expert in any of those.
SUMMARY:Introduction to Survey Design: Data Collection, Questionnaire Design and Response Processes
This lecture-format workshop will present an overview of available modes and methods of survey data collection as well as an introduction to the survey response process and its implications for questionnaire design. Participants will gain an appreciation of the tradeoffs inherent in survey design decisions and how design can affect data quality and survey errors. Topics will include:

Survey errors, in particular measurement, coverage, and nonresponse error.
What to consider when selecting a data collection method for a particular research question.
Measurement (response) error and how to reduce it through question wording/format and questionnaire structure.
The role of the interviewer and interviewer effects.
