My first Statistical Society of Canada Annual Meeting

conferences stats-ed

Notes and links from SSC2021. I often find I get the most from conferences if I take notes with the vague notion of sharing them. So this is mostly for me, but I hope it might be of interest to others, too. Note: This post will be updated sporadically.

Liza Bolton
2021-06-07
SSC 2021 Conference banner. Image description: Map of Canada with pink to red gradient from west to east on a black background. Test in top right says 2021 Annual Meeting - Virtual in English and French. All on black background.

Figure 1: SSC 2021 Conference banner. Image description: Map of Canada with pink to red gradient from west to east on a black background. Test in top right says 2021 Annual Meeting - Virtual in English and French. All on black background.

My SSC profile

Talks you should go to (very biased recommendations)

Independent Summer Statistics Community (ISSC): Building a sustainable online undergraduate student community with Cocurricular Activities and Experiential Learning Opportunities

1:45 PM - 2:00 PM EDT on Monday, June 7

Event link

Nathalie Moon

Abstract

In response to disruptions to our students’ plans due to COVID-19 and widespread feelings of isolation, we built a virtual community to help our students build portfolios, meet peers, and explore careers. In all, over 700 students in our programs signed up, and among the 164 students who responded to our end-of-summer survey, 41% were active and 48% passive participants; our data suggests that even passive participation was beneficial in making students feel more connected. As part of the ISSC, we held formal and informal data science workshops, social events, and career-building activities culminating in a DataFest COVID-19 Virtual Challenge. In all, 92 eligible teams applied to participate in DataFest, 62 were invited to compete, and 42 submitted complete submissions. In this talk, we will outline the principles guiding how the ISSC was structured including practical advice and tips for building a sustainable community, lessons learned, and plans for 2021.

Résumé

En réponse aux perturbations des plans des élèves en raison de la COVID-19 et au sentiment d’isolement généralisé, nous avons créé une communauté virtuelle pour aider les étudiants à monter leur portfolio, à rencontrer leurs pairs et à découvrir des carrières. En tout, plus de 700 étudiants se sont inscrits à nos programmes et, parmi les 164 étudiants qui ont répondu à notre enquête de fin d’été, 41 % étaient des participants actifs et 48 % des participants passifs. Nos données suggèrent que même la participation passive a été bénéfique en donnant aux étudiants le sentiment d’être plus connectés. Dans le cadre de la Communauté statistique indépendante d’été, nous avons organisé des ateliers formels et informels sur les sciences des données, des événements sociaux et des activités de développement de carrière, qui ont débouché au défi virtuel des données DataFest COVID-19. Au total, 92 équipes admissibles ont posé leur candidature pour participer à ce défi, 62 ont été invitées à concourir et 42 ont soumis des dossiers complets. Dans cet exposé, nous présenterons les principes qui ont guidé la structure de la Communauté statistique indépendante d’été, les conseils pratiques et les astuces qui ont contribué à créer une communauté durable, ainsi que les leçons apprises et nos plans pour 2021.

The Development and Implementation of a Toolkit for Learning R at all Levels.

3:45 PM - 4:00 PM EDT on Wednesday, June 9

Event link

Samantha-Jo Caetano

Abstract

Statistics and data science have become ubiquitous in the hard and social sciences. When working with large data or complex methodology it is crucial that the data analysts are able to program. R is a statistical programming language that is free and popular in the statistics community. R works well for data visualizations, wrangling and employing simple to complex methodology. As educators in statistics we noticed a variation of programming backgrounds in our senior students. Our team of 7 undergraduate students, 2 graduate students, and 2 assistant professors have developed a toolkit to help students improve their programming in R. The toolkit is a set of interactive modules that students complete autonomously. The modules start from the very basics of installing R to tidyverse to employing Bayesian methods. In this talk, we will outline the development and uses of this toolkit, and highlight some next steps.

Résumé

Les statistiques et les sciences des données sont devenues omniprésentes dans les sciences dures et humaines. En travaillant avec des données volumineuses ou des méthodologies complexes, il est primordial que les analystes de données soient capables de programmer. R est un langage de programmation gratuit et populaire au sein de la communauté statistique. Il fonctionne à merveille pour les visualisations de données, leur préparation préalable et l’adoption de méthodologies simples ou complexes. En tant qu’éducateurs en statistique, nous avons remarqué que les formations en programmation varient parmi nos étudiants de cycle supérieur. Notre équipe de sept étudiants de premier cycle, deux étudiants de cycle supérieur et deux professeurs assistants ont conçu une boîte à outils afin d’aider les étudiants à rehausser leurs aptitudes de programmation en R. La boîte à outils est composée d’un ensemble de modules interactifs que les étudiants terminent de façon autonome. Les modules commencent par la base de l’installation de R, puis progressent vers tidyverse jusqu’à l’emploi de méthodes bayésiennes. Lors de cet exposé, nous décrirons les grandes lignes du développement et de l’utilisation de cette boîte à outils, et soulignerons les étapes à venir.

Talk notes

Notes and reflections of varying quality from the talks I attended + resources shared.

Note: If I’ve written about talk and there is anything you’d like me to correct or add, please just let me know! Twitter/email links in the navigation bar.

Invited Presidential Address: Probability, Statistics, and Murder

Jeffrey Rosenthal, @ProbabilityProf

Jeff took us on a journey through his experiences with media and especially as an expert witness in court cases. There are a mish-mash of links below and much more on his website: http://probability.ca.

An example of a case that was especially interesting/surprising was when he was an expert witness in a case about a marijuana grow-op. This story was really neat because significant jail time rests on whether or not the number of plants was more or less than 500. More than 500 leads to a mandatory three-year jail term.

The level of aggressive attacks you end up facing as a expert witness sounds extremely off-putting! I’m glad Jeff’s skin is think enough to go out and do this kind of work, because mine certainly is not.

Some of Jeff’s final notes and advice:

Here are several links that were referred to during that talk or shared in chat. (Not exhaustive.)

Data privacy in official statistics (Panel Session)

Donald Estep, Canadian Statistical Sciences Institute and Simon Fraser University, @donestep1

Natalie Shlomo, University of Manchester

John Eltinge, US Census Bureau

Anne-Sophie Charest, Université Laval

Pierre Desrochers, Statistics Canada

My background is using official statistics for social and health research in Aotearoa New Zealand, so it was excellent to dip a toe back into this world and hear about the context here.

Big thanks to CANSSI for supporting this panel. @CANSSIINCASS

mverse: An R Library for Teaching and Conducting Multiverse Analysis

Michael Moon, University of Toronto, @micbonmoon

Great presentation from Michael, and I’m super excited about playing with this package!

Independent Summer Statistics Community (ISSC): Building a sustainable online undergraduate student community with Cocurricular Activities and Experiential Learning Opportunities

Nathalie Moon, University of Toronto

I was part of the ISSC team, so not notes, other than I think Nathalie did a fab job explaining it.

Unofficial list of Canadian universities doing ASA DataFest: U of T, UBC, Waterloo, MacEwan/University of Alberta — we should all be friends!

Online Homework Impact in an Introductory Statistics Course

Tharshanna Nadarajah, University of Toronto

Neat intro to MyOpenMath advantages and how Tharshanna has used it in teaching, student response and outcomes.

Tactile Response Experimental Analysis Toolkit (TREAT)

Sohee Kang, University of Toronto

Some lovely data collection activities that students can do in class from a phone or computer. Love this from the abstract: “Students often feel disengaged with data that they do not perceive as being”real" or “authentic”, and it is important that they believe that the data they are analyzing is representative of real-world problems."

Q&A for Statistical Education 2 session

Cautiously Constructing Charts

Michael Correll, Tableau

The Case Against Explainable Artificial Intelligence and Machine Learning

Boris Babic, INSEAD but soon to be U of T

Closer Than They Appear: A Bayesian Perspective On Individual-Level Heterogeneity In Risk Assessment

Kristian Lum, UPenn, @KLDivergence

On the Use of Auxiliary Variables in Multilevel Regression and Poststratification

Yajuan Si, University of Michigan, @yajuansi

List of papers about MRP authored by Si and colleagues

Figure 2: List of papers about MRP authored by Si and colleagues

Survey Calibration via the Generalized-Method-of-Moments (GMM)

Heng Chen, Bank of Canada

An initiative for promoting an inclusive, equitable and diverse environment at SSC

Nothing specific to say other than how glad I am that SSC and NSERC and CANSSI and partner institutions are committed to improving EDI.

Folks to follow: @BouchraNasri, @donestep1, @statacake, @alejandroadem

A Gentle Introduction to the Poisson Process for a Third Year Probability Course

Lengyi Han, UBC

The Development and Implementation of a Toolkit for Learning R at all Levels

Sam Caetano (presenting), University of Toronto @StatisticalSam

Also check out: @RohanAlexander, @michaelycchong, @pailfodgetts

Developing and Revising the Student Survey of Motivational Attitudes Toward Statistics: Results from a Pilot Study

Douglas Whitaker, Mount Saint Vincent University @DouglasWhitaker

I’m looking forward to seeing how these tools develop!

Using a “Midterm Warning System” to Improve Student Performance and Engagement in an Introductory Statistics Course: An ongoing RCT Study

Nooshin Rotondi, Ontario Tech University

Loved this talk! Excited to see next steps.

Data Science and Analytics Section Workshop

Chris Holdgraf, 2i2c.org, @choldgraf

An article about computational science in a scientific publication is not the scholarship itself, it is merely advertising of the scholarship. The actual scholarship is the complete software development environment and the complete set of instructions which generated the figures.

~ Buckheit and Donoho
WaveLab and Reproducible Research, 1995

The rest of the session was hand on workshops from David Liu, Nathan Taback ( @NathanTaback) and Nathaniel Stevens. https://github.com/ssc-datascience/pythonjupyter_wshop2021 They were really good!

Reading list

Books

Goodreads shelf for SSC2021

Articles


  1. In the sense of Frankfurt’s On Bullshit http://www2.csudh.edu/ccauthen/576f12/frankfurt__harry_-_on_bullshit.pdf↩︎

Citation

For attribution, please cite this work as

Bolton (2021, June 7). Liza Bolton: My first Statistical Society of Canada Annual Meeting. Retrieved from blog.lizabolton.com/posts/2021-06-07_ssc2021/

BibTeX citation

@misc{bolton2021my,
  author = {Bolton, Liza},
  title = {Liza Bolton: My first Statistical Society of Canada Annual Meeting},
  url = {blog.lizabolton.com/posts/2021-06-07_ssc2021/},
  year = {2021}
}