Use of Social Media Data in Research 

Data obtained from social media platforms is often used for research in the Social, Behavioural & Educational Research (SBER) domain. Because these identifiable data are often publicly available online, and there is usually no intervention or interaction with the poster (i.e. content creator), researchers often mistakenly assume that ethics approval is not required.

Researchers are to note that using data from social media for research will usually require IRB review. Please refer to the table below for examples.

 

Scenario: Social media content that is usually considered non-identifiable, whereby the poster (i.e. content creator) is usually pseudonymous.
Examples: Reddit.
IRB review:
 
Review Not Required (RNR):
If you are only accessing data from openly-available channels or groups, and no traceable content will be published.
 

Expedited Review:
(i) If you will be interacting with posters (i.e. content creators) by posting into openly-available channels or groups; or

(ii) If you will be joining closed channels or groups to access the data within. Note that permission from the platform owner should be sought before conducting your research.

Scenario: Social media content that is usually considered identifiable, whereby the poster (i.e. content creator) is usually identifiable.
Examples: LinkedIn, Facebook, Instagram, X (formerly Twitter), etc.
IRB review:
 
Exempt (Category 4):
Provided the poster will not be contacted, and no traceable content will be published.
 

Expedited Review:
(i) If the poster will be contacted to collect more data through interaction or intervention.

(ii) If you will be joining closed channels or groups to access the data within. Note that permission from the platform owner should be sought before conducting your research.

Full Board Review:
If you will be scraping content from social media (e.g. using APIs) or collaborating with the social media platform to extract private data for your research, and consent from the poster (i.e. content creator) will not be obtained.

 

In addition, meeting any of the conditions below may necessitate IRB review:

  1. The data on social media is identifiable.
  2. The data comes from private forums (or channels or groups) rather than public forums.
  3. There is any interaction (e.g. messaging, posting, recruitment) with posters (i.e. content creators).
  4. The data is harvested or scraped.
  5. Traceable content (e.g. quotes, screenshots, images, usernames, verbatim text) is used, such that the poster (i.e. content creator) can be easily traced or re-identified.
  6. The research concerns sensitive topics (e.g. mental health, sexual harassment, racial violence).