Statistics

Marcus Bauer; Martina Vetter; Kathrin Stückrath; Meron Yohannes; Zelalem Desalegn; Tewodros Yalew; Yonas Bekuretsion; Tariku W. Kenea; Maureen Joffe; Eunice J van den Berg; Julien I. Nikulu; Kamate Bakarou; Shyam S. Manraj; Olufemi J. Ogunbiyi; Ima-Obong Ekanem; Festus Igbinoba; Mohenou Diomande; Clement Adebamowo; Charles P. Dzamalala; Angelica A. Anele; Annelle Zietsman; Moses Galukande; Milena Foerster; Isabel dos-Santos-Silva; Biying Liu; Pablo Santos; Ahmedin Jemal; Tamrat Abebe; Claudia Wickenhauser; Barbara Seliger; Valerie McCormack; Eva J. Kantelhardt

Improve Research Reproducibility A Bio-protocol resource

Home
Protocols

Concise Method

Statistics

MB Marcus Bauer

MV Martina Vetter

KS Kathrin Stückrath

MY Meron Yohannes

ZD Zelalem Desalegn

TY Tewodros Yalew

YB Yonas Bekuretsion

TK Tariku W. Kenea

MJ Maureen Joffe

EB Eunice J van den Berg

JN Julien I. Nikulu

KB Kamate Bakarou

SM Shyam S. Manraj

OO Olufemi J. Ogunbiyi

IE Ima-Obong Ekanem

FI Festus Igbinoba

MD Mohenou Diomande

CA Clement Adebamowo

CD Charles P. Dzamalala

AA Angelica A. Anele

AZ Annelle Zietsman

MG Moses Galukande

MF Milena Foerster

Id Isabel dos-Santos-Silva

BL Biying Liu

PS Pablo Santos

AJ Ahmedin Jemal

TA Tamrat Abebe

CW Claudia Wickenhauser

BS Barbara Seliger

VM Valerie McCormack

EK Eva J. Kantelhardt

This method is extracted from research article: Cancer Immunol Res, Jun 2023

Regional variation in the tumor microenvironment, immune escape and prognostic factors in breast cancer in sub-Saharan Africa

DOI: 10.1158/2326-6066.CIR-22-0795

Request a Protocol

Ask a question

Favorite

The Mann–Whitney U test was employed to compare clinical data. Variables originating from gene expression analyses were assessed by multivariate regression using generalized linear models (Binomial, Poisson or Gamma families, depending on the response variable) in order to estimate odds ratios (OR) and confidence intervals (CI) while adjusting for the following confounding factors: age (continuous variable, by year), grading (binary variable, grade 3 vs. 1 or 2), quality score (discrete quantitative variable, 1 to 10), stage (binary variable, advanced vs. early), IHC subtype (categorical variable) and region of origin (categorical variable). Higher OR denoted concordance between higher gene expression values and increasing variable values (age, grading, quality and stage) or as compared to a reference category (‘luminal A-like’ in the case of IHC subtype and ‘East’ in the case of Region). Missing age entries were imputed using the median age of patients of the country of origin. Otherwise, patients with missing information in any other variable were excluded from regression analyses. Multivariate regressions were performed using the statsmodels library for Python (www.statsmodels.org). Linear correlations between variables were estimated as Pearson’s correlations. Survival analyses were performed on 400 patients from SSA (recruited between 2005 and 2017, maximum follow-up time of 36 months) using Kaplan–Meier estimators and with differences calculated with log-rank tests (unadjusted) or Cox proportional hazard models (adjusted for confounding factors). Cox proportional hazard models were implemented and plotted alongside the corresponding Kaplan–Meier curves using the Lifelines library for Python (https://lifelines.readthedocs.io/). Except when stated otherwise, all statistical analyses were performed using IBM SPSS Statistics or GraphPad Prism v9.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

Post a Question

0 Q&A

Share your protocol with your peers.

Submit a Preprint Protocol