MedStat Complete Guide: Getting Started with Medical Statistics

Welcome to the MedStat guide! This comprehensive tutorial will walk you through using MedStat for your medical statistics analysis. Whether you're new to statistical analysis or an experienced researcher, you'll find step-by-step instructions for each feature.

Getting Started
Data Preparation
Firth Logistic Regression
Propensity Score Matching
Survival Analysis
Interpreting Results
Exporting Results

Getting Started with MedStat

MedStat is a web-based tool designed for medical researchers and biostatisticians. No installation is required – simply access the tool from any modern web browser.

System Requirements

Modern web browser (Chrome, Firefox, Safari, Edge)
Stable internet connection
Data file in CSV or Excel format
No software installation needed

Privacy and Security

Your data is processed entirely in your browser using client-side computation. MedStat does not store, upload, or transmit your data to any server. Your medical data remains completely private and secure.

Preparing Your Data for Analysis

Proper data preparation is crucial for accurate statistical analysis. Follow these guidelines to prepare your dataset:

Data Format Requirements

File Format: CSV (.csv) or Excel (.xlsx)
First Row: Should contain column headers (variable names)
Data Type: Variables should be clearly labeled as numeric, categorical, or binary
Missing Values: Use empty cells or "NA" for missing data
Encoding: UTF-8 or ASCII recommended

Variable Definition

Variable Type	Description	Examples
Outcome Variable	The result you're measuring (dependent variable)	Mortality (0/1), Disease status (yes/no)
Predictor Variables	Factors that may influence the outcome	Age, Gender, Treatment type
Confounders	Variables that may affect both exposure and outcome	Comorbidities, Baseline disease severity
Time Variable	For survival analysis only	Follow-up time (days/months)

Data Quality Checklist

✓ All required variables are present in your dataset
✓ No typos or formatting inconsistencies in variable names
✓ Numeric variables are formatted as numbers (not text)
✓ Binary outcomes are coded as 0/1 or Yes/No
✓ Missing values are clearly marked
✓ Data ranges are realistic and within expected bounds
✓ Sample size is adequate for your analysis (n ≥ 30 recommended)

Firth Logistic Regression: Step-by-Step

Firth Logistic Regression is a modified version of standard logistic regression that handles small sample sizes and complete separation better. It's ideal for medical studies with limited sample sizes.

When to Use Firth Logistic Regression

Binary outcome variables (yes/no, success/failure, diseased/healthy)
Small sample sizes (n < 100)
Rare outcomes or events
Complete separation in data (one group has 100% outcome)
Clinical trials with binary endpoints

How to Perform Firth Logistic Regression in MedStat

Upload your data file (CSV or Excel)
Select "Firth Logistic Regression" from the analysis menu
Designate your outcome variable (binary: 0/1)
Select predictor variables (independent variables)
Optional: Include adjustment variables (confounders)
Click "Run Analysis"

Interpreting Firth Logistic Results

Coefficient (β): Direction and magnitude of effect
Odds Ratio (OR): Relative odds of outcome for unit increase in predictor
95% CI: Range where true effect likely lies
p-value: Statistical significance (p < 0.05 is significant)

Propensity Score Matching: Complete Tutorial

Propensity Score Matching (PSM) reduces bias in observational studies by creating comparable treatment and control groups. It's essential for non-randomized medical research.

Understanding Propensity Scores

A propensity score is the predicted probability of receiving treatment based on observed baseline characteristics. In PSM, treated and control patients with similar propensity scores are matched, creating balanced comparison groups.

PSM Analysis Steps in MedStat

Upload your observational study data
Select "Propensity Score Matching" from the menu
Specify treatment variable (exposed/unexposed)
Select baseline characteristics to match on
Choose matching algorithm (1:1, 1:2, or caliper matching)
Execute analysis and review matching quality
Compare outcomes between matched groups

Assessing Matching Quality

After matching, always check covariate balance:

Standardized difference should be < 0.1 for all variables
Distribution of propensity scores should overlap between groups
Sample size should be adequate after matching

Survival Analysis and Kaplan-Meier Curves

Survival analysis examines time-to-event data, essential for oncology, cardiology, and epidemiology research.

Types of Survival Analysis in MedStat

Kaplan-Meier Estimator: Non-parametric method for survival curves
Cox Proportional Hazards Regression: Multivariate survival analysis
Log-Rank Test: Compare survival between groups

Data Requirements for Survival Analysis

Time variable (follow-up duration in days/months/years)
Event indicator (0 = censored, 1 = event occurred)
Grouping variable (optional, for stratified analysis)
Covariate data for Cox regression

Kaplan-Meier Curve Interpretation

Y-axis: Probability of survival (0-1)
X-axis: Time from baseline
Vertical drops: Events occurred
Censoring marks: Patients lost to follow-up

Interpreting Statistical Results

Understanding your results is crucial for drawing valid conclusions from your analysis.

Key Statistical Concepts

P-value: Probability of observing results if null hypothesis is true (α = 0.05)
Confidence Interval: Range of plausible values for effect estimate
Effect Size: Magnitude of the relationship (OR, HR, β)
Model Fit: How well model explains observed data (R², AIC)

Exporting and Sharing Your Results

MedStat allows easy export of results for reporting and publications:

Export Options

Export tables as CSV or Excel
Download publication-ready figures (PNG, PDF)
Generate summary statistics
Create coefficient tables for manuscripts

Best Practices for Medical Statistics

Always conduct descriptive analysis before inferential statistics
Check assumptions of your statistical test
Report confidence intervals alongside p-values
Account for multiple testing when appropriate
Handle missing data appropriately (never just delete)
Consult with a biostatistician for complex analyses

Common Questions and Troubleshooting

Q: My analysis won't run. What should I do?

A: Check that your data format is correct, variables are properly defined, and there are no unexpected characters in column headers.

Q: Is my data secure in MedStat?

A: Yes! All computation happens in your browser. Your data is never sent to our servers.

Q: Can I use MedStat for my published research?

A: Absolutely! Many researchers worldwide use MedStat for publications. Please cite our tool in methods section.

Additional Resources

Need Help?

If you encounter any issues or have questions about using MedStat, please refer to our methods documentation or visit our GitHub repository for more information.

← Back to MedStat