{ "cells": [ { "cell_type": "markdown", "id": "d25254ef-77e8-414a-a4c8-f367e48a4eff", "metadata": {}, "source": [ "# Анализ данных FDA Drug Adverse Events\n", "\n", "В данной работе проводится анализ данных о побочных эффектах лекарственных препаратов. \n", "Цель работы — загрузить датасет, изучить его структуру, провести базовый анализ и построить визуализации с использованием библиотек `pandas`, `numpy`, `matplotlib`, `seaborn` и `tqdm`." ] }, { "cell_type": "code", "execution_count": 1, "id": "f9f3cdba-7699-400d-941d-b6aa79357639", "metadata": { "jp-MarkdownHeadingCollapsed": true }, "outputs": [], "source": [ "import pandas as pd\n", "import numpy as np\n", "import matplotlib.pyplot as plt\n", "import seaborn as sns\n", "from tqdm import tqdm" ] }, { "cell_type": "code", "execution_count": 2, "id": "1665af01-f42a-48f3-aa29-2e9c8b823f96", "metadata": {}, "outputs": [ { "data": { "text/html": [ "
| \n", " | report_id | \n", "country | \n", "occur_country | \n", "report_type | \n", "serious | \n", "death | \n", "life_threatening | \n", "hospitalization | \n", "disabling | \n", "receive_date | \n", "... | \n", "reaction_pneumonia | \n", "reaction_bleeding | \n", "reaction_complexity | \n", "year | \n", "month | \n", "day | \n", "weekday | \n", "report_type_risk | \n", "age_drug_risk | \n", "gender_drug_risk | \n", "
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | \n", "26016170 | \n", "US | \n", "US | \n", "1 | \n", "2.0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "2025-11-10 | \n", "... | \n", "0 | \n", "0 | \n", "2 | \n", "2025 | \n", "11 | \n", "10 | \n", "0 | \n", "0.080736 | \n", "0.651584 | \n", "0.000000 | \n", "
| 1 | \n", "26017452 | \n", "NP | \n", "NP | \n", "1 | \n", "1.0 | \n", "1 | \n", "0 | \n", "1 | \n", "0 | \n", "2025-11-10 | \n", "... | \n", "0 | \n", "0 | \n", "4 | \n", "2025 | \n", "11 | \n", "10 | \n", "0 | \n", "0.080736 | \n", "8.846939 | \n", "0.000000 | \n", "
| 2 | \n", "26017586 | \n", "US | \n", "US | \n", "1 | \n", "2.0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "2025-11-10 | \n", "... | \n", "0 | \n", "0 | \n", "3 | \n", "2025 | \n", "11 | \n", "10 | \n", "0 | \n", "0.080736 | \n", "0.000000 | \n", "0.000000 | \n", "
| 3 | \n", "26018910 | \n", "SG | \n", "SG | \n", "1 | \n", "1.0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "2025-11-10 | \n", "... | \n", "0 | \n", "0 | \n", "2 | \n", "2025 | \n", "11 | \n", "10 | \n", "0 | \n", "0.080736 | \n", "4.428571 | \n", "0.071429 | \n", "
| 4 | \n", "26018949 | \n", "EU | \n", "EU | \n", "3 | \n", "1.0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "2025-11-10 | \n", "... | \n", "0 | \n", "0 | \n", "2 | \n", "2025 | \n", "11 | \n", "10 | \n", "0 | \n", "0.178571 | \n", "0.000000 | \n", "0.000000 | \n", "
5 rows × 40 columns
\n", "