Добавлены итоговые блокноты, внешнее изображение

2025-05-15 00:54:26 +03:00 · 2025-05-15 00:54:26 +03:00 · a7a54edebc
commit a7a54edebc
parent a7003303f8
10 changed files with 734 additions and 581 deletions
--- a/.ipynb_checkpoints/README-checkpoint.md
+++ b/.ipynb_checkpoints/README-checkpoint.md
@ -0,0 +1,16 @@
 Описание задачи
 Цель: сегментировать изображение на несколько однородных регионов с использованием спектральной кластеризации .
 Алгоритм:
 Изображение преобразуется в граф
 Применяется спектральная кластеризация
 Результат отображается в виде контуров вокруг найденных регионов
 Поддерживаемые методы присвоения меток:
 'kmeans'
 'discretize'
 'cluster_qr'
 Для установки зависимостей:
 pip install -r requirements.txt
--- a/.ipynb_checkpoints/plot_coin_segmentation-checkpoint.ipynb
+++ b/.ipynb_checkpoints/plot_coin_segmentation-checkpoint.ipynb
@ -0,0 +1,179 @@
 {
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Сегментация изображения методом спектральной кластеризации\n",
    "Описание задачи\n",
    "Цель: разделить изображение монет на несколько регионов, используя спектральную кластеризацию .\n",
    "\n",
    "Алгоритм:\n",
    "\n",
    "Изображение преобразуется в граф\n",
    "Каждый пиксель — вершина графа\n",
    "Рёбра между соседними пикселями строятся на основе разницы яркости\n",
    "Применяется спектральная кластеризация для выделения регионов\n",
    "Источник данных:\n",
    "\n",
    "Встроенное изображение coins() из библиотеки skimage"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "1. Импорт библиотек"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "import time\n",
    "import numpy as np\n",
    "import matplotlib.pyplot as plt\n",
    "from scipy.ndimage import gaussian_filter\n",
    "from skimage.data import coins\n",
    "from skimage.transform import rescale\n",
    "\n",
    "from sklearn.cluster import spectral_clustering\n",
    "from sklearn.feature_extraction import image"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "2. Загрузка и подготовка изображения"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "# Загружаем изображение монет\n",
    "orig_coins = coins()\n",
    "\n",
    "# Уменьшаем размер изображения для ускорения обработки\n",
    "# Перед уменьшением применяем фильтр Гаусса, чтобы уменьшить артефакты\n",
    "smoothened_coins = gaussian_filter(orig_coins, sigma=2)\n",
    "rescaled_coins = rescale(smoothened_coins, 0.2, mode=\"reflect\", anti_aliasing=False)\n",
    "\n",
    "print(\"Форма исходного изображения:\", orig_coins.shape)\n",
    "print(\"Форма после масштабирования:\", rescaled_coins.shape)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "3. Построение графа"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "# Преобразуем изображение в граф\n",
    "graph = image.img_to_graph(rescaled_coins)\n",
    "\n",
    "# Уменьшаем веса рёбер экспоненциальной функцией\n",
    "beta = 10\n",
    "eps = 1e-6\n",
    "graph.data = np.exp(-beta * graph.data / graph.data.std()) + eps\n",
    "\n",
    "print(\"Граф построен\")"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "4. Спектральная кластеризация"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "# Количество регионов\n",
    "n_regions = 26\n",
    "n_plus = 3  # дополнительные кластеры для лучшей сегментации\n",
    "\n",
    "# Выполняем кластеризацию тремя способами\n",
    "for assign_labels in (\"kmeans\", \"discretize\", \"cluster_qr\"):\n",
    "    t0 = time.time()\n",
    "    \n",
    "    labels = spectral_clustering(\n",
    "        graph,\n",
    "        n_clusters=(n_regions + n_plus),\n",
    "        eigen_tol=1e-7,\n",
    "        assign_labels=assign_labels,\n",
    "        random_state=42,\n",
    "    )\n",
    "\n",
    "    t1 = time.time()\n",
    "    labels = labels.reshape(rescaled_coins.shape)\n",
    "\n",
    "    plt.figure(figsize=(5, 5))\n",
    "    plt.imshow(rescaled_coins, cmap=plt.cm.gray)\n",
    "    plt.xticks(())\n",
    "    plt.yticks(())\n",
    "\n",
    "    title = f\"Спектральная кластеризация: {assign_labels}, {t1 - t0:.2f}с\"\n",
    "    print(title)\n",
    "    plt.title(title)\n",
    "\n",
    "    for l in range(n_regions):\n",
    "        colors = [plt.cm.nipy_spectral((l + 4) / float(n_regions + 4))]\n",
    "        plt.contour(labels == l, colors=colors)\n",
    "\n",
    "plt.show()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "5. Интерпретация результатов\n",
    "Каждое изображение показывает контуры найденных регионов. Цвета используются только для наглядности и не имеют семантической нагрузки.\n",
    "\n",
    "Методы:\n",
    "\n",
    "'kmeans' — стабильный, но может быть медленным\n",
    "'discretize' — итеративный метод, работает быстрее\n",
    "'cluster_qr' — новый экспериментальный метод, основанный на QR-разложении"
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3 (ipykernel)",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.12.9"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 4
 }
--- a/.ipynb_checkpoints/plot_coin_segmentation_upd-checkpoint.ipynb
+++ b/.ipynb_checkpoints/plot_coin_segmentation_upd-checkpoint.ipynb
@ -0,0 +1,6 @@
 {
 "cells": [],
 "metadata": {},
 "nbformat": 4,
 "nbformat_minor": 5
 }
--- a/README.md
+++ b/README.md
@ -1 +1,16 @@
-https://www.kaggle.com/datasets/sachinpatel21/az-handwritten-alphabets-in-csv-format/data
+##Описание задачи
 Цель: сегментировать изображение на несколько однородных регионов с использованием спектральной кластеризации .
 #Алгоритм:
 Изображение преобразуется в граф
 Применяется спектральная кластеризация
 Результат отображается в виде контуров вокруг найденных регионов
 Поддерживаемые методы присвоения меток:
 'kmeans'
 'discretize'
 'cluster_qr'
 Для установки зависимостей:
 pip install -r requirements.txt
--- a/app.py
+++ b/app.py
@ -1,84 +0,0 @@
 import pandas as pd
 import numpy as np
 import matplotlib.pyplot as plt
 from sklearn.model_selection import train_test_split
 from sklearn.svm import SVC
 from sklearn.metrics import classification_report, ConfusionMatrixDisplay
 # -----------------------------
 # 1. Загрузка данных
 # -----------------------------
 # Попробуем определить, есть ли заголовки
 try:
    df = pd.read_csv('A_Z_HandwrittenLetters.csv')
    if 'label' in df.columns:
        print("Заголовки найдены, используется столбец 'label'")
    else:
        raise Exception("Столбец 'label' не найден")
 except Exception as e:
    print(f"Ошибка: {e}, попытка чтения без заголовков")
    df = pd.read_csv('A_Z_HandwrittenLetters.csv', header=None)
 # Разделение на признаки и метки
 if 'label' in df.columns:
    y = df['label'].values
    X = df.drop('label', axis=1).values
 else:
    y = df[0].values
    X = df.drop(0, axis=1).values
 # -----------------------------
 # 2. Нормализация
 # -----------------------------
 X = X / 255.0
 # -----------------------------
 # 3. Визуализация первых изображений
 # -----------------------------
 _, axes = plt.subplots(1, 4, figsize=(10, 5))
 for ax, image, label in zip(axes, X[:4], y[:4]):
    ax.imshow(image.reshape(28, 28), cmap='gray')
    ax.axis('off')
    ax.set_title(f"Label: {label}\n({chr(label + ord('A'))})")
 plt.suptitle("Sample Training Images")
 plt.show()
 # -----------------------------
 # 4. Разделение выборки
 # -----------------------------
 X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42, shuffle=True
 )
 # -----------------------------
 # 5. Обучение модели
 # -----------------------------
 print("Обучение модели...")
 clf = SVC(gamma=0.001)
 clf.fit(X_train, y_train)
 # -----------------------------
 # 6. Предсказание
 # -----------------------------
 y_pred = clf.predict(X_test)
 # -----------------------------
 # 7. Визуализация предсказаний
 # -----------------------------
 _, axes = plt.subplots(1, 4, figsize=(10, 5))
 for ax, image, prediction in zip(axes, X_test, y_pred):
    ax.imshow(image.reshape(28, 28), cmap='gray')
    ax.axis('off')
    ax.set_title(f"Prediction: {prediction}\n({chr(prediction + ord('A'))})")
 plt.suptitle("Predicted Letters")
 plt.show()
 # -----------------------------
 # 8. Отчеты и матрица ошибок
 # -----------------------------
 print("\nClassification Report:")
 print(classification_report(y_test, y_pred))
 disp = ConfusionMatrixDisplay.from_predictions(y_test, y_pred)
 disp.figure_.suptitle("Confusion Matrix")
 plt.show()
--- a/coins.jpg
+++ b/coins.jpg
--- a/plot_coin_segmentation.ipynb
+++ b/plot_coin_segmentation.ipynb
--- a/plot_coin_segmentation_upd.ipynb
+++ b/plot_coin_segmentation_upd.ipynb
--- a/plot_digits_classification.ipynb
+++ b/plot_digits_classification.ipynb
--- a/plot_digits_classification.py
+++ b/plot_digits_classification.py
@ -1,128 +0,0 @@
 """
 ================================
 Recognizing hand-written digits
 ================================
 This example shows how scikit-learn can be used to recognize images of
 hand-written digits, from 0-9.
 """
 # Authors: The scikit-learn developers
 # SPDX-License-Identifier: BSD-3-Clause
 # Standard scientific Python imports
 import matplotlib.pyplot as plt
 # Import datasets, classifiers and performance metrics
 from sklearn import datasets, metrics, svm
 from sklearn.model_selection import train_test_split
 ###############################################################################
 # Digits dataset
 # --------------
 #
 # The digits dataset consists of 8x8
 # pixel images of digits. The ``images`` attribute of the dataset stores
 # 8x8 arrays of grayscale values for each image. We will use these arrays to
 # visualize the first 4 images. The ``target`` attribute of the dataset stores
 # the digit each image represents and this is included in the title of the 4
 # plots below.
 #
 # Note: if we were working from image files (e.g., 'png' files), we would load
 # them using :func:`matplotlib.pyplot.imread`.
 digits = datasets.load_digits()
 _, axes = plt.subplots(nrows=1, ncols=4, figsize=(10, 3))
 for ax, image, label in zip(axes, digits.images, digits.target):
    ax.set_axis_off()
    ax.imshow(image, cmap=plt.cm.gray_r, interpolation="nearest")
    ax.set_title("Training: %i" % label)
 ###############################################################################
 # Classification
 # --------------
 #
 # To apply a classifier on this data, we need to flatten the images, turning
 # each 2-D array of grayscale values from shape ``(8, 8)`` into shape
 # ``(64,)``. Subsequently, the entire dataset will be of shape
 # ``(n_samples, n_features)``, where ``n_samples`` is the number of images and
 # ``n_features`` is the total number of pixels in each image.
 #
 # We can then split the data into train and test subsets and fit a support
 # vector classifier on the train samples. The fitted classifier can
 # subsequently be used to predict the value of the digit for the samples
 # in the test subset.
 # flatten the images
 n_samples = len(digits.images)
 data = digits.images.reshape((n_samples, -1))
 # Create a classifier: a support vector classifier
 clf = svm.SVC(gamma=0.001)
 # Split data into 50% train and 50% test subsets
 X_train, X_test, y_train, y_test = train_test_split(
    data, digits.target, test_size=0.5, shuffle=False
 )
 # Learn the digits on the train subset
 clf.fit(X_train, y_train)
 # Predict the value of the digit on the test subset
 predicted = clf.predict(X_test)
 ###############################################################################
 # Below we visualize the first 4 test samples and show their predicted
 # digit value in the title.
 _, axes = plt.subplots(nrows=1, ncols=4, figsize=(10, 3))
 for ax, image, prediction in zip(axes, X_test, predicted):
    ax.set_axis_off()
    image = image.reshape(8, 8)
    ax.imshow(image, cmap=plt.cm.gray_r, interpolation="nearest")
    ax.set_title(f"Prediction: {prediction}")
 ###############################################################################
 # :func:`~sklearn.metrics.classification_report` builds a text report showing
 # the main classification metrics.
 print(
    f"Classification report for classifier {clf}:\n"
    f"{metrics.classification_report(y_test, predicted)}\n"
 )
 ###############################################################################
 # We can also plot a :ref:`confusion matrix <confusion_matrix>` of the
 # true digit values and the predicted digit values.
 disp = metrics.ConfusionMatrixDisplay.from_predictions(y_test, predicted)
 disp.figure_.suptitle("Confusion Matrix")
 print(f"Confusion matrix:\n{disp.confusion_matrix}")
 plt.show()
 ###############################################################################
 # If the results from evaluating a classifier are stored in the form of a
 # :ref:`confusion matrix <confusion_matrix>` and not in terms of `y_true` and
 # `y_pred`, one can still build a :func:`~sklearn.metrics.classification_report`
 # as follows:
 # The ground truth and predicted lists
 y_true = []
 y_pred = []
 cm = disp.confusion_matrix
 # For each cell in the confusion matrix, add the corresponding ground truths
 # and predictions to the lists
 for gt in range(len(cm)):
    for pred in range(len(cm)):
        y_true += [gt] * cm[gt][pred]
        y_pred += [pred] * cm[gt][pred]
 print(
    "Classification report rebuilt from confusion matrix:\n"
    f"{metrics.classification_report(y_true, y_pred)}\n"
 )