3.4.8.9. Compare classifiers on the digits data
Compare the performance of a variety of classifiers on a test set for the digits data.
LinearSVC: 0.9344942114287969
GaussianNB: 0.8332741681010102
KNeighborsClassifier: 0.9804562804949924
------------------
/opt/hostedtoolcache/Python/3.12.6/x64/lib/python3.12/site-packages/sklearn/svm/_base.py:1235: ConvergenceWarning: Liblinear failed to converge, increase the number of iterations.
warnings.warn(
LinearSVC(loss='hinge'): 0.9294570108037394
LinearSVC(loss='squared_hinge'): 0.9344942114287969
-------------------
KNeighbors(n_neighbors=1): 0.9913675218842191
KNeighbors(n_neighbors=2): 0.9848442068835102
KNeighbors(n_neighbors=3): 0.9867753449543099
KNeighbors(n_neighbors=4): 0.9803719053818863
KNeighbors(n_neighbors=5): 0.9804562804949924
KNeighbors(n_neighbors=6): 0.9757924194139573
KNeighbors(n_neighbors=7): 0.9780645792142071
KNeighbors(n_neighbors=8): 0.9780645792142071
KNeighbors(n_neighbors=9): 0.9780645792142071
KNeighbors(n_neighbors=10): 0.9755550897728812
from sklearn import model_selection, datasets, metrics
from sklearn.svm import LinearSVC
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier

# Load the digits data and hold out 25% of it as a test set
digits = datasets.load_digits()
X = digits.data
y = digits.target
X_train, X_test, y_train, y_test = model_selection.train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Fit each classifier with default parameters and report its
# macro-averaged F1 score on the test set
for Model in [LinearSVC, GaussianNB, KNeighborsClassifier]:
    clf = Model().fit(X_train, y_train)
    y_pred = clf.predict(X_test)
    print(f"{Model.__name__}: {metrics.f1_score(y_test, y_pred, average='macro')}")

print("------------------")

# Compare the two LinearSVC losses (fitting LinearSVC on the raw pixel
# values can emit the ConvergenceWarning seen in the output above;
# a sketch after the script shows one way to address it)
for loss in ["hinge", "squared_hinge"]:
    clf = LinearSVC(loss=loss).fit(X_train, y_train)
    y_pred = clf.predict(X_test)
    print(
        f"LinearSVC(loss='{loss}'): {metrics.f1_score(y_test, y_pred, average='macro')}"
    )

print("-------------------")

# Vary the number of neighbors used by KNeighborsClassifier
for n_neighbors in range(1, 11):
    clf = KNeighborsClassifier(n_neighbors=n_neighbors).fit(X_train, y_train)
    y_pred = clf.predict(X_test)
    print(
        f"KNeighbors(n_neighbors={n_neighbors}): {metrics.f1_score(y_test, y_pred, average='macro')}"
    )
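
The ConvergenceWarning in the output above means that liblinear hit its iteration limit before the optimizer converged. A minimal sketch of one common remedy, not part of the original example: standardize the pixel values and allow more iterations (the value 10_000 for max_iter is an arbitrary, untuned choice).

# Sketch only: address the liblinear ConvergenceWarning by scaling the
# features and raising the iteration limit (illustrative settings)
from sklearn import datasets, metrics, model_selection
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import LinearSVC

digits = datasets.load_digits()
X_train, X_test, y_train, y_test = model_selection.train_test_split(
    digits.data, digits.target, test_size=0.25, random_state=0
)
clf = make_pipeline(StandardScaler(), LinearSVC(max_iter=10_000))
clf.fit(X_train, y_train)
print(metrics.f1_score(y_test, clf.predict(X_test), average="macro"))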
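
Choosing n_neighbors by inspecting test-set scores, as the last loop does, lets information from the test set leak into the hyperparameter choice. A sketch of a cross-validated alternative using GridSearchCV (the grid, the f1_macro scorer and cv=5 are illustrative choices, not part of the original example):

# Sketch only: pick n_neighbors by cross-validation on the training set,
# keeping the test set for a final evaluation
from sklearn import datasets, model_selection
from sklearn.neighbors import KNeighborsClassifier

digits = datasets.load_digits()
X_train, X_test, y_train, y_test = model_selection.train_test_split(
    digits.data, digits.target, test_size=0.25, random_state=0
)
grid = model_selection.GridSearchCV(
    KNeighborsClassifier(),
    param_grid={"n_neighbors": list(range(1, 11))},
    scoring="f1_macro",
    cv=5,
)
grid.fit(X_train, y_train)
print(grid.best_params_, grid.score(X_test, y_test))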
Total running time of the script: (0 minutes 0.258 seconds)