Why does SMOTE raise "Found input variables with inconsistent numbers of samples"?
I am trying to classify emotions in tweets using a dataset of 4401 tweets. When I use a small sample of the data (around 15 tweets), everything works fine, but when I use the full dataset it raises this error:

    Found input variables with inconsistent numbers of samples: [7, 3520]

The error happens when I try to oversample the data with SMOTE after transforming it with CountVectorizer. This is the code where the error is raised:

    # N-gram Feature and Term Frequency
    vectorizer = CountVectorizer(ngram_range=(1,3))
    x_train_tf = vectorizer.fit_transform(str(x_train).split('\n')).toarray()
    x_test_tf = vectorizer.transform(str(x_test).split('\n')).toarray()
    df_output = pd.DataFrame(data =x_train_tf, columns = vectorizer.get_feature_names_out())
    display(df_output)
    # the print shape is (7 rows × 250 columns)

    smote = SMOTE(random_state=42, k_neighbors=5)
    x_smote, y_smote = smote.fit_resample(x_train_tf, y_train)
    print("Total Train Data SMOTE : ",x_smote.shape), print("T...
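
One likely cause, assuming `x_train` is a pandas Series of tweet text (the question does not show how it is built): `str(x_train)` returns the printed summary of the Series, which pandas shortens for long data, so `str(x_train).split('\n')` yields only a handful of lines instead of one per tweet. That would explain 7 feature rows against 3520 labels. The sketch below uses made-up stand-in data and scikit-learn's `check_consistent_length` helper in place of the SMOTE call to show the mismatch and a possible fix, which is to pass the Series itself to the vectorizer.

```python
import pandas as pd
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.utils import check_consistent_length

# Hypothetical stand-in data (not the asker's dataset): 3520 short "tweets"
# with alternating labels.
x_train = pd.Series([f"tweet number {i} feels happy" for i in range(3520)])
y_train = pd.Series([i % 2 for i in range(3520)])

# str() gives the printed summary of the Series, which pandas truncates
# for long data, so splitting on newlines loses almost every tweet.
repr_lines = str(x_train).split("\n")
n_repr_lines = len(repr_lines)

# Passing the Series itself gives one row per tweet.
vectorizer = CountVectorizer(ngram_range=(1, 3))
x_train_tf = vectorizer.fit_transform(x_train.astype(str))
n_rows = x_train_tf.shape[0]

# Raises ValueError ("Found input variables with inconsistent numbers of
# samples") if the row counts differ; passes silently here.
check_consistent_length(x_train_tf, y_train)

print(n_repr_lines, n_rows, len(y_train))
```

Under the same assumption, the test set would be transformed the same way, with `vectorizer.transform(x_test.astype(str))`, before the features and labels are handed to `smote.fit_resample`.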