机器学习之------监督、非监督、半监督
无监督学习、监督学习和半监督学习;这些方法分别通过不同的数据标注方式实现对数据的学习与分析
Within the realm of machine learning, could you clarify the distinctions among unsupervised, supervised, and semi-supervised paradigms?
And what are some of the main algorithmic approaches to look at?
机器学习的问题通常可被视为函数近似的不同变体,在分类、预测或建模方面都有体现。
In supervised learning, one is provided with input variables x_1, x_2, and so on, along with corresponding output variables y_1, y_2, etc., and is aimed at identifying a function that models this relationship effectively in an easily generalizable manner. The outputs can represent class labels in classification tasks or continuous values in regression scenarios—this is where the term supervision comes into play. When dealing with such problems, your training dataset consists solely of labeled examples. The central task here is typically to predict missing labels for new instances without any prior labeling.]
Within the framework of unsupervised learning algorithms, at their starting point, an agent accepts a series of inputs—x₁,x₂,…—but lacks both target outputs and rewards from its environment. The approach depends on two factors: the problem type—whether it's classification or prediction—and your understanding of the data space. Unsupervised learning involves utilizing various techniques such as density estimation—for estimating an underlying probability distribution function for making predictions—or k-means clustering—for classifying unlabeled real-valued data. Additionally,k-modes clustering is employed for categorizing unlabeled categorical data. Unsupervised learning occurs when a system has no labeled training data available. Clustering methods are commonly used in such scenarios.
Semi-supervised learning relies upon function estimation techniques for both types of datasets: labeled and unlabeled. This method originates from the observation that generating labeled datasets can be quite expensive, while unlabeled datasets are relatively easy to obtain. The primary difficulty lies in developing effective strategies to integrate and analyze these mixed datasets. For a comprehensive understanding of semi-supervised learning methods, refer to the literature survey titled "Semi-Supervised Learning Literature Survey."
Specifically, both labeled and unlabeled datasets are utilized. This serves as an instance within deep belief networks, where certain layers engage in unsupervised learning to uncover underlying patterns, while another layer is trained using supervised learning techniques to achieve classification tasks.
Besides these types of learning approaches, there are other learning methods. Such as reinforcement learning techniques where the interaction between the learner and its environment leads to the generation of actions a1, a2,... which result in rewards or punishments r1, r2,...
