Introduction to Algorithms: Part 1 Fundamentals

阅读量：

作者：禅与计算机程序设计艺术

1.简介

：

Algorithms form the core building blocks of computer science and are crucial in addressing a wide range of challenges encountered across diverse disciplines such as computing mathematics biology economics and engineering. This series of articles will delve into the essential algorithms integral to computer science employing examples from sorting searching graph traversal and linear programming. Additionally we will explore common data structures including arrays stacks queues and hash tables which facilitate efficient data storage and management. The emphasis will be on algorithm design methodologies time complexity evaluation as well as their significant applications spanning database indexing search engine functionality image processing and machine learning.

这是本博客系列的第一部分《算法入门》。它介绍了理解后续文章所需的基捶概念和术语。具体来说,它涵盖了以下主题:

Algorithmic Paradigms: This chapter covers two primary algorithmic approaches employed in the creation of algorithms. Among these approaches are the Greedy Algorithm and the Divide-and-Conquer technique. We will delve into the fundamental concepts underlying each approach through illustrative examples.

Elementary Data Structures: This chapter covers an introduction to four elementary data structures that are frequently used in algorithms, namely, Array, Stack, Queue, and Hash Table. These data structures are essential for efficient storage and manipulation of data during algorithm execution.

本章探讨了与时间复杂度分析相关的几个关键概念包括但不限于Big O符号最坏情况与平均情况对比最佳情况输入规模以及增长速率我们将学习如何通过分析算法在给定输入下的最坏或平均情况行为来计算其运行时间

Upon completing this series of articles, you would gain a thorough grasp of the fundamental principles and essential techniques needed to address intricate computational challenges efficiently. In conclusion, feel free to reach out with any inquiries or share your thoughts.

2. 基本概念术语说明：

2.1 算法与算法的属性

In the field of computer science, an algorithm represents a clearly defined specification outlining the steps necessary to execute a computation. It constitutes a finite sequence of precisely described operations designed to address specific problem categories or perform computational tasks. The process involves detailed instructions that ensure accuracy and efficiency in achieving desired outcomes through systematic execution.

概述而言, 算法是指为解决特定问题而设计的一系列操作步骤, 它通过明确指出了解决问题所需的计算步骤, 并提供了处理这些数据所需的条件; 通过对这些数据执行运算获得最终结果, 这些计算的结果即为算法的输出.

算法具有以下五个属性：

Input: 描述了输入信息的属性及其数量特征。
Output: 算法输出结果涵盖了数据特征及其范围。
Finiteness: 描述了算法运行时的行为模式。
Definability: 考虑到了算法是否可计算性问题。
Effectiveness: 着重于算法的有效性和正确性。

2.2 数据结构及其操作

2.2.1 数组

被称为数组的数据结构,它由一系列同一类型的数据元素组成,这些元素被排列在连续内存位置中,并且能够通过索引能被单独引用.

数组是一种排列在连续的一段内存空间中的相同类型的元素的组别。每个元素都可以通过索引单独引用。

操作符

A[i]：确定数组A中第i位置上的元素数值。
A[i] = x：设定数组A在第i位的位置上赋值给变量x。
len(A): 计算出反映数组A所包含元素数量的数据指标。

2.2.2 栈

The stack can be considered an abstract data type, functioning as a collection of elements. It features two primary operations: push(), which adds new elements to its top, and pop(), which removes the most recently added element that has not yet been removed from storage.

一个抽象的数据类型称为堆栈（stack），它类似于一个盘子叠加的结构，在这种机制中先进入的数据元素总是最先被取出。

操作符

push(x): 将元素x推入最上面一层的位置。
- pop(): 从堆栈的顶端取出元素并返回之。
- isEmpty(): 检测当前堆栈是否为空的状态。

2.2.3 队列

A queueing system, commonly referred to as a FIFO (first-in-first-out) structure, is an abstract data structure that organizes elements sequentially. It provides two fundamental methods: enqueue(), which adds an item to the end, and dequeue(), which removes an item from the front. Additionally, it supports other operations such as peeking and determining if it's empty.

一种常用的数据结构称为先进先出策略（FIFO），其中queue维护一个有序排列的元素序列。该数据结构支持维护一个有序排列的元素序列，并提供两个核心操作：add函数用于将新项添加到指定位置（通常为末尾），remove函数用于移除并返回第一个项。

操作符

add(x): 将元素x加入到队列的后端。
remove(): 从队列前端移除并返回一个元素。
check(): 检查队列是否为空。

2.2.4 哈希表

A hash table, also known as a dictionary or map, is an abstract data structure that implements an associative array. It allows for efficient lookups of values based on their keys. Each value in the hash table is stored in a bucket associated with a unique key, ensuring that each lookup operation typically operates in constant time.
The hash table computes the hash value of each key to determine the position of its corresponding bucket. This process allows for fast data retrieval and manipulation. The hash table uses collision resolution techniques to ensure that no two entries end up in the same bucket.

哈希表也称作字典或映射，在计算机科学中用于实现抽象数据类型中的关联数组。它通过键快速定位对应的值，并且使得在查找特定值时平均只需一次计算。此外，在哈希表中所有数值被组织在一个容器中，并通过唯一的键与之对应以实现高效的检索功能。

操作符

insert key-value pairs into the hash table: 将键值对插入到哈希表中。
- retrieve the value associated with a given key: 返回与给定键相关的值。
- check if a particular key exists within the hash table: 检查哈希表中是否存在特定的键。

3. 核心算法原理与具体操作步骤：

3.1 排序算法——选择排序

选择排序（Selection sort），也被称为Sorting algorithm，在计算机科学领域是一种基础且常用的排列算法。它的基本工作流程是：依次从未被排列的部分中挑选出最小的一个元素，并将其放置到已经排列好的部分的第一个位置上；接着，在剩余部分中继续搜索下一个最小值，并将该值放置到已有序列之后的位置上；如此反复循环这个步骤直至所有元素都被正确排列完毕。

下图展示了选择排序的过程：

实现选择排序的代码如下：

复制代码

    def selectionSort(arr):
    n = len(arr)
    
    # One by one move boundary of unsorted subarray 
    for i in range(n): 
        # Find the minimum element in remaining unsorted array  
        min_idx = i
        for j in range(i+1, n): 
            if arr[j] < arr[min_idx]: 
                min_idx = j 
    
        # Swap the found minimum element with the first element         
        arr[i], arr[min_idx] = arr[min_idx], arr[i] 
    
    return arr
    
      
      
      
      
      
      
      
      
      
      
      
      
      
      
    
    代码解读

这个算法的时间复杂度是 $O(n^2)$ , 但即使在最坏情况下也需要进行 $n$ 次比较与交换, 因此效率相对较低。然而, 由于其直观易懂且易于理解的特点, 它仍然是一个非常流行的排序算法之一。

3.2 搜索算法——二分查找

二分查找（Binary Search）是一种能够在有序数组中高效定位特定元素的方法。该算法通过不断缩小搜索区间来定位目标元素的位置，并最终确定其存在与否。

下图展示了二分查找的过程：

实现二分查找的代码如下：

复制代码

    def binarySearch(arr, l, r, x):
    while l <= r:
    
        mid = (l + r) // 2
    
        # Check if x is present at mid 
        if arr[mid] == x:
            return mid 
    
        # If x greater, ignore left half 
        elif arr[mid] < x:
            l = mid + 1
    
        # If x is smaller, ignore right half 
        else:
            r = mid - 1
    
    # If we reach here, then the element was not present 
    return -1
    
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
    
    代码解读

这个算法的时间复杂度为O(log n)。

3.3 遍历算法——深度优先搜索

深一层优先搜索（DFS）是一种用于遍历树或图的数据结构的方法。它朝着节点间的关系顺着层次走下去，并深入地探索各个分支。当节点v的所有邻居已探索完毕时，则会回溯并找到离v最近的一条前k路径。常用于在大规模网络中快速定位关键连接线路，并找到从源点到目标点之间的最短路径。

下图展示了深度优先搜索的过程：

实现深度优先搜索的代码如下：

复制代码

    from collections import deque
    
    def dfs(graph, start, k=None):
    visited = set([start])
    path = []
    queue = deque([(start, [])])
    
    while queue:
        vertex, ppath = queue.popleft()
    
        if k is None or k >= len(ppath)+1:
    
            if vertex not in visited:
                visited.add(vertex)
    
                neighbors = [n for n in graph[vertex]]
    
                queue += [(n, ppath + [vertex]) for n in neighbors if n not in visited]
    
            if vertex == goal:
                print('Found path:','-> '.join([str(v) for v in reversed(ppath + [goal])]))
                break
    
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
    
    代码解读

该算法的计算复杂度为 $O(V+E)$ ，其中 $V$ 代表节点数量, $E$ 代表边的数量.受图规模影响而变化,并随规模扩大而增大.

3.4 分治算法——归并排序

归并排序（Merge Sort）是一种基于归并操作的高效排序算法，在计算机科学中具有重要地位。它通过分治法（Divide and Conquer）将一个无序序列分解为多个有序子序列，并通过逐步合并这些子序列来实现整个序列的有序排列。具体而言，在处理包含n个元素的原始序列时，首先将其按照中位位置划分为两个较小的子序列，并对每个子序列递归地执行同样的划分过程直至每个子序列仅包含单个元素为止。随后通过有序地合并这些单个元素的子序列来构建最终的大规模有序排列结果。

下图展示了归并排序的过程：

实现归并排序的代码如下：

复制代码

    def mergeSort(arr):
    if len(arr) > 1:
    
        # Finding the mid of the array 
        mid = len(arr)//2
    
        # Dividing the array elements 
        L = arr[:mid] 
    
        # into 2 halves 
        R = arr[mid:] 
    
        # Recursively calling the function 
        mergeSort(L) 
        mergeSort(R) 
    
        i = j = k = 0
    
        # Copy data to temp arrays L[] and R[] 
        while i < len(L) and j < len(R): 
            if L[i] < R[j]: 
                arr[k] = L[i] 
                i+=1
            else: 
                arr[k] = R[j] 
                j+=1
            k+=1
    
        # Checking if any element was left 
        while i < len(L): 
            arr[k] = L[i] 
            i+=1
            k+=1
    
        while j < len(R): 
            arr[k] = R[j] 
            j+=1
            k+=1
    return arr
    
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
    
    代码解读

该算法的时间复杂度被评估为O(n log n)，它既具备稳定性又采用了递归策略。尽管实现过程较为复杂，在采用递归策略所带来的优势下，代码实现变得更加直观且易于理解。

4. 具体代码实例和解释说明：

4.1 数组

复制代码

    # 初始化数组
    arr = [64, 25, 12, 22, 11]
    
    print("初始化数组:")
    for i in range(len(arr)):
    print ("% d" % arr[i]),
    
    # 修改元素值
    arr[2]= 15
    print("\n修改后的数组:")
    for i in range(len(arr)):
    print("% d" % arr[i]),
    
    # 数组的长度
    print('\n数组的长度:',len(arr))
    
    # 最大值与最小值
    print("\n最大值:", max(arr))
    print("最小值:", min(arr))
    
    # 求和与平均值
    sum = 0
    for i in range(len(arr)):
    sum += arr[i]
    
    average = float(sum)/len(arr)
    print("\n求和:", sum)
    print("平均值:", average)
    
    # 从数组中删除元素
    del arr[2]
    print("\n删除元素后的数组:")
    for i in range(len(arr)):
    print("% d" % arr[i]),
    
    # 清空数组
    arr.clear()
    print("\n清空后的数组:", arr)
    
    # 切片
    print("\n切片后的数组:", arr[0:2])
    
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
    
    代码解读

4.2 栈

复制代码

    stack=[]
    
    # 入栈
    stack.append(4)
    stack.append(7)
    stack.append(11)
    
    # 查看栈顶元素
    print("栈顶元素:", stack[-1])
    
    # 出栈
    print("出栈元素:", stack.pop())
    print("栈顶元素:", stack[-1])
    
    # 判空
    if not stack:
    print("栈为空")
    else:
    print("栈不为空")
    
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
    
    代码解读

4.3 队列

复制代码

    import queue
    
    q=queue.Queue()
    
    # 入队
    q.put(4)
    q.put(7)
    q.put(11)
    
    # 查看队首元素
    print("队首元素:", q.get())
    
    # 队尾入队
    q.put(8)
    
    # 判空
    if q.empty():
    print("队列为空")
    else:
    print("队列不为空")
    
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
    
    代码解读

4.4 哈希表

复制代码

    hashTable={}
    
    # 添加键值对
    hashTable["John"]="John Doe"
    hashTable["Alice"]="Alice Lee"
    hashTable["Bob"]="Bob Wang"
    
    # 根据键查询值
    print("John:",hashTable["John"])
    
    # 是否包含键
    if "John" in hashTable:
    print("哈希表包含键'John'")
    else:
    print("哈希表不包含键'John'")
    
    # 更新键值
    hashTable["John"]="John Wang"
    print("'John'的旧值:",hashTable["John"])
    
    # 删除键值
    del hashTable["Alice"]
    print("\n删除'Alice'后的哈希表:")
    for key in hashTable:
    print(key,"=",hashTable[key])
    
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
      
    
    代码解读

5. 未来发展趋势与挑战：

在当前阶段中, 算法领域呈现出蓬勃发展之势。相较于传统计算机科学分支, 算法研究如今更广泛地渗透到现代应用开发实践中。本系列文章着重介绍了一种基于数据驱动型算法, 该算法不仅专注于数据处理过程, 同时也强调了其在高性能计算方面的优势。鉴于算法体系本身的复杂程度较高, 使用单一化的理论框架对之进行统一解析存在一定困难; 因此这类文章往往难以提供全面系统的整体评析; 只能从局部角度贡献一些独到见解。然而在计算机科学领域仍有许多关键问题尚未得到彻底解决; 而且随着时代发展与技术进步, 算法领域也在不断探索与创新中前进。

下一步将继续撰写一系列涉及数据结构、图论、机器学习、分布式系统、数据库系统等领域的文章；这些主题均与算法设计紧密相关。然而由于不同算法之间通常存在显著的技术鸿沟因此无法直接套用；在设计这些算法时我们可以参考《算法导论》中的相关内容并结合实际项目经验进行优化和改进

全部评论 (0)

还没有任何评论哟~

Introduction to Algorithms: Part 1 Fundamentals

作者：禅与计算机程序设计艺术 1.简介： Algorithmsareafundamentalbuildingblockofcomputerscienceandplayanessentialrolei...

1.1 Introduction to Algorithms

此专栏为浙江大学国际校区生物信息学大三的课程，供期末复习所用，欢迎指正。 1\.算法总览 2\.需要的软件/environment/学习用书老师推荐：python3、VENV、PIP、Conda、V...

Introduction to The Design and Analysis of Algorithms (1)

最近两周看了这本书的第一章和第二章，摘抄书中部分句子做个小总结。 1\.如果一个排序算法保留了等值元素在输入中的相对顺序，就可以说它是稳定的（stable. 例习题1.31 算法comparisonC...

An Introduction to Genetic Algorithms

作者：禅与计算机程序设计艺术 1.简介什么是遗传算法（GeneticAlgorithms）？它可以被认为是一个优化算法，旨在通过模拟自然进化过程来解决优化问题。

Week 1 An Introduction to Machine Learning(Part A)

目录一、什么是机器学习二、如何学习形成一个模型三、如何选择一个好的模型四、KNearestNeighborsKNNclassifier（Tutorial内容）五、总结一、什么是机器学习 1...

Lesson Three of MIT《Introduction to Algorithms》

文章目录 1.x^n 2.mergesort 3.BinarySearch 4.FibonacciNumbers 5.MatrixMultiplicationStrassen 6.VLSI Divid...

An Introduction to Evolutionary Algorithms for Neural N

作者：禅与计算机程序设计艺术 1.简介近年来，深度学习技术在图像、语音、文本等领域已经取得了显著的成果。其主要解决的是复杂的特征提取问题，并通过神经网络模型的训练实现对数据的泛化能力。然而，如何找到...

Evolutionary Algorithms in Python: An Introduction to G

作者：禅与计算机程序设计艺术 1.简介本文将向读者展示基于遗传编程和进化策略的方法来解决复杂多变的优化问题。首先，我们会讨论什么是遗传编程和进化策略？其次，我们将介绍遗传编程的基本概念、算法流程和操...

AUTOSAR SecOC Introduction -- Part 1

Introduction AUTOSARSecureOnboardCommunicationSecOC作用是提供一种机制用于保证ECU之间通信过程中的“重要”数据的完整性和身份验证。

AUTOSAR ECUM Introduction -- Part 1

ECUMBasicintroduction EcuM负责管理整个ECU的STARTUP/SHUTDOWN/WAKEUP/SLEEP流程 VectorSolution ECUMSTATE ECUMSTA...

是否确定退出登录?

Introduction to Algorithms: Part 1 Fundamentals

1.简介

2. 基本概念术语说明：

2.1 算法与算法的属性

2.2 数据结构及其操作

2.2.1 数组

操作符

2.2.2 栈

操作符

2.2.3 队列

操作符

2.2.4 哈希表

操作符

3. 核心算法原理与具体操作步骤：

3.1 排序算法——选择排序

3.2 搜索算法——二分查找

3.3 遍历算法——深度优先搜索

3.4 分治算法——归并排序

4. 具体代码实例和解释说明：

4.1 数组

4.2 栈

4.3 队列

4.4 哈希表

5. 未来发展趋势与挑战：

全部评论 (0)

相关文章推荐

Introduction to Algorithms: Part 1 Fundamentals

1.1 Introduction to Algorithms

Introduction to The Design and Analysis of Algorithms (1)

An Introduction to Genetic Algorithms

Week 1 An Introduction to Machine Learning(Part A)

Lesson Three of MIT《Introduction to Algorithms》

An Introduction to Evolutionary Algorithms for Neural N

Evolutionary Algorithms in Python: An Introduction to G

AUTOSAR SecOC Introduction -- Part 1

AUTOSAR ECUM Introduction -- Part 1