Watch the full video walkthrough here: Lecture 1 Part 1 Python Libraries and Modules
NumPy is a powerful Python library essential for data manipulation, scientific computing, and numerical analysis. This guide provides a step-by-step introduction to NumPy and explains the fundamentals you'll need to perform efficient computations on large datasets. Whether you're a beginner or refreshing your skills, this tutorial will serve as a valuable reference.
What is NumPy?
Note: "NumPy is one of the most popular Python libraries used for handling arrays and performing mathematical operations. In this video, we’ll explore how NumPy can make data processing faster and more efficient. Let's dive in!"
The Power of Vectorization
Example Code:
# Using lists (non-vectorized)
A = [1, 2, 3]
B = [4, 5, 6]
C = [A[i] * B[i] for i in range(len(A))]
# Using NumPy (vectorized)
import numpy as np
A = np.array([1, 2, 3])
B = np.array([4, 5, 6])
C = A * B
Note: "With vectorization, NumPy enables faster computation by operating on multiple elements at once, which is particularly valuable with large datasets."
Importing and Aliasing
np
.Example Code:
import numpy as np
Note: "Using the alias np
is a standard convention in the Python community, making our code shorter and more readable."
Creating NumPy Arrays
Examples:
# 1D Array
array_1d = np.array([1, 2, 3])
# 2D Array
array_2d = np.array([[1, 2, 3], [4, 5, 6]])
# 3D Array
array_3d = np.array([[[1, 2], [3, 4]], [[5, 6], [7, 8]]])
Note: "NumPy arrays, or ndarrays, can hold data in different dimensions. They’re similar to lists, but they allow more efficient manipulation."
Array Attributes
shape
: Dimensions of the array.ndim
: Number of dimensions.dtype
: Data type of the elements.Example Code:
print(array_2d.shape) # Output: (2, 3)
print(array_2d.ndim) # Output: 2
print(array_2d.dtype) # Output: int64
Note: "Attributes like shape
, ndim
, and dtype
help us understand an array’s structure and contents."
Reshaping Arrays
reshape()
to change array dimensions.Example Code:
# Reshape 2D array to 4 rows and 2 columns
array_reshaped = array_2d.reshape(4, 2)
print(array_reshaped)
Note: "Reshaping arrays is common in data analysis to adjust data layout without changing content."
Useful NumPy Functions
np.mean()
: Calculates the mean.np.log()
: Computes the natural logarithm.np.floor()
and np.ceil()
: Rounds down and up respectively.Example Code:
print(np.mean(array_1d)) # Output: Mean of array
print(np.log(array_1d)) # Natural log of elements
print(np.floor(array_1d)) # Floor values
print(np.ceil(array_1d)) # Ceil values
Note: "NumPy provides many built-in functions to perform mathematical operations efficiently."
Conclusion and Next Steps
Note: "Understanding NumPy sets a strong foundation for data analysis. Next, we’ll see how libraries like Pandas depend on it."