Implemented Suffix Tree Data Structure (#11554)
* Implemented KD-Tree Data Structure
* Implemented KD-Tree Data Structure. updated DIRECTORY.md.
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Create __init__.py
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Replaced legacy `np.random.rand` call with `np.random.Generator` in kd_tree/example_usage.py
* Replaced legacy `np.random.rand` call with `np.random.Generator` in kd_tree/hypercube_points.py
* added typehints and docstrings
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* docstring for search()
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Added tests. Updated docstrings/typehints
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* updated tests and used | for type annotations
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* E501 for build_kdtree.py, hypercube_points.py, nearest_neighbour_search.py
* I001 for example_usage.py and test_kdtree.py
* I001 for example_usage.py and test_kdtree.py
* Update data_structures/kd_tree/build_kdtree.py
Co-authored-by: Christian Clauss <cclauss@me.com>
* Update data_structures/kd_tree/example/hypercube_points.py
Co-authored-by: Christian Clauss <cclauss@me.com>
* Update data_structures/kd_tree/example/hypercube_points.py
Co-authored-by: Christian Clauss <cclauss@me.com>
* Added new test cases requested in Review. Refactored the test_build_kdtree() to include various checks.
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Considered ruff errors
* Considered ruff errors
* Apply suggestions from code review
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update kd_node.py
* imported annotations from __future__
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Implementation of the suffix tree data structure
* Adding data to DIRECTORY.md
* Minor file renaming
* minor correction
* renaming in DIRECTORY.md
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Considering ruff part-1
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Considering ruff part-2
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Considering ruff part-3
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Considering ruff part-4
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Considering ruff part-5
* Implemented Suffix Tree Data Structure.
Added some comments to my files in #11532, #11554.
* updating DIRECTORY.md
* Implemented Suffix Tree Data Structure.
Added some comments to my files in #11532, #11554.
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Christian Clauss <cclauss@me.com>
Co-authored-by: Ramy-Badr-Ahmed <Ramy-Badr-Ahmed@users.noreply.github.com>
2024-09-28 13:37:00 +00:00
|
|
|
# Created by: Ramy-Badr-Ahmed (https://github.com/Ramy-Badr-Ahmed)
|
|
|
|
# in Pull Request: #11532
|
|
|
|
# https://github.com/TheAlgorithms/Python/pull/11532
|
|
|
|
#
|
|
|
|
# Please mention me (@Ramy-Badr-Ahmed) in any issue or pull request
|
|
|
|
# addressing bugs/corrections to this file.
|
|
|
|
# Thank you!
|
|
|
|
|
kd tree data structure implementation (#11532)
* Implemented KD-Tree Data Structure
* Implemented KD-Tree Data Structure. updated DIRECTORY.md.
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Create __init__.py
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Replaced legacy `np.random.rand` call with `np.random.Generator` in kd_tree/example_usage.py
* Replaced legacy `np.random.rand` call with `np.random.Generator` in kd_tree/hypercube_points.py
* added typehints and docstrings
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* docstring for search()
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Added tests. Updated docstrings/typehints
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* updated tests and used | for type annotations
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* E501 for build_kdtree.py, hypercube_points.py, nearest_neighbour_search.py
* I001 for example_usage.py and test_kdtree.py
* I001 for example_usage.py and test_kdtree.py
* Update data_structures/kd_tree/build_kdtree.py
Co-authored-by: Christian Clauss <cclauss@me.com>
* Update data_structures/kd_tree/example/hypercube_points.py
Co-authored-by: Christian Clauss <cclauss@me.com>
* Update data_structures/kd_tree/example/hypercube_points.py
Co-authored-by: Christian Clauss <cclauss@me.com>
* Added new test cases requested in Review. Refactored the test_build_kdtree() to include various checks.
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Considered ruff errors
* Considered ruff errors
* Apply suggestions from code review
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update kd_node.py
* imported annotations from __future__
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Christian Clauss <cclauss@me.com>
2024-09-03 12:39:09 +00:00
|
|
|
import numpy as np
|
|
|
|
import pytest
|
|
|
|
|
|
|
|
from data_structures.kd_tree.build_kdtree import build_kdtree
|
|
|
|
from data_structures.kd_tree.example.hypercube_points import hypercube_points
|
|
|
|
from data_structures.kd_tree.kd_node import KDNode
|
|
|
|
from data_structures.kd_tree.nearest_neighbour_search import nearest_neighbour_search
|
|
|
|
|
|
|
|
|
|
|
|
@pytest.mark.parametrize(
|
|
|
|
("num_points", "cube_size", "num_dimensions", "depth", "expected_result"),
|
|
|
|
[
|
|
|
|
(0, 10.0, 2, 0, None), # Empty points list
|
|
|
|
(10, 10.0, 2, 2, KDNode), # Depth = 2, 2D points
|
|
|
|
(10, 10.0, 3, -2, KDNode), # Depth = -2, 3D points
|
|
|
|
],
|
|
|
|
)
|
|
|
|
def test_build_kdtree(num_points, cube_size, num_dimensions, depth, expected_result):
|
|
|
|
"""
|
|
|
|
Test that KD-Tree is built correctly.
|
|
|
|
|
|
|
|
Cases:
|
|
|
|
- Empty points list.
|
|
|
|
- Positive depth value.
|
|
|
|
- Negative depth value.
|
|
|
|
"""
|
|
|
|
points = (
|
|
|
|
hypercube_points(num_points, cube_size, num_dimensions).tolist()
|
|
|
|
if num_points > 0
|
|
|
|
else []
|
|
|
|
)
|
|
|
|
|
|
|
|
kdtree = build_kdtree(points, depth=depth)
|
|
|
|
|
|
|
|
if expected_result is None:
|
|
|
|
# Empty points list case
|
|
|
|
assert kdtree is None, f"Expected None for empty points list, got {kdtree}"
|
|
|
|
else:
|
|
|
|
# Check if root node is not None
|
|
|
|
assert kdtree is not None, "Expected a KDNode, got None"
|
|
|
|
|
|
|
|
# Check if root has correct dimensions
|
|
|
|
assert (
|
|
|
|
len(kdtree.point) == num_dimensions
|
|
|
|
), f"Expected point dimension {num_dimensions}, got {len(kdtree.point)}"
|
|
|
|
|
|
|
|
# Check that the tree is balanced to some extent (simplistic check)
|
|
|
|
assert isinstance(
|
|
|
|
kdtree, KDNode
|
|
|
|
), f"Expected KDNode instance, got {type(kdtree)}"
|
|
|
|
|
|
|
|
|
|
|
|
def test_nearest_neighbour_search():
|
|
|
|
"""
|
|
|
|
Test the nearest neighbor search function.
|
|
|
|
"""
|
|
|
|
num_points = 10
|
|
|
|
cube_size = 10.0
|
|
|
|
num_dimensions = 2
|
|
|
|
points = hypercube_points(num_points, cube_size, num_dimensions)
|
|
|
|
kdtree = build_kdtree(points.tolist())
|
|
|
|
|
|
|
|
rng = np.random.default_rng()
|
|
|
|
query_point = rng.random(num_dimensions).tolist()
|
|
|
|
|
|
|
|
nearest_point, nearest_dist, nodes_visited = nearest_neighbour_search(
|
|
|
|
kdtree, query_point
|
|
|
|
)
|
|
|
|
|
|
|
|
# Check that nearest point is not None
|
|
|
|
assert nearest_point is not None
|
|
|
|
|
|
|
|
# Check that distance is a non-negative number
|
|
|
|
assert nearest_dist >= 0
|
|
|
|
|
|
|
|
# Check that nodes visited is a non-negative integer
|
|
|
|
assert nodes_visited >= 0
|
|
|
|
|
|
|
|
|
|
|
|
def test_edge_cases():
|
|
|
|
"""
|
|
|
|
Test edge cases such as an empty KD-Tree.
|
|
|
|
"""
|
|
|
|
empty_kdtree = build_kdtree([])
|
|
|
|
query_point = [0.0] * 2 # Using a default 2D query point
|
|
|
|
|
|
|
|
nearest_point, nearest_dist, nodes_visited = nearest_neighbour_search(
|
|
|
|
empty_kdtree, query_point
|
|
|
|
)
|
|
|
|
|
|
|
|
# With an empty KD-Tree, nearest_point should be None
|
|
|
|
assert nearest_point is None
|
|
|
|
assert nearest_dist == float("inf")
|
|
|
|
assert nodes_visited == 0
|
|
|
|
|
|
|
|
|
|
|
|
if __name__ == "__main__":
|
|
|
|
import pytest
|
|
|
|
|
|
|
|
pytest.main()
|