Skip to content

Latest commit

 

History

History
191 lines (146 loc) · 8.25 KB

README.rst

File metadata and controls

191 lines (146 loc) · 8.25 KB

Background

This module introduces a natural interface for signal processing, and it makes signal processing code simpler and shorter. It also introduces various signal processing functions. It's based on numpy, scipy, and uses pint for units support.

Motivation

Scipy has great mathematical tools, but it lacks a * natural interface * for signal processing.

  • A signal as a continuous data (the interface), not vector of samples (internal implementation).

  • A signal should support physical unit. it's a measurement!

  • there are many common operations like:

    • plotting
    • fft
    • checking whether 2 signals has the same sample points.

    that are done much shortly and easily with a signal object.

Installation

  • it's preferred to install in in virtualenv
  • the installation takes a few minutes because it installs also numpy and scipy
  • Note: matplotlib requires a few libraries (like truetype) to be installed on your machine.

with pip:

pip install -e git+https://github.com/noamg/signal_processing.git#egg=signal_processing-master

Tutorial

should create a tutorial, in the form of ipython notebooks

Contact

You are welcome to put issues on https://github.com/noamg/signal_processing/issues and contact me [email protected], in anything related to this package:

  • help
  • unclear documentation
  • general ideas or suggestions to improve the design
  • problems

documentation

is compiled using sphinx, and numpydoc extension (to remove) the old documentation: https://github.com/noamg/signal_processing/doc/built/html/index.html the new documentation will be in readthedocs.

Data types - the interface for signal processing

the package introduces a few data types to represent different datas easily and efficiently:

1) Object that represent continuous data such as voice recording (a signal). other examples:

  • continuous spatial measurement. Such as mountain's profile.
  • continuous kinematic measurement: position of something as a function of time.
  • a distribution (continuous histogram)
  • a spectrum of another signal (FFT).
  1. Object that represent segments of interest within a signal, such as:

    • times and amplitudes of pulses of energy in an electromagnetic signal.
    • locations of seismic bursts in a continuous seismic measurement.
    • cluster: ranges that contain most of the value in a continuous distribution.

The use of these objects eliminates irrelevant information and thus simplifies the code.

Processes

various processes to manipulate ContinuousData and Segments:

  • frequency filtering, modulations, demodulatioins
  • Auto noise level detection, threshold operations, algorithms to cope with noise.

How to proceed

  1. look at the code examples.
  2. read the documentation of ContinuousData, ContinuousDataEven and Segments. it explains the they way this package percieves signals.

naming conventions

  1. As a rule of thumb, scipy and numpy naming conventions are preferable.
  2. XXX is a marker of highly problematic process.

2. is_each_..... a name for a boolian array which specifies whether each element in another array fullfils some condition. it's a mask.

Main issues before first release

  1. refactoring issues:

    1. create ContinuousData.new_values method, and then remove all the definitions of __add__ etc from ContinuousDataEven to ContinuousData. thus ease the reading
    2. shorten the name of some functions in the sub-modules (adjoin, filter. threshold etc.) since they are in a module, they can have a less indicative name
    3. change cotinuous/plots.py to continuous/visualisation.py and fix imports
    4. in tests and examples, use ContinuousDataEven.generate instead if creating a signal manualy
    5. use verbs instead of nouns as names of modules. generate, modulate, demodulate
    • use a refactoring tool. vim rope?
    • python-mode !
  2. testing:

    1. so far using nose. maybe should use pytest?
    2. use test classes.
    3. add tests
    4. grep all the "not tested" signs (there are warnings that some functions are not tested
      start with removing the user warings of "not tested" and put TODO instead
    5. add test to the examples
  3. choose a way to manage the issues and TODO's. grepable text that indicates issue in the code:

    • TODO
    • todo
    • XXX
    • design issus
    • not tested
    • rename
    • refactor
    • encapsulate
    • old old old
    • link to
    • deprecated
    • NotImpmentedError
  4. documentation:

    • make the documentation available at readthedocs
    • make some documentation that is more that API
    • scan all the documentation, and make sure it's written in numpy conventions.
    • replace all the TODO with .. todo directive.
  5. make sure to remove the use of old interface (like module generators)

  6. arrange the imports according to a certain order even within the package, for example: import warnings import numpy as np from signal_processing import uerg from signal_processing import extension from signal_processing import Segment from signal_processing import continuous

  7. find how to let some functions share documentation of parameters with same behaviour. very important when I wrap functions. maybe links?

  8. improve use of exceptions. design the package exceptions, and use pint exceptions for units errors

  9. use sphinx to compile documentation.

  10. tag a commit as release 0.1

  11. Tutorial!

  12. connect to readthedocs

  13. git management: two files from the past, were split to several other files, and I didn't delete them. the reason is I don't want to loose the history (log) of commits of their content, up to the split. I need to understand how to keep the log, and delete the files. these files are: signal_processing/continuous_data.py signal_processing/continuous/continuous_data.py signal_processing/segments.py

Design principles

  1. The API of the ContinuousData object have several distinguished layers and they have to be repected.
    1. the layer that accesses the internals. and returns the values, and sample times of the signal.
    2. mathematical operations, or mildly complex operations such as addition, absolute value etc. this methods to no use the internals, but instead use the first layer.
    3. there are some operations that are very common such doing fft to your signal, or plotting it to gain some intuition. They are methods, instead of external functions, because they are used all the time. However, they are percieved as a this connection to what actually does the logic (numpy.fft, or plt.plot). They must not contain any logic of there own. If they need any logic, it should be implemented as a second layer method.
  2. This package handles only signal_processing.n in some cases it needs a service that logically lies in the responsability of some other package (numpy, scipy, pint, matplotlib).n in this cases the service (function in most cases) is put on the coresponding file in the extensions/ sub-package. n the core sub-package: signal_processing, uses this services, but should never implement them on it's own, as it's not it's reponsability.n Ideally, in the future all this extension files would be incorporated to the cpresponding modules.

Design Issues

  1. some methods and functions in this package has several similar modes of action. usually, this mode chooses a different manipulation or function to apply on the input. such as: mean/min/max, sine/square. examples for such functions: ContinuousDataEven.modulate, ContinuousDataEven.generate, ContinuousDataEven.demodulate, threshold.estimate_noise_level

    current interface: they accept string as a parameter. (such as 'mean') possible other interface: maybe should accept function as input (such as np.sin, np.median) possible other interface: maybe they should accept enum to avoid typos (values errors), for modes like 'accurate' / 'fast'... of course, that when the function is a mere wrap around a numpy / scipy package, the interface should be as similar as possible