Annotated sequence data
API
Readers
These classes are designed to read data from a variety of file formats into a SeqData object.
Composing readers
These functions can be composed to read data from several file formats into a single SeqData object.
Save a SeqData to disk and open it (without loading it into memory).
Default readers
These functions are special cases of the composable readers, intended for common use cases.
Reads a bed-like (BED3+) file as a pandas DataFrame.
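As a rough illustration of what reading a BED3+ file into a pandas DataFrame can look like, the sketch below relies only on `pandas.read_csv`. The function name `read_bedlike_sketch`, the column names, and the `extra_N` labels for columns beyond the twelve standard BED fields are assumptions made for this example; they are not SeqData's actual reader or its column naming.

```python
import io

import pandas as pd

# Standard BED field names, in order. Only the first three are mandatory (BED3).
BED_COLUMNS = [
    "chrom", "chromStart", "chromEnd", "name", "score", "strand",
    "thickStart", "thickEnd", "itemRgb", "blockCount", "blockSizes", "blockStarts",
]


def read_bedlike_sketch(source):
    """Read a tab-separated BED3+ file (path or file-like) into a DataFrame.

    Hypothetical helper for illustration only; lines starting with '#' are skipped.
    """
    df = pd.read_csv(source, sep="\t", header=None, comment="#", dtype={0: str})
    n_cols = df.shape[1]
    if n_cols < 3:
        raise ValueError("a BED-like file needs at least chrom, start and end columns")
    # Name known BED fields, then label any further columns extra_0, extra_1, ...
    names = BED_COLUMNS[:n_cols]
    names += [f"extra_{i}" for i in range(n_cols - len(BED_COLUMNS))]
    df.columns = names
    return df


# Usage with an in-memory BED4 example.
example = io.StringIO("chr1\t0\t100\tpeakA\nchr2\t50\t150\tpeakB\n")
df = read_bedlike_sketch(example)
```

Keeping the chromosome column as a string avoids numeric chromosome names such as `1` or `22` being parsed as integers.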
Writing
SeqData writes all datasets to disk as Zarr stores.
PyTorch dataloading
SeqData provides a unified interface for converting SeqData objects into PyTorch dataloaders.
Get a PyTorch DataLoader for this SeqData.
Utilities
Utility functions for working with SeqData objects.
Label regions for binary or multitask classification based on whether they overlap with another set of regions.
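The idea of overlap-based labeling can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions, not SeqData's implementation: the names `overlaps`, `label_regions`, and `label_multitask` are hypothetical, regions are assumed to be half-open `(chrom, start, end)` tuples as in BED, and any overlap of at least one base counts as a positive label.

```python
from typing import Dict, List, Sequence, Tuple

# A region is (chrom, start, end) with a half-open interval [start, end), as in BED.
Region = Tuple[str, int, int]


def overlaps(a: Region, b: Region) -> bool:
    """True if the two regions share at least one base on the same chromosome."""
    return a[0] == b[0] and a[1] < b[2] and b[1] < a[2]


def label_regions(regions: Sequence[Region], targets: Sequence[Region]) -> List[int]:
    """Binary labels: 1 if a region overlaps any target region, otherwise 0."""
    return [int(any(overlaps(r, t) for t in targets)) for r in regions]


def label_multitask(
    regions: Sequence[Region], target_sets: Dict[str, Sequence[Region]]
) -> Dict[str, List[int]]:
    """Multitask labels: one binary label vector per named set of target regions."""
    return {task: label_regions(regions, targets) for task, targets in target_sets.items()}


# Usage: three query regions labeled against two hypothetical peak sets.
regions = [("chr1", 0, 100), ("chr1", 200, 300), ("chr2", 0, 50)]
peaks_a = [("chr1", 90, 120), ("chr2", 50, 80)]
peaks_b = [("chr1", 250, 260)]

binary_labels = label_regions(regions, peaks_a)
multitask_labels = label_multitask(regions, {"task_a": peaks_a, "task_b": peaks_b})
```

The pairwise comparison is quadratic in the number of regions, which is fine for a small example; for genome-scale inputs a sorted sweep or an interval tree would be the usual choice.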