Kmer Class
Kmer.js is a module for calculating kmer frequencies and sequence probabilities
Constructor
Kmer
()
Item Index
Methods
Methods
binaryToSequence
-
x
Returns a biological sequence from a binary encoding
Parameters:
-
x
NumberAn integer encoding of a biological sequence
Returns:
A biological sequence
Throws:
If x is a float
Example:
>var testKmerIndex = 0; // Analogous to sequenceToBinary() example
>kmer.binaryToSequence(testKmerIndex);
'AAAAAAAAA'
constructor
-
k
-
letters
The constructor method for instantiation
Parameters:
Throws:
If the number of letters is not > 1
Example:
Kmer = require('kmer.js')
sampleA = new Kmer(9, "ACGT");
frequency
-
seq
Calculates a frequency of a sequence in the profile
Parameters:
-
seq
StringA sequence to retrieve the relative count/frequency
Returns:
The frequency (count/totalCounts) of the sequence from the profile
Throws:
If seq is not a kmer (has a length of k)
Example:
>var testKmer = "AAAAAAAAA";
>var testKmerFrequency = kmer.frequency(testKmer); // Returns a BigNumber.js
>console.log( testKmerFrequency.toNumber() );
0.123457
kmerArray
-
s
-
k
Returns an array of all k-length substrings. Takes a string and a k in that order.
Parameters:
Returns:
Returns an array of Strings, all of length k, and all substrings of s.
Throws:
If the length of s < k
Example:
var k = 7 var testString = "AAACCCCCGCACCCGCGGGGGTTTCAGCGTGTCG" var allKmersFromTestString = kmer.kmerArray(testString, 9)
probabilityOfSequence
-
seq
Calculates the Markov chain probability of a sequence from its transition probabilities
Parameters:
-
seq
StringA biological sequence
Returns:
Returns the Markov-chain probability of the input sequence
Throws:
If there are letters in seq that aren't in the sequence alphabet
Example:
>var testKmer = "AAACCCCCGCACCCGCGGGGGTTTCAGCGTGTCG";
>var testKmerProb = kmer.probabilityOfSequence(testKmer);
>console.log( testKmerProb.toNumber() );
0.000033333888887777710
profileAsArray
-
k
-
letters
Returns a 32-bit int array of zeroes, given an alphabet and choice of k
Parameters:
Returns:
Returns a typed array of length : letters.length ^ k
Throws:
If the number of letters is not > 1
sequenceToBinary
-
s
Returns a binary representation/encoding of a biological sequence
Parameters:
-
s
StringA biological sequence to convert into a binary integer
Returns:
Returns an integer encoding of a k-mer
Throws:
If there are letters in seq that aren't in the sequence alphabet
Example:
>var testKmer = "AAAAAAAAA" // Length of testKmer matches our initial value of k, 9
>var testKmerIndex = kmer.sequenceToBinary(testKmer);
>console.log( kmer.profile[testIndex] );
streamingUpdate
()
Through2
This method streams sequence data to update the kmer profile by side-effect
Returns:
A through2 stream wrapper
Throws:
If the input stream's objects don't have a .seq attribute
Example:
>var fasta = require('bionode-fasta'); // Bionode-fastq also
>var fs = require('fs');
>fs.createReadStream("/path/to/example.fasta", {encoding: "UTF-8"})
.pipe(fasta.obj())
.pipe(kmer.streamingUpdate())
.on('finish', function(){
console.log("Done!");
});
>console.log(kmer.profile)
>var AWS = require('aws-sdk');
>var s3 = new AWS.S3({apiVersion: '2006-03-01'});
>var fasta = require('bionode-fasta');
>var params = {Bucket: 'bucketname', Key: 'path/to/example.fasta'}
>s3.getObject(params).createReadStream()
.pipe(fasta.obj())
.pipe(kmer.streamingUpdate())
.on('finish', function(){
console.log("Done!");
});
>console.log(kmer.profile);
TotalProfileCounts
()
BigNumber
Sums the counts for the whole profile. It also updates the associated property .totalProfileCounts as a side-effect
Returns:
Returns a BigNumber.js sum of all counts from the profile array
Example:
>// After a streaming update, the attribute .totalProfileCounts isn't always updated
>console.log( kmer.totalProfileCounts );
0
>kmer.TotalProfileCounts();
10300
>console.log( kmer.totalProfileCounts );
10300
transitionProbability
-
seq1
-
seq2
Calculates the transition probability of one sequence to the next in a Markov chain
Returns:
The transition probability of seq1 to seq2
Throws:
If seq2 is not a kmer (has a length of k)
Example:
>var testKmer1 = "AAAAAAAAA";
>var testKmer2 = "AAAAAAAAT";
>var transProbTK1toTK2 = kmer.transitionProbability(testKmer1, testKmer2);
>console.log( transProbTK1toTK2.toNumber() );
0.111048
Properties
totalProfileCounts
BigNumber
A total of counts from an instantiated profile Relies on data being loaded through the streamingUpdate() method Recalculate after updating by running the TotalProfileCounts() method