Gene Tpau_3341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_3341 
Symbol 
ID9157515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp3441246 
End bp3442472 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content69% 
IMG OID 
ProductHNH nuclease 
Protein accessionYP_003648264 
Protein GI296141021 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0650205 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGAGCATG GATGGGGAGT ACGAATCAGG TTCAGCTGCA ACAGGAGCGG CGATGCCATT 
GCCGGTTTGG GTGCGTTGGA GCGGGCTCGT GCGCGGGTGG TGTTCGATCA GTATCGGTTG
ATCGCCGAGT TGTTGCGAGT GCGGGTGTGT GAGCGGATCG CGGCCGGGGT GGCGCAGGAC
CGGTGGGAGG CGGGGGTGGC GGCGGAGGTC GCGTTGGCGT TGCGGGTGTC GCCGCATCGG
GCTGCGGGGA TGCTGTCGCG GGCTCGGACG CTCGTGAAGG ATCTGCCGGC GACGTTCGGG
CGGCTGCGCG ACGGTGATGT TTCGCCGGAG GCGGTGGAGG TGATCCTCGC TGGCCTCTCC
CATCTGGAGC CACGGCTGAA GTCCAAGGCC GATGCGGAGT TGTGCGGCGA ATCTTTCGCC
GCCGCCGGTT TGGGCGTGAA GCGGTTGCAG GATCAGGTCA AGCAGGTCGC GTACCGGCTC
GACGCCCAGG CCACCGTGGA TCGTGCGGCG CTGGCAGCGA AGGATCGTCG GGTGACGATC
CGGCCGGCGC CGGATTGCAT GGCGCGGGTA TCGATCCTGC TGCCGGTCGC CCAAGCGGTC
GGTGTGTACG CCGCCGTGAA GGCCGCCGCC GATGCTGCGG TCGGCACTCC CGGCGAACCA
CGCAGCCGAG CCCAGATCAT GGCCGATACC GCCTTCGCGC GGATCACCGG CCGCGAGGCG
GCAGAAGGGC AACCGGTGAC GGTGCACCTG ACCGTCCCTG CCTCTGTTCT GCTGGGCGAT
CAGCCTGGCA CCGCGCACCT CTCCGGCGGC GGCACGCTGC CCGCGGAGAT CGCGCGGCAT
CTGGTCGGGC GGGCGTCGGA GCACGCGGTC GCGTGGGTCA AACGGCTGTA TGTGCAACCG
GAGTCGGGTG CCGTCGTCGG GCTGGATTCC CGGTCACGAC TGTTCCCTTC CGGACTCGCC
GAGTTGATCG CGGCGCGGGA TCGGTACTGC CGGACCCCGT ACTGCGATGC ACCGATCGCG
CACACCGACC ACGTCACCGC GCACGCCCAC GGCGGCGCAA CCAGCCTGGA CAACGGGCAA
GGATTGTGCG CGGCCTGCAA CTACGCCAAA GAAGCAACAG GGTGGACCAG CCGCACCGTC
CACGACGACA GCGGACGGCA CACCGTCGAA ACCCGCACCC CGACAGGACA TCTCCACCGA
TCCACCGCAC CACCGCAGGC GGCGTGA
 
Protein sequence
MEHGWGVRIR FSCNRSGDAI AGLGALERAR ARVVFDQYRL IAELLRVRVC ERIAAGVAQD 
RWEAGVAAEV ALALRVSPHR AAGMLSRART LVKDLPATFG RLRDGDVSPE AVEVILAGLS
HLEPRLKSKA DAELCGESFA AAGLGVKRLQ DQVKQVAYRL DAQATVDRAA LAAKDRRVTI
RPAPDCMARV SILLPVAQAV GVYAAVKAAA DAAVGTPGEP RSRAQIMADT AFARITGREA
AEGQPVTVHL TVPASVLLGD QPGTAHLSGG GTLPAEIARH LVGRASEHAV AWVKRLYVQP
ESGAVVGLDS RSRLFPSGLA ELIAARDRYC RTPYCDAPIA HTDHVTAHAH GGATSLDNGQ
GLCAACNYAK EATGWTSRTV HDDSGRHTVE TRTPTGHLHR STAPPQAA