Gene Mvan_2157 details

Gene Information

Locus tag: Mvan_2157
Symbol:
ID: 4649134
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 2304046
End bp: 2305242
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 70%
IMG OID: 639805642
Product: Dyp-type peroxidase family protein
Protein accession: YP_952978
Protein GI: 120403149
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2837] Predicted iron-dependent peroxidase
TIGRFAM ID: [TIGR01412] Tat-translocated enzyme
            [TIGR01413] Dyp-type peroxidase family
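
The length fields in this record agree with its coordinates if Start bp and End bp are read as 1-based inclusive positions. That convention is an assumption here, since the record does not state it. A minimal sketch of the check follows; the function names are hypothetical and introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2304046
END_BP = 2305242
GENE_LENGTH_BP = 1197
PROTEIN_LENGTH_AA = 398


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under that assumption, 2305242 - 2304046 + 1 gives 1197 bp, and 1197 / 3 - 1 gives 398 aa, matching the Gene Length and Protein Length fields.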


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.923175
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGAGC AGCGCTGGAC CCTCAACCGG CGGCGCCTCC TCACCGGAGG CGCCGCCGTG 
GCGGCGGGCG CCGCACTGAC CCAGTGCGCG ACCGCCGGCT CCTCGACACC AACAGGTTTC
GGTTCCGCGA CGGAGCCGTT CCACGGCCGG CACCAGGCCG GGATCGCAAC GCCGCCGCAA
GCGCACGCCC TGTTCGTCGC GCTGGACATG GCGCCCAGCG CAGATCGCAG CCCGCGGGAC
ACCCTGATTG CGATGCTGCG GCTGTGGAGT TCCGACGCCG CGCGCCTCAC CGCGGGCCAG
CCCGCCCTGG CGGACACCGA ACCCGAACTT GCGCAACACC CTTCACGGCT CACTGTGACG
GTGGGCATCG GACCACACGT CTTCGACCGG ATCGGGCTGG CCCACCGTCG TCCGGATTCG
GTGTCCGAGT TGCCGGCGTT CTCCACCGAC CGACTCGATC GGCGCTGGTG CGGTGGTGAC
ATCCTGCTGC AGATCTGTGC CGATGACCGG GTCGCCGTCG CACACGCCGC GCGGGTCCTA
CTCAAGAACG TACGCACGCT GACCGTGCAG CGGTGGCGGC AGGACGGGTT CCGAACCGCG
CGCGGCGCGG ACAAGTCCGG TGCGACGATG CGCAACCTGA TGGGGCAGGT CGACGGCACC
GCGAACCCAC GCGAGGATGC CGAACTCGAT CGTTACGTCT GGGACGACGG TTCGCAGCAA
CCGTGGTTCG CCGGCGGGAC CGTGCTCGTG ATCCGCCGCA TCCGGTCCGA GCTGGACACC
TGGGACGAAC TGGACCGCAC CAGCAAGGAA TTGACGCTGG GCCGGCGACT GGACACCGGG
GCGCCACTGA CCGGCGAGGG CGAGTTCGAC GAGCCCGACC TCGCCGCCAC CGAGAACGGC
ATACCGGTCA TCCCGCCGAA TTCGCATGTG GCACTGGCTC GGCGGCAGTC GGACGATGAG
CGCTTCCTGC GGCGGGGGTA CAACTACGAC GACCCGCCGA CGGTGGGCAC CACGGACGCG
GGACTGATCT TCGCGGCGTA CCAGCGTGAC CCGGCGCGGC AGTTCGTTCC GGTACAGCGA
CGGCTGGCCG AGGCGGACGC GATGAACCCG TGGATCACGA CGATCGGCTC CGCGGTGTTC
GCGATGCTAC CCGGGGTGCC TGAGGGCGGT TATCTGGGGC AGAACCTGTT GGGGTGA
 
Protein sequence
MAEQRWTLNR RRLLTGGAAV AAGAALTQCA TAGSSTPTGF GSATEPFHGR HQAGIATPPQ 
AHALFVALDM APSADRSPRD TLIAMLRLWS SDAARLTAGQ PALADTEPEL AQHPSRLTVT
VGIGPHVFDR IGLAHRRPDS VSELPAFSTD RLDRRWCGGD ILLQICADDR VAVAHAARVL
LKNVRTLTVQ RWRQDGFRTA RGADKSGATM RNLMGQVDGT ANPREDAELD RYVWDDGSQQ
PWFAGGTVLV IRRIRSELDT WDELDRTSKE LTLGRRLDTG APLTGEGEFD EPDLAATENG
IPVIPPNSHV ALARRQSDDE RFLRRGYNYD DPPTVGTTDA GLIFAAYQRD PARQFVPVQR
RLAEADAMNP WITTIGSAVF AMLPGVPEGG YLGQNLLG
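
The sequences in this section are printed in blocks of 10 characters separated by spaces and line breaks. To recompute the Gene Length or GC content fields from the gene sequence, the blocks first have to be joined. The sketch below shows one way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any tool referenced by this record.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


if __name__ == "__main__":
    # First printed line of the gene sequence, copied as shown in the record.
    first_line = (
        "ATGGCTGAGC AGCGCTGGAC CCTCAACCGG "
        "CGGCGCCTCC TCACCGGAGG CGCCGCCGTG"
    )
    dna = clean_sequence(first_line)
    print(len(dna), round(gc_percent(dna), 1))
```

Passing the full gene sequence (all printed lines) through `clean_sequence` and then `len` and `gc_percent` gives values to compare against the Gene Length and GC content fields.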