Gene Mvan_5122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5122 
Symbol 
ID4647559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5485294 
End bp5486319 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content66% 
IMG OID639808596 
ProductDyp-type peroxidase family protein 
Protein accessionYP_955899 
Protein GI120406070 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2837] Predicted iron-dependent peroxidase 
TIGRFAM ID[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.963646 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.469662 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTGCTC CCCTGCCACA GCCGGTCCTG GCACCGCTGA CACCCGCAGC GATCTTCCTG 
GTGGCCACCA TCGACGAGGG TGGCGAGGAG GCTGTGCACG ACGCGCTCGG TGACATCGCC
GGTCTGGTGC GCGCCGTCGG TTTCCGGGAC CCGGCCAAGC GGCTCTCGGT GGTGACGTCG
ATCGGGTCGC GGGCCTGGGA CCGACTGTTC TCCGGGCCGC GCCCGGCAGA GCTACATCCG
TTCATCGAGC TCGACGGCGG CCGCCATCAC GCACCGTCGA CGCCGGGCGA TCTGCTGTTC
CACATCCGCG CCGAGACGAT GGACGTCTGC TTCGAACTGG CGACCAAGCT GGTCGCCGCG
ATGGGCGCAA TCACCGTCGT CGACGAGACC CACGGCTTCC GGTTCTTCGA CAACCGGGAC
CTGCTCGGAT TCGTCGACGG CACCGAGAAC CCCGACGGTC CGCTGGCCGA CAGCGCCACC
CAGATCGGGG ACGAAGACCC CGACTTCGCG GGCGGCTGCT ACGTGCACGT GCAGAAGTAT
CTGCACCTGA TGGACGCGTG GAACGCGCTG TCGACCGAGG AACAGGAGCG GGTGATCGGC
CGCACCAAAC TCGACGACAT CGAACTCGAC GACGCGGTCA AGCCGGCCAA CTCCCATGTG
GCGCTCAACG TCATCGAAGA CGAAGACGGC ACCGAGCTCA AGATCGTGCG GCACAACATG
CCGTTCGGTG AAATCGGGGC CGGCGAGTTC GGCACCTACT TCATCGGATA CTCGCGCACC
GCGGCGGTCA CCGAGCGGAT GCTGCGCAAC ATGTTCATCG GCGACCCTCC CGGCAACACC
GACCGCGTGC TGGACTTCTC GACAGCCGTC ACCGGGTCGA TGTTCTTCAC GCCGATCATG
GATTTCCTCA ACAACCCTCC GCCGCTTCCG AATTCGGCGT CGGACACCGT GGCGCCCACC
GATGTGACAC CGGTCGGCTA TGCGGGATCA CTGGCGATCG GCAGTCTGAA AGGACAACCG
CAATGA
 
Protein sequence
MPAPLPQPVL APLTPAAIFL VATIDEGGEE AVHDALGDIA GLVRAVGFRD PAKRLSVVTS 
IGSRAWDRLF SGPRPAELHP FIELDGGRHH APSTPGDLLF HIRAETMDVC FELATKLVAA
MGAITVVDET HGFRFFDNRD LLGFVDGTEN PDGPLADSAT QIGDEDPDFA GGCYVHVQKY
LHLMDAWNAL STEEQERVIG RTKLDDIELD DAVKPANSHV ALNVIEDEDG TELKIVRHNM
PFGEIGAGEF GTYFIGYSRT AAVTERMLRN MFIGDPPGNT DRVLDFSTAV TGSMFFTPIM
DFLNNPPPLP NSASDTVAPT DVTPVGYAGS LAIGSLKGQP Q