Gene Mvan_3208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3208 
Symbol 
ID4648486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3415427 
End bp3417670 
Gene Length2244 bp 
Protein Length747 aa 
Translation table11 
GC content67% 
IMG OID639806684 
Productcatalase/peroxidase HPI 
Protein accessionYP_954015 
Protein GI120404186 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0376] Catalase (peroxidase I) 
TIGRFAM ID[TIGR00198] catalase/peroxidase HPI 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.996485 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGATA CCTCCGACGC CCGCCCACCG CACTCCGACG ACAAGACCCG CAGCCACAGC 
GAATCCGAGA ACCCGGCGAT CGACTCGCCG GAGCCCAAAG TGCATGCGCC GCTGACGAAC
AAGGACTGGT GGCCCGAGCA GGTCGACGTC TCGGTGCTGC ACAAGCAGAA CGAGAAGGGC
AACCCCCTCG GCGAGGACTT CGACTATGCG ACCGAGTTCG CCAAGCTCGA CGTCGAGGCG
TTCAAGCGTG ACGTCATCGA CCTGATCAAC ACCTCCCAGG ACTGGTGGCC CGCCGACTAC
GGCAGCTACG CCGGCCTGTT CATCCGCATG AGCTGGCATG CCGCAGGCAC CTACCGCATC
TTCGACGGCC GCGGCGGTGC CGGTCAGGGT TCCCAGCGGT TTGCGCCGCT CAACAGCTGG
CCGGACAACG CCAATCTGGA CAAGGCGCGC CGGCTGCTGT GGCCGATCAA GCGCAAGTAC
GGCAACAAGA TTTCGTGGGC CGACCTGATC GCGTACGCCG GCAACGCCGC GCTGGAGTCG
GCCGGCTTCC AGACCTTCGG CTTCGCGTTC GGCCGTGAGG ACATCTGGGA GCCCGAGGAG
ATGCTGTGGG GCCAGGAGGA CACCTGGCTG GGCACCGACA AGCGCTACGG CGGCACCAAC
GACAGCGACA CGCGGGAGCT GGCAGAACCC TTCGGCGCCA CCACGATGGG CCTGATCTAC
GTCAATCCCG AAGGCCCCGA AGGCAAGCCG GATCCGCTGG CGGCCGCCCA CGACATCCGC
GAGACGTTCG GCCGGATGGC GATGAACGAC GAGGAGACCG CGGCGTTGAT CGTCGGCGGC
CACACGCTGG GCAAGACCCA CGGCGCGGCC GATGTGAACG TCGGTCCCGA ACCGGAGGGC
GCGCCGATCG AGCAGCAGGG CCTGGGCTGG AAGTGTCCGT TCGGCACCGG CAATGCCGGT
GACACCGTCA CCAGCGGTCT GGAGGTCGTG TGGACCACCA CGCCGACCAA GTGGAGCAAC
GCCTATCTGG AACTGCTGTA CGGCTACGAG TGGGAGCTCA CCAAGAGCCC GGGCGGCGCA
TGGCAGTTCG AGGCCAAGGA CGCCGAAGCG ATCATCCCGG ATCCGTTCGG CGGGCCGCCG
CGCAAGCCCA CCATGCTGGT CACCGACGTC TCGATGCGGG TGGATCCCAT CTACGGCCCG
ATCACCCGGC GCTGGCTCGA TCATCCCGAG GAGATGAACG AGGCGTTCGC CAAGGCCTGG
TACAAGCTCA TGCACCGCGA CATGGGGCCG GTCAGCCGCT ACCTCGGGCC GTGGGTGGCC
GAGCCGCAGC TGTGGCAGGA CCCGGTGCCC GCGGTCGATC ACGAACTGAT CGACGAATCG
GACATCGCGG CACTGAAAGC CGCTGTGCTG CAATCGGGTC TGTCGGTCCC TCAGCTCGTG
AAGACGGCCT GGGCCTCGGC GTCGAGCTTC CGCGGGACCG ACAAGCGCGG CGGCGCCAAC
GGTGCGCGGC TGCGGCTCGA GCCGCAGCGC AGCTGGGAGG CCAACGAGCC GTCCGAGCTC
GACAAGGTGC TGCCGGTGCT GGAGAGGATC CAGCAGGACT TCAACGCCTC GGCGACCGGC
GGCAAGAAGG TCTCGCTCGC CGATCTGATC GTGCTGGCGG GTTCCGCGGC GGTCGAGAAG
GCGGCCAAGG ACGGCGGCTA CGAGATCTCG GTGCACTTCG CGCCCGGCCG GACCGACGCC
TCGCAGGAGC AGACGGACGT TGACTCGTTC GCAGTGCTCG AACCGCGTGC CGACGGCTTC
CGCAACTTCG CCAGGCCGGG GGAGAAGACC CCGCTCGAGC AGCTGTTGGT CGACAAGGCG
TACTTCCTCG ATCTGACCGC TCCGGAGATG ACGGCCCTGA TCGGCGGCCT GCGCACGCTG
AACGCCAACC ACGGCGGCAG CAAGCACGGG GTGTTCACCG AGCGGCCCGG CGTGCTGAGC
AATGACTTCT TCGTCAACCT GCTCGACATG GGCACCGAGT GGAAGCCGTC GGAGCTGACC
GAGAACGTCT ATGACGGCAA GGACCGAGCC ACCGGCCAGC CGAAATGGAC CGCCACGGCG
GCCGACCTGG TGTTCGGCTC GAACTCGGTG CTGCGCGCGG TGGTCGAGGT GTACGCCCAG
GATGACAACC AGGGCAAGTT CGTCGAGGAC TTCGTTGCCG CCTGGGTCAA GGTCATGAAC
AACGACCGGT TCGATCTGGG CTGA
 
Protein sequence
MTDTSDARPP HSDDKTRSHS ESENPAIDSP EPKVHAPLTN KDWWPEQVDV SVLHKQNEKG 
NPLGEDFDYA TEFAKLDVEA FKRDVIDLIN TSQDWWPADY GSYAGLFIRM SWHAAGTYRI
FDGRGGAGQG SQRFAPLNSW PDNANLDKAR RLLWPIKRKY GNKISWADLI AYAGNAALES
AGFQTFGFAF GREDIWEPEE MLWGQEDTWL GTDKRYGGTN DSDTRELAEP FGATTMGLIY
VNPEGPEGKP DPLAAAHDIR ETFGRMAMND EETAALIVGG HTLGKTHGAA DVNVGPEPEG
APIEQQGLGW KCPFGTGNAG DTVTSGLEVV WTTTPTKWSN AYLELLYGYE WELTKSPGGA
WQFEAKDAEA IIPDPFGGPP RKPTMLVTDV SMRVDPIYGP ITRRWLDHPE EMNEAFAKAW
YKLMHRDMGP VSRYLGPWVA EPQLWQDPVP AVDHELIDES DIAALKAAVL QSGLSVPQLV
KTAWASASSF RGTDKRGGAN GARLRLEPQR SWEANEPSEL DKVLPVLERI QQDFNASATG
GKKVSLADLI VLAGSAAVEK AAKDGGYEIS VHFAPGRTDA SQEQTDVDSF AVLEPRADGF
RNFARPGEKT PLEQLLVDKA YFLDLTAPEM TALIGGLRTL NANHGGSKHG VFTERPGVLS
NDFFVNLLDM GTEWKPSELT ENVYDGKDRA TGQPKWTATA ADLVFGSNSV LRAVVEVYAQ
DDNQGKFVED FVAAWVKVMN NDRFDLG