Gene Mvan_2984 details


Gene Information

Locus tag: Mvan_2984
Symbol:
ID: 4645112
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 3153802
End bp: 3156015
Gene Length: 2214 bp
Protein Length: 737 aa
Translation table: 11
GC content: 66%
IMG OID: 639806464
Product: catalase/peroxidase HPI
Protein accession: YP_953795
Protein GI: 120403966
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0376] Catalase (peroxidase I)
TIGRFAM ID: [TIGR00198] catalase/peroxidase HPI
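
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length. Both are assumptions about this database's conventions, although they agree with the values shown.

```python
# Values copied from the record above.
start_bp = 3153802
end_bp = 3156015

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2214 0 737
```

Under these assumptions the result matches the reported Gene Length (2214 bp) and Protein Length (737 aa).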


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.156049
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.603966
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCAGAAG CAACTGAACA TCCCCCGATT GGCGAAGCTC AGACCGAGCC CGCCCAGAGC 
GGCTGTCCGA TGGTCATCAA GCCGCCGGTG GAGGGTGGCA GTAACCGGGA CTGGTGGCCC
AATGCGGTCA ACCTGAAGAT GCTGCAGAAG GATCCCGAAG TCATCGACCC GATGGACGAA
GGCTACGACT ACCGCGAAGC CGTGCAGACG CTCGACGTCG ACCAGCTGGC CCGGGATTTC
GACGAGCTGT GCACCAACTC CCAGGACTGG TGGCCCGCCG ACTTCGGTCA TTACGGCCCG
CTGTTCATCC GGATGTCGTG GCACGCTGCA GGCACCTACC GGGTACAGGA CGGCCGCGGC
GGCGCCGGAA AGGGCATGCA ACGGTTCGCG CCGCTGAACA GCTGGCCCGA CAACGTCAGC
CTGGACAAGG CCCGCCGGTT GCTGTGGCCG CTGAAGAAGA AGTACGGCAA GAAGCTGTCG
TGGTCCGACC TCATCGTCTA CGCCGGAAAC CGGGCGATGG AGAACATGGG CTTCAAGACG
GCCGGATTCG CGTTCGGCCG CCCGGATTAC TGGGAGCCCG AGGAGGACGT CTACTGGGGC
GCCGAGCACG AGTGGCTCGG CTCGCAGGAT CGATACGCCG GCGCCAACGG TGACCGGACC
AAGCTGGAGA ACCCGCTCGG CGCCAGCCAT ATGGGCCTGA TCTACGTCAA CCCCGAAGGC
CCCGAGGGTA ATCCGGATCC GATCGCCGCG GCCATCGACA TCCGCGAGAC GTTCGGGCGG
ATGGCGATGA ACGATGTCGA GACCGCCGCG CTGATCGTCG GTGGGCACAC CTTCGGCAAG
ACCCACGGCG CCACCGACAT CGTGAACGGT CCGGAGCCCG AGGCGGCGCC GCTGGAGCAG
ATGGGCCTCG GCTGGAGCAA CCCGGGTGTC GGCATCGACA CCGTCAGCAG CGGTCTCGAG
GTGACCTGGA CCCACACCCC CACCAAGTGG GACAACTCGT TCCTGGAGAT CCTCTACGGC
AACGAATGGG AACTGTTCAA GAGCCCGGCC GGCGCCAATC AGTGGCGCCC GAAGGACAAC
GGCTGGGCCA ACTCGGTACC GATGGCGCAG GGCACCGGCA AGACCCACCC GGCGATGCTG
ACCACCGACC TGTCGATGCG GATGGATCCG ATCTACGGCG AGATCACCCG CCGCTGGCTG
GACCATCCCG AGGAACTGGC CGAGGAATAC GCCAAGGCCT GGTTCAAGCT GCTGCACCGT
GACATGGGCC CGGTCCAGCG CTACCTCGGG CCGCTGGTGC CCACGCAGAC CTGGCTGTGG
CAGGACATCG TCCCGGCCGG CAAGCCGCTG TCGGACGCCG ATGTCGCCAC CCTCAAGGGA
GCCATCGCCG ATTCGGGTCT GACTGTGCAA CAGCTGGTTT CGACCGCATG GAAGGCCGCC
TCGTCGTTCC GCATCAGCGA CATGCGTGGT GGTGCCAACG GTGGCCGGAT CAGGCTGCAG
CCCCAGTTGG GCTGGGAGTC CAACGAACCC GACGAGTTGG CGCAGGTCAT CAGCAAGCTG
GAGGAGATCC AGGGGTCGTC CGGGATCGAC GTGTCGTTCG CCGACCTGGT CGTGCTCGGC
GGCAACGTGG GAATCGAAAC TGCCGCCAAG GCGGCCGGAT TCGACATCGA GGTGCCGTTC
AGCTCGGGCC GTGGCGATGC CACCCAGGAG CAGACCGACG TCGAGGCGTT CTCCTACCTG
GAACCCAAGG CGGACGGCTT CCGTAACTAC GTCGGCAAGG GCCTGAACCT GCCTGCCGAG
TACCAGCTCA TCGATCAGGC CAACCTGCTC AACCTGTCGG CCCCGCAGAT GACCGTGCTG
ATCGGTGGCC TGCGGGCACT CGGCATCACC CACGGTGACA GCAAGCTCGG CGTACTCACC
GACACGCCGG GTCAGCTCAC GAACGACTAC TTCGTCAACC TCACCGACAT GGGCGTCAAG
TGGGCGCCGG CCCCCGCGGA CGACGGCACC TACGTCGGCA CCGACCGGGA CACCGGTGAG
GTGAAGTACA CGGCCAGCCG CGTCGACCTG CTGTTCGGTT CGAACTCGCA GCTGCGGGCG
CTGGCCGAGG TCTACGCCGA AGACGATTCC AGGGACAAGT TCGTCAAGGA TTTCGTCGCC
GCGTGGGTCA ACGTCATGGA TGCCGACCGT TACGACATCG GCAAGGGAGC CTGA
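
The sequence above is laid out in blocks of 10 bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input. How the database rounds its reported GC content is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as an illustration only.
first_line = "GTGCCAGAAG CAACTGAACA TCCCCCGATT GGCGAAGCTC AGACCGAGCC CGCCCAGAGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full 2214 bp gene sequence to the same two functions gives a value that can be compared with the GC content field in the Gene Information section.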
 
Protein sequence
MPEATEHPPI GEAQTEPAQS GCPMVIKPPV EGGSNRDWWP NAVNLKMLQK DPEVIDPMDE 
GYDYREAVQT LDVDQLARDF DELCTNSQDW WPADFGHYGP LFIRMSWHAA GTYRVQDGRG
GAGKGMQRFA PLNSWPDNVS LDKARRLLWP LKKKYGKKLS WSDLIVYAGN RAMENMGFKT
AGFAFGRPDY WEPEEDVYWG AEHEWLGSQD RYAGANGDRT KLENPLGASH MGLIYVNPEG
PEGNPDPIAA AIDIRETFGR MAMNDVETAA LIVGGHTFGK THGATDIVNG PEPEAAPLEQ
MGLGWSNPGV GIDTVSSGLE VTWTHTPTKW DNSFLEILYG NEWELFKSPA GANQWRPKDN
GWANSVPMAQ GTGKTHPAML TTDLSMRMDP IYGEITRRWL DHPEELAEEY AKAWFKLLHR
DMGPVQRYLG PLVPTQTWLW QDIVPAGKPL SDADVATLKG AIADSGLTVQ QLVSTAWKAA
SSFRISDMRG GANGGRIRLQ PQLGWESNEP DELAQVISKL EEIQGSSGID VSFADLVVLG
GNVGIETAAK AAGFDIEVPF SSGRGDATQE QTDVEAFSYL EPKADGFRNY VGKGLNLPAE
YQLIDQANLL NLSAPQMTVL IGGLRALGIT HGDSKLGVLT DTPGQLTNDY FVNLTDMGVK
WAPAPADDGT YVGTDRDTGE VKYTASRVDL LFGSNSQLRA LAEVYAEDDS RDKFVKDFVA
AWVNVMDADR YDIGKGA
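
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a simplified illustration, not the database's own pipeline. It uses the standard NCBI codon ordering, whose sense-codon assignments are shared by translation table 11, and it assumes that the first codon always initiates with methionine, which is why the GTG start codon of this gene is read as M. The function name `translate_cds` is hypothetical, and the function raises `KeyError` on bases other than A, C, G, T.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order
# (first, second, third base each cycling over T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        # Simplifying assumption: the first codon initiates with Met.
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCCAGAAG CAACTGAACA TCCCCCGATT"))  # MPEATEHPPI
```

The output matches the first block of the protein sequence (MPEATEHPPI). Translating the full gene sequence with the same function allows the same comparison for all 737 residues.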