Gene Mpe_A3740 details


Gene Information

Locus tag: Mpe_A3740
Symbol:
ID: 4786029
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3958566
End bp: 3960713
Gene Length: 2148 bp
Protein Length: 715 aa
Translation table: 11
GC content: 70%
IMG OID: 640092323
Product: catalase
Protein accession: YP_001022928
Protein GI: 124268924
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0376] Catalase (peroxidase I)
TIGRFAM ID: [TIGR00198] catalase/peroxidase HPI


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.150597
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACCG AATCGAAGTG CCCGTTCAAC CACGCCGCCG GCGGCGGCAC GACCAACCAG 
GACTGGTGGC CCTCCCAGTT GCGGCTGGAG CTGCTGAACC AGCACTCGAG CAAGTCCGAT
CCGCTGGGTG CCGGCTTCGA TTACGCCGAA GAGTTCAAGA AGCTCGACTA CTTCGCGCTC
AAGAAGGACC TGCTGGCGCT GATGACCGAT TCGCAGGACT GGTGGCCGGC CGACTTCGGC
CACTACGGCC CGCAGTTCGT GCGCATGGCC TGGCACGCCG CCGGCACCTA CCGCACGGCC
GACGGCCGCG GCGGCGGCGG CCGCGGGCAG CAGCGCTTCG CGCCGCTCAA CAGCTGGCCC
GACAACGTGA ACATCGACAA GTCGCGCCGC CTGCTGTGGC CGATCAAGCA GAAGTACGGC
CAGCGCATCA GCTGGGCTGA CCTGCTGATC CTGACCGGCA ACGTGGCGCT GGAGTCGATG
GGCTTCCGCA CGTTCGGCTT CGCCGGCGGC CGAGAGGACG TGTGGGAACC CGACAACGAC
GTGAACTGGG GCAACGAGAC CACCTGGCTG GCCACCGACA AGCGCTTCAC CGGCGACCGT
GATCTCGACC AGCCACTGGC CGCCACCCAC ATGGGCCTGA TCTACGTGAA CCCCGAGGGT
CCCAACGCCA GCGGCGATCC GCTTGCCGCC GCCAAGGACA TCCGCGCGAC CTTCGGCCGC
ATGGCGATGG ACGACGAGGA GATCGTGGCG CTGATCGCCG GCGGCCACAC CTTCGGCAAG
GCGCACGGCG CCGCGCCCGA ATCGCACAAG GGTCCCGAGC CCGAAGGCGC GCCGCTCGAG
GCGCAGGGCC TGGGCTGGAC CAGCAGCTTC GGCAGCGGCC ACGGCAAGGA CACCGTCTCC
AGCGGCCTGG AGGTCACCTG GACCACCACG CCGGCGCGCT GGAGCAACGA CTTCTTCGAG
CACCTGTTCA AGTTCGAATG GGAGCTGACG CAATCGCCCG CCGGCGCGAA GCAGTGGACG
GCCAAGGATG CGCCGGAGAT CATTCCCGAC GCGCACGTCC CCGGCAAGAA GCTCAAGCCG
ACGATGCTCA CCACCGACCT GACGCTGCGC GTCGACCCCG AGTTCGAGAA GATCTCGCGC
CGTTTCCTCG ACAACCCGCA GGGCTTCGCC GACGCCTTCG CGCGCGCGTG GTTCAAGCTC
ACCCACCGCG ACATGGGCCC GAAGGTGCGC TACCTCGGCC CCGAGGTGCC GAAGGAAGAG
CTGCTCTGGC AGGACCCGCT GCCCCCGGCC ACGCTGCCGG CGCCGAATGC CGCCGACGTC
GCCGAGCTGA AGGCGAAGAT CGCCGCATCG GGCCTGACGG TCGCACAGCT CGTGGCCACC
GCCTGGGCCT CGGCCTCCAC CTTCCGCGGC GGCGACAAGC GCGGCGGGGC CAACGGCGCG
CGCCTTCGCC TGGCACCGCA GAAGGACTGG GAAGCCAACA CCCCGGGCGA GCTGGCCAAG
GTGCTGGCGA CGCTGGAGAC CCTCCAGAAG GCCTCGGGCA AGTTCTCGCT GGCCGACGTC
ATCGTGCTGG CTGGCGGCGT GGGCGTGGAG CAGGCCGCCA AGGCGGCGGG TGTCGGCATC
GAGGTGCCGT TCGCCCCCGG TCGCGTCGAC GCCACGCAGG AGCAGACCGA TGTGGAGTCC
TTCGCCTTCC TGGAACCGGT GGCCGACGGC TTCCGCAACT ACTTCAAGGG CCCGGGCAGC
GTGCCGGTGG AGCACCTGCT GGTCGACAAG GCGCAGCTGC TCACGCTCAC CGCTCCCGAG
ATGACGGTGC TGGTCGGCGG GCTGCGCGTG CTGGGCGCCA ACGCCGGCGG CAGCCGGCAT
GGCGTGTTCA CCGACCGGCC GGGCGTGCTG ACCCCCGACT TCTTCGTCAA CCTGCTCGAC
ATGCGCACAA CGTGGCAGCC GGCCAACGGT GTGTACGAGG GCAAGGACCG CCAGACCGGC
CAGCAGAAGT GGACCGCGAC GCGGGTGGAC CTGGCGTTCG GTTCCAACGC CGTGCTGCGC
GCGCTGGCCG AGGTGTACGC CAGCGCCGAC GGTCAGACCA AGTTCGTGCA CGACTTCGTG
GCCGCGTGGA CCAAGGTCAT GAACCTGGAC CGCTACGACC TGGCCTGA
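
The gene sequence is printed in space-separated blocks of ten bases, so it needs light cleaning before any computation. As a minimal sketch, the snippet below removes the whitespace and computes the GC percentage; the helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database rounds its reported GC content is not stated in this record.

```python
def clean_sequence(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq_text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, pasted as it appears in the record.
first_line = "ATGGCGACCG AATCGAAGTG CCCGTTCAAC CACGCCGCCG GCGGCGGCAC GACCAACCAG"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(first_line), 1))
```

Pasting the full gene sequence into `gc_percent` gives a value that can be compared with the GC content field.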
 
Protein sequence
MATESKCPFN HAAGGGTTNQ DWWPSQLRLE LLNQHSSKSD PLGAGFDYAE EFKKLDYFAL 
KKDLLALMTD SQDWWPADFG HYGPQFVRMA WHAAGTYRTA DGRGGGGRGQ QRFAPLNSWP
DNVNIDKSRR LLWPIKQKYG QRISWADLLI LTGNVALESM GFRTFGFAGG REDVWEPDND
VNWGNETTWL ATDKRFTGDR DLDQPLAATH MGLIYVNPEG PNASGDPLAA AKDIRATFGR
MAMDDEEIVA LIAGGHTFGK AHGAAPESHK GPEPEGAPLE AQGLGWTSSF GSGHGKDTVS
SGLEVTWTTT PARWSNDFFE HLFKFEWELT QSPAGAKQWT AKDAPEIIPD AHVPGKKLKP
TMLTTDLTLR VDPEFEKISR RFLDNPQGFA DAFARAWFKL THRDMGPKVR YLGPEVPKEE
LLWQDPLPPA TLPAPNAADV AELKAKIAAS GLTVAQLVAT AWASASTFRG GDKRGGANGA
RLRLAPQKDW EANTPGELAK VLATLETLQK ASGKFSLADV IVLAGGVGVE QAAKAAGVGI
EVPFAPGRVD ATQEQTDVES FAFLEPVADG FRNYFKGPGS VPVEHLLVDK AQLLTLTAPE
MTVLVGGLRV LGANAGGSRH GVFTDRPGVL TPDFFVNLLD MRTTWQPANG VYEGKDRQTG
QQKWTATRVD LAFGSNAVLR ALAEVYASAD GQTKFVHDFV AAWTKVMNLD RYDLA
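
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the compact standard genetic code string; translation table 11 assigns the same amino acids as the standard code and differs only in which codons may serve as starts. It is a simplified illustration: alternative start codons are not handled, which does not matter here because this gene begins with ATG. The function name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered T, C, A, G at each codon position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(seq_text: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
print(translate_cds("ATGGCGACCG AATCGAAGTG CCCGTTCAAC"))  # MATESKCPFN
```

The output matches the first block of the protein sequence; passing the complete gene sequence should yield the full 715-residue protein.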