Gene Mpe_A3791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3791 
Symbol 
ID4785960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp4008908 
End bp4010704 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content70% 
IMG OID640092374 
Producthypothetical protein 
Protein accessionYP_001022979 
Protein GI124268975 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000380463 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGAGCGCAG CGGCTTCGCT CGAGCTGGGT CTGGTCGGCA ACTGCGCGAT CAGCGCGCTG 
ATCGACCGCG AGGCCCGCAT CGTCTGGTGC TGCCTGCCGC GCTTCGACGG CGACCCGACC
TTCCACGCGC TGATCGACAC GCCCGACGCG TGGCCGGCCG ACGGCAGCTG GTGCATCGAG
ATCGAGAACT TCGTGCGCAG CGAGCAGGCC TACGACGAAG GCACCGCCAT CCTGCGCACG
CGGCTGTTCA GTGCCGACGG CGATGTGATC GAGATCACCG ATTTCGCACC GCGCTTCCTC
AGCCGCGACC GCACCTTCCG GCCCGCGATG CTGGTGCGGC GTGTGCATCC GCTGCAGGGC
CATCCGCGCG TGCGCGTCAG CCTGCGGCCT CGCGCCGACT GGGGGCGCGT CGCGCCGGAG
GTCGCGCGCG GCAGCCATCA CTTGCGATAC GCCGGCCCGA CCGGGACGCT GCGTCTCACC
ACCGATGCGC CCATCACCTA TGTGCAGGAC GAGACCTGGT TCTCGCTGGC GGCGCCGCTG
AACCTGCTGC TCGGCCCCGA CGAGACGCTG GAGCGCGGCG CGGCCGAGGT GGCGCGCGAT
TTCGAGGAAC TCACCGCGCT CTACTGGCGC ACCTGGACCC GCCGGCTGGC CCTCCCGCTG
GAATGGCAGG CGGCCGTTAT CCGCGCCGCG ATCACGCTCA AGCTGTGCCT GTTCGACGAG
ACCGGCGCGA TCGTCGCGGC GATGACCACC AGCCTGCCCG AGGCGCCGGG CAGCGGCCGC
AACTGGGATT ACCGGTACTG CTGGCTGCGC GACGCCTTCT TCGTGGTGCG GGCGCTCAAC
AGCCTGTCCG AACTGGAGAC GATGGAGGAC TACCTGCGTT GGCTGCACAA CGTGGTGCGC
GACGCGCGCG GGGGCTTCAT CCAGCCGCTG TACGGCCTGG GCCTTGAGAA GGATCTGCCC
GAGGAAGAAC TCCCGCATCT GCGCGGCTAC CGCGGCATGG GTCCGGTGCG GCGCGGCAAC
CAGGCGCACA CGCACGCGCA GCACGACGTG TACGGCAACG TGCTGCTCGG GGCGTCGCAG
TCCTTCCATG ATCTGCGGCT GTTCCGCCGC GCTGACGCCG ACGACTTCGC GCGGCTCGAG
GCGGTGGGTG AGCAGGCCTG GCTGGTGCAC GACCAGCCCG ATGCGGGAAT GTGGGAGCTG
CGCACGCGCG CGCGGGTGCA CACCTCGTCG GCGCTGATGT GCTGGGCGGC CTGCGATCGG
CTGGCGAAGA TCGCCACCCG TCTGGGCCTG CCCGAGCGGG ACGCCCACTG GCGCGGCCGG
GCCGCGGCGA TCCGAGAGCG CGTGCTGCGC GAAGCCTGGA GCGACAAGCG CCAGGCCTTC
GCCGAGAGCC TGGGTGGCGA GAATCTCGAT GCCAGCGTGC TGCTGATGGC CGAAGTGGGC
TTCATCGACC CGATGGACCC GCGCTTCGTC GCGACGGTCG ACGCGCTGGA GGCGCACCTC
TGCGATGGCC CCTACATGCG TCGCTACGAG GCGGCCGACG ACTTCGGCAA ACCTGAGACG
GCCTTCAACA TCTGCGCCTT CTGGCGCATC GACGCGTTGG TGCGCATCGG CCGGCGCGAG
CAGGCCCGGC AGATCTTCGA ATCCATGCTC GCGTCGCGCA ATGCGCTGGG CCTGCTCTCG
GAAGACACCG ACGCGCGCAC CGGCGAGCTG TGGGGCAACT TCCCGCAGAC CTATTCGATG
GTCGGCATCA TCAATGCCGC AATGCGTCTG TCGGTGCCCT GGGACACGGT GATCTGA
 
Protein sequence
MSAAASLELG LVGNCAISAL IDREARIVWC CLPRFDGDPT FHALIDTPDA WPADGSWCIE 
IENFVRSEQA YDEGTAILRT RLFSADGDVI EITDFAPRFL SRDRTFRPAM LVRRVHPLQG
HPRVRVSLRP RADWGRVAPE VARGSHHLRY AGPTGTLRLT TDAPITYVQD ETWFSLAAPL
NLLLGPDETL ERGAAEVARD FEELTALYWR TWTRRLALPL EWQAAVIRAA ITLKLCLFDE
TGAIVAAMTT SLPEAPGSGR NWDYRYCWLR DAFFVVRALN SLSELETMED YLRWLHNVVR
DARGGFIQPL YGLGLEKDLP EEELPHLRGY RGMGPVRRGN QAHTHAQHDV YGNVLLGASQ
SFHDLRLFRR ADADDFARLE AVGEQAWLVH DQPDAGMWEL RTRARVHTSS ALMCWAACDR
LAKIATRLGL PERDAHWRGR AAAIRERVLR EAWSDKRQAF AESLGGENLD ASVLLMAEVG
FIDPMDPRFV ATVDALEAHL CDGPYMRRYE AADDFGKPET AFNICAFWRI DALVRIGRRE
QARQIFESML ASRNALGLLS EDTDARTGEL WGNFPQTYSM VGIINAAMRL SVPWDTVI