Gene Mext_0038 details

Gene Information

Locus tag: Mext_0038
Symbol:
ID: 5835558
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 43248
End bp: 45533
Gene Length: 2286 bp
Protein Length: 761 aa
Translation table: 11
GC content: 68%
IMG OID: 641365822
Product: Kojibiose phosphorylase
Protein accession: YP_001637537
Protein GI: 163849494
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1554] Trehalose and maltose hydrolases (possible phosphorylases)
TIGRFAM ID:
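
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the record does not state either convention explicitly). The function and constant names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 43248
END_BP = 45533
GENE_LENGTH_BP = 2286
PROTEIN_LENGTH_AA = 761


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


coords_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_ok = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(coords_ok, protein_ok)  # prints "True True"
```

Under these assumptions the fields agree: 45533 - 43248 + 1 = 2286 bp, and 2286 / 3 = 762 codons, of which one is the stop codon, leaving 761 residues.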


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTGAGG TGCTTCGACC GACACAGGAG CCCGGCTGGG TTCTCACGCA CGAAGGCTAC 
AGCGTGCTTA CGGAGAGCGC GGTCGAATCC CGCTTTGCTC TCGGCAACGG CTTCCTCGGC
ATGCGTGCCG CGCGCTCGAC GGGCCGAGGG CCGACCTGGG TGAGCTGGCT CGGCTACATC
CGATGGGCCT CGTGGCCGCG CTGCTACGTC GCCGGGCTGT TCGACATGCC CAACACCGAG
CCGCCTGTGC CGGCGCTCGT GCCCGTCGCC GACTGGTCGC GGATCCGCCT CATCCTCGAT
GGGGAGCCGC TGGTGGTGCG CGAAGGCGAG ATTCTTCATG GCATGCGGCG GCTCGACATG
CGGCGCGGCG TGCTTCTCTC CGAATGGACG CATCGGACAC CGGCGCAGGT GACCGCGAAG
GGCCACGAGC TGCGCCTCCT GTCGCTGGCG GACCGGTCGG TGGGGCTCCA GCTTCAGCAG
ATCGTGCTGG ACCGCGACGA CATCGACGTC CGCCTCGAAG CGAGCTTCGG GCTAGCCGGC
GTCGGTATGG AGCCGGTGCG TCTCGAAAAC GACCTCGGCG CGTGGCGCAC CGAGGGGACC
GGTAAGGTCG TGGCGATGGC AGGTGCCGCA TCGTTGCATC TTGATGGCGC CTTGGCCGAC
TCCGAGCGCC CATTTCCGCT GCGTTGGATC TGGCGTTGGC GCTCGAAGGC TGGCCAGGTG
GCGCAATTCG CCCGCCTCGT CGCCGTCGCT CGCGCCGAGC GGTCGGAAGA GGATCCTGCG
CCCCGCGCCG CGGCGACGCT CGCGCGCAGC ACATCGGTGG GCTGGCGCGC GATCCTCAAG
GCTCATGAAT CCGCATGGGA TGCACACTGG AGCGACAGCG GCATCGTCAT CGACGGCGAC
GATGACCTGC AGCGCGCGCT GCGGTTTGCC GTGTACCACC TGACGAGCGC CGCGAACCCG
AGCGACGACC GGGTTTCGAT CGGCGCGCGC GCGCTGACCG GCGATGCCTA TTTCGGCCAC
GTCTTCTGGG ACACCGAGAT CTATCTTCTG CCGTTCTACA CCGCGGTCTG GCCGGAAGCG
GCGCGCGCGC TGCTGATGTA CCGGTTCCAT ACGCTGCCCG GAGCACGGGC CAAGGCGACG
CTCGGCGGCT GGCGAGGCGC CCTCTATCCA TGGGAATCGG CCGACACCGG CGATGAGACT
ACGCCGGACT CGGTGCTGGG GCCCGACGGG AAGCCGATCG AGATCCTGAC TGGCAAGATG
GAGCACCACA TCAGCGCCGA CGTCGCCTAC GCGGTGTGGC AGTACTGGCG TGCCACCGGC
GACGACGATT TCTTCCGCGA TGCGGGGGCG GAAATTCTCC TTGAGACGGC GCGTTTCTGG
GCGTCCCGAG CCGTCGCCGA AGCGGATGGC CGGCGCCACA TCCGCCATGT GATCGGGCCG
GACGAGTACC ATGAGGATGT CGACGACAAC GCCTTCACCA ACGTGATGGC GCGCTGGAAC
ATCGGCTGCG CCCTGGAGGC GCTCGACCTG TTGCGCAAGG GTTGGCCGGA CCGTGCCGAG
GCGCTTCGAG ACAAGCTCGC GCTCGACGAC AGGGAACTCG ATGACTGGCG GGACGCGGTC
GCGCGGATCG TCACCGGCCT CGACCCCGCG ACCGGGCTGT ACGAGCAGTT CGCTGGCTTC
CACGGCCTCA AGCAGCTGAA CGTCGCGGAC TATGTCGACC ATGCACTGCC GATCGACGTG
GTCATCGGCC GGGAGCAGAC GCAAAGCTCG CAGGTGATCA AGCAAGCCGA CGTCGTCGCG
CTGATCGCCT TGTTGCCCCA GGAATTTCCC GGACAGGGAG CGGAGATCAA TTTCCGCCAT
TACGAGCCGC GCTGTGCCCA TGGCAGCTCC TTGAGCGCCG CGATGCATGC CCGCGTGGCC
GCGCGTCTGG GCGCCTCGGA CACGGCTCTT CGATACATGC GCGAGACCGC GTCTCTCGAC
CTCGACCTCG ATCCGAACAG CGCCGGCGGC GTCCGGATCG CCGGGCTCGG CGGGTTGTGG
CAGGCGGCGA TCCTGGGCAT CGCCGGCCTG AACTTAGCGG GCGACACGCT GGAGCTCGAT
CCCAAGCTGC CGCCTCAGTG GGATACCCTT TCGTTCAAGG TCTGGTGGAG AGGCCGATCC
GTCGGGCTCA GCGTCAGCCG CCCTATGCTG GAGGCCAGGC TGATGGACGG AGACGGGATG
GACGTCACGG TCGCGGGCGT GACGCAGCAC CTGACACCTG GATCGCCACT GCGATTCGAG
CTGTAG
 
Protein sequence
MLEVLRPTQE PGWVLTHEGY SVLTESAVES RFALGNGFLG MRAARSTGRG PTWVSWLGYI 
RWASWPRCYV AGLFDMPNTE PPVPALVPVA DWSRIRLILD GEPLVVREGE ILHGMRRLDM
RRGVLLSEWT HRTPAQVTAK GHELRLLSLA DRSVGLQLQQ IVLDRDDIDV RLEASFGLAG
VGMEPVRLEN DLGAWRTEGT GKVVAMAGAA SLHLDGALAD SERPFPLRWI WRWRSKAGQV
AQFARLVAVA RAERSEEDPA PRAAATLARS TSVGWRAILK AHESAWDAHW SDSGIVIDGD
DDLQRALRFA VYHLTSAANP SDDRVSIGAR ALTGDAYFGH VFWDTEIYLL PFYTAVWPEA
ARALLMYRFH TLPGARAKAT LGGWRGALYP WESADTGDET TPDSVLGPDG KPIEILTGKM
EHHISADVAY AVWQYWRATG DDDFFRDAGA EILLETARFW ASRAVAEADG RRHIRHVIGP
DEYHEDVDDN AFTNVMARWN IGCALEALDL LRKGWPDRAE ALRDKLALDD RELDDWRDAV
ARIVTGLDPA TGLYEQFAGF HGLKQLNVAD YVDHALPIDV VIGREQTQSS QVIKQADVVA
LIALLPQEFP GQGAEINFRH YEPRCAHGSS LSAAMHARVA ARLGASDTAL RYMRETASLD
LDLDPNSAGG VRIAGLGGLW QAAILGIAGL NLAGDTLELD PKLPPQWDTL SFKVWWRGRS
VGLSVSRPML EARLMDGDGM DVTVAGVTQH LTPGSPLRFE L
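
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own method. It builds the codon table from the amino acid string that NCBI lists for translation table 11 (identical to the standard code for elongation codons) and translates only the first line of the gene sequence, comparing it with the first two blocks of the protein sequence. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in NCBI order (bases T, C, A, G; first base
# varies slowest). Table 11 differs from the standard code only in its
# permitted start codons, which does not matter here because the gene
# starts with ATG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and first two protein blocks, copied
# from the record above.
first_gene_line = (
    "ATGCTTGAGG TGCTTCGACC GACACAGGAG CCCGGCTGGG TTCTCACGCA CGAAGGCTAC"
)
first_protein_blocks = "MLEVLRPTQE PGWVLTHEGY"

print(translate(first_gene_line) == clean(first_protein_blocks))  # prints "True"
```

Passing the full gene sequence to `translate` in the same way would allow the complete 761-residue protein to be compared.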