Gene Mext_2856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2856 
Symbol 
ID5834751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3203679 
End bp3205610 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content72% 
IMG OID641368657 
Productglucosyltransferase MdoH 
Protein accessionYP_001640317 
Protein GI163852274 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2943] Membrane glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGCC CAGACTTGCC GCAGGCTGAA CGCTCCGTCG AGGCGGGGGC CGGGGGTGGA 
GCCCGGCCGA GGGGCGCGGG CCCTGCGCCG CGGCCGGCCT CGTTCCGCCT GCGCCGCCTC
GCGCTGGCGG GGCCGACGCT GGCCATCGCC GCGACCATCG CCGCCCTGGC GCTCGCGGCC
TACGGTGTGC CGGAGACGTG GCTCGGGCGC GCCGTGCTCG GGCTCTTCGT CGTGCTGATG
GCGTGGCAGA GCTTCACCGC GTGGCAGTAT CTCTACGGGC TCATCGCCGC ACTGATGGGC
GACCGCGCCC TGTCCGCGCT GGAGCGCCGC GCCGCCACGA TCTCGACCCG TCCCACGGGC
TTGAGCCGCA CGGCGGCCGT GGTGGCGATC CATGCCGAGG ACCCGCTCGC GGTGTTCTCG
GCCATCCGGG TCATGGCCCG CTCGCTCCAG CGCGAGGGCG GCGACGGTTC GGACATCGAC
ATCTTCGTCC TCTCCGATAC CCGCGAGGGT GCGATCAGCG CGGTCGAGGA GCACGAATTC
GCCCGCATCC AGGACTGGAG CCAGCGCGAA GGCCGCGGCA TGCCGCGCAT CCGCTACCGC
CGCCGCGCCG ACAATTCCGG CCGCAAGGCG GGCAACATCG CCGAATTCTG CACCACCTAC
GGGCACGAAT ACGACTTCAT GATCGTGCTC GACGCCGACA GCCTGATGAC CGGCGCCGCC
ATGCGCCGGC TCGCCCGGCT GATGGAGGAG AACCCGCGCA CCGGCCTGAT CCAGACCGTT
TCCTACGCCG CCGGCCGCGA CACCCTGTTC GCGCGTATCC AGCAATTCGC CGTGCGCCTC
TACGCGCCGC TCTCCTTGCG CTGCCTCGAG ACGTGGCAGG GGCCGGACGG CTCCTACTGG
GGTCACAACG CGATCCTGCG CATCGAGGCG TTCGCGAACA ACGCCGAGCT TCCGGTCCTC
TCCGGCAAGC CGCCTTTGGG CGGCGAGATC CTCTGCCACG ACATCGTCGA GGGGGCGCTG
CTGCGCCGCG CCGGCTGGGA CGTGCGCCTG CTGCCGGAGA TGGGCGGCAC CTGGGAGGAA
ATGCCCACCA ACCTCATCGA CCTGCTCGGG CGCGAGCGGC GCTGGTGCCA GGGCAACCTG
CAGCATCTCC GCGTGCTGAC GATGAAGGGG CTGCTCGGTG CGAGCCGCTG GCATCTCGGC
GTGGGTATCC TCGGCTACTG CGTCTATCCG CTCTGGATCG CTTTGCTCGC GCTCGGCACA
TGGCAGGCGG TGCGCTCCGG CGAACTCGGG CTGATCGGCT ACGGCCTCGA CGGCGGCAAC
GCGGCGGCCT GGGGGCTCGC CGCCCTCGTC ATCGCGGTGA TGGCGCTGCC GAAGCTCCTG
AGCCTCGGCT ACGTCCTCGC CTCAGCCCAG CGCCGGGCGG ATTTCGGCGG CACCCGCTCG
CTGCTGGTCA GCGCGGCGCT CGAACAGGCG ATCTGGGTCC TGCTCTGGCC GGTGATGGCG
CTGTTCGCGG CAGGGGCCGT GGTGACGACC CTGTTCGGGC GGGTGGTTCG CTGGGACACG
CAGTCCCGCG ACGATCGCAG CGTGCCGTGG CGGGAGGCGT TCCGCCTTCA GAGCGACGCG
GTCGCGGCGG GCGGGGCGCT CGCGGTGCTG CTCGCTTTCG GTAATTTCTG GCTCGCCCTG
TGGATGGCGC CGGTCGCCCT CGCCCTGCTG ACGAGCCCGT TCCAGAGCGT GCTCACCAGC
AGCACCCGCC TCGGCCTCGG CTCGAAGGCG CGCGGCCTCT TCCTCACCGA GGACGACACG
CGCCCGGCTC CGGAACTGCT CGAACTGCAC CAGAGCCGCA CCGCCGGTGC CGAGCCGGCG
GCGATCACCG CCGCGCCGTC CCCGTGGCTC CCCGTGACCA TCGACGAGGC GAGCGCGCCG
ACCCTGCGCT GA
 
Protein sequence
MSSPDLPQAE RSVEAGAGGG ARPRGAGPAP RPASFRLRRL ALAGPTLAIA ATIAALALAA 
YGVPETWLGR AVLGLFVVLM AWQSFTAWQY LYGLIAALMG DRALSALERR AATISTRPTG
LSRTAAVVAI HAEDPLAVFS AIRVMARSLQ REGGDGSDID IFVLSDTREG AISAVEEHEF
ARIQDWSQRE GRGMPRIRYR RRADNSGRKA GNIAEFCTTY GHEYDFMIVL DADSLMTGAA
MRRLARLMEE NPRTGLIQTV SYAAGRDTLF ARIQQFAVRL YAPLSLRCLE TWQGPDGSYW
GHNAILRIEA FANNAELPVL SGKPPLGGEI LCHDIVEGAL LRRAGWDVRL LPEMGGTWEE
MPTNLIDLLG RERRWCQGNL QHLRVLTMKG LLGASRWHLG VGILGYCVYP LWIALLALGT
WQAVRSGELG LIGYGLDGGN AAAWGLAALV IAVMALPKLL SLGYVLASAQ RRADFGGTRS
LLVSAALEQA IWVLLWPVMA LFAAGAVVTT LFGRVVRWDT QSRDDRSVPW REAFRLQSDA
VAAGGALAVL LAFGNFWLAL WMAPVALALL TSPFQSVLTS STRLGLGSKA RGLFLTEDDT
RPAPELLELH QSRTAGAEPA AITAAPSPWL PVTIDEASAP TLR