Gene Information |
Locus tag | Mext_2856 |
Symbol | |
ID | 5834751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 3203679 |
End bp | 3205610 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641368657 |
Product | glucosyltransferase MdoH |
Protein accession | YP_001640317 |
Protein GI | 163852274 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2943] Membrane glycosyltransferase |
TIGRFAM ID | |
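The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3203679
end_bp = 3205610
listed_gene_length_bp = 1932
listed_protein_length_aa = 643

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons; the final
# codon is the stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
expected_protein_length_aa = codon_count - 1

print(gene_length_bp)              # 1932
print(gene_length_bp % 3)          # 0
print(expected_protein_length_aa)  # 643
```

Under these assumptions the computed values match the listed Gene Length and Protein Length.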
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGCC CAGACTTGCC GCAGGCTGAA CGCTCCGTCG AGGCGGGGGC CGGGGGTGGA GCCCGGCCGA GGGGCGCGGG CCCTGCGCCG CGGCCGGCCT CGTTCCGCCT GCGCCGCCTC GCGCTGGCGG GGCCGACGCT GGCCATCGCC GCGACCATCG CCGCCCTGGC GCTCGCGGCC TACGGTGTGC CGGAGACGTG GCTCGGGCGC GCCGTGCTCG GGCTCTTCGT CGTGCTGATG GCGTGGCAGA GCTTCACCGC GTGGCAGTAT CTCTACGGGC TCATCGCCGC ACTGATGGGC GACCGCGCCC TGTCCGCGCT GGAGCGCCGC GCCGCCACGA TCTCGACCCG TCCCACGGGC TTGAGCCGCA CGGCGGCCGT GGTGGCGATC CATGCCGAGG ACCCGCTCGC GGTGTTCTCG GCCATCCGGG TCATGGCCCG CTCGCTCCAG CGCGAGGGCG GCGACGGTTC GGACATCGAC ATCTTCGTCC TCTCCGATAC CCGCGAGGGT GCGATCAGCG CGGTCGAGGA GCACGAATTC GCCCGCATCC AGGACTGGAG CCAGCGCGAA GGCCGCGGCA TGCCGCGCAT CCGCTACCGC CGCCGCGCCG ACAATTCCGG CCGCAAGGCG GGCAACATCG CCGAATTCTG CACCACCTAC GGGCACGAAT ACGACTTCAT GATCGTGCTC GACGCCGACA GCCTGATGAC CGGCGCCGCC ATGCGCCGGC TCGCCCGGCT GATGGAGGAG AACCCGCGCA CCGGCCTGAT CCAGACCGTT TCCTACGCCG CCGGCCGCGA CACCCTGTTC GCGCGTATCC AGCAATTCGC CGTGCGCCTC TACGCGCCGC TCTCCTTGCG CTGCCTCGAG ACGTGGCAGG GGCCGGACGG CTCCTACTGG GGTCACAACG CGATCCTGCG CATCGAGGCG TTCGCGAACA ACGCCGAGCT TCCGGTCCTC TCCGGCAAGC CGCCTTTGGG CGGCGAGATC CTCTGCCACG ACATCGTCGA GGGGGCGCTG CTGCGCCGCG CCGGCTGGGA CGTGCGCCTG CTGCCGGAGA TGGGCGGCAC CTGGGAGGAA ATGCCCACCA ACCTCATCGA CCTGCTCGGG CGCGAGCGGC GCTGGTGCCA GGGCAACCTG CAGCATCTCC GCGTGCTGAC GATGAAGGGG CTGCTCGGTG CGAGCCGCTG GCATCTCGGC GTGGGTATCC TCGGCTACTG CGTCTATCCG CTCTGGATCG CTTTGCTCGC GCTCGGCACA TGGCAGGCGG TGCGCTCCGG CGAACTCGGG CTGATCGGCT ACGGCCTCGA CGGCGGCAAC GCGGCGGCCT GGGGGCTCGC CGCCCTCGTC ATCGCGGTGA TGGCGCTGCC GAAGCTCCTG AGCCTCGGCT ACGTCCTCGC CTCAGCCCAG CGCCGGGCGG ATTTCGGCGG CACCCGCTCG CTGCTGGTCA GCGCGGCGCT CGAACAGGCG ATCTGGGTCC TGCTCTGGCC GGTGATGGCG CTGTTCGCGG CAGGGGCCGT GGTGACGACC CTGTTCGGGC GGGTGGTTCG CTGGGACACG CAGTCCCGCG ACGATCGCAG CGTGCCGTGG CGGGAGGCGT TCCGCCTTCA GAGCGACGCG GTCGCGGCGG GCGGGGCGCT CGCGGTGCTG CTCGCTTTCG GTAATTTCTG GCTCGCCCTG TGGATGGCGC CGGTCGCCCT CGCCCTGCTG ACGAGCCCGT TCCAGAGCGT GCTCACCAGC AGCACCCGCC TCGGCCTCGG CTCGAAGGCG CGCGGCCTCT TCCTCACCGA GGACGACACG 
CGCCCGGCTC CGGAACTGCT CGAACTGCAC CAGAGCCGCA CCGCCGGTGC CGAGCCGGCG GCGATCACCG CCGCGCCGTC CCCGTGGCTC CCCGTGACCA TCGACGAGGC GAGCGCGCCG ACCCTGCGCT GA
Protein sequence | MSSPDLPQAE RSVEAGAGGG ARPRGAGPAP RPASFRLRRL ALAGPTLAIA ATIAALALAA YGVPETWLGR AVLGLFVVLM AWQSFTAWQY LYGLIAALMG DRALSALERR AATISTRPTG LSRTAAVVAI HAEDPLAVFS AIRVMARSLQ REGGDGSDID IFVLSDTREG AISAVEEHEF ARIQDWSQRE GRGMPRIRYR RRADNSGRKA GNIAEFCTTY GHEYDFMIVL DADSLMTGAA MRRLARLMEE NPRTGLIQTV SYAAGRDTLF ARIQQFAVRL YAPLSLRCLE TWQGPDGSYW GHNAILRIEA FANNAELPVL SGKPPLGGEI LCHDIVEGAL LRRAGWDVRL LPEMGGTWEE MPTNLIDLLG RERRWCQGNL QHLRVLTMKG LLGASRWHLG VGILGYCVYP LWIALLALGT WQAVRSGELG LIGYGLDGGN AAAWGLAALV IAVMALPKLL SLGYVLASAQ RRADFGGTRS LLVSAALEQA IWVLLWPVMA LFAAGAVVTT LFGRVVRWDT QSRDDRSVPW REAFRLQSDA VAAGGALAVL LAFGNFWLAL WMAPVALALL TSPFQSVLTS STRLGLGSKA RGLFLTEDDT RPAPELLELH QSRTAGAEPA AITAAPSPWL PVTIDEASAP TLR
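As a quick consistency check between the two sequences, the first 30 and last 12 nucleotides of the gene sequence can be translated by hand. The sketch below is a minimal illustration, not a full translator: the codon dictionary holds only the 14 codons that occur in these two fragments (their assignments are the same in the standard code and in translation table 11), and the helper name `translate` is hypothetical.

```python
# Fragments copied from the gene sequence, with spaces removed.
first_30 = "ATGTCCAGCCCAGACTTGCCGCAGGCTGAA"
last_12 = "ACCCTGCGCTGA"

# Partial codon table: only the codons needed for the two fragments.
# "*" marks a stop codon.
CODONS = {
    "ATG": "M", "TCC": "S", "AGC": "S", "CCA": "P", "GAC": "D",
    "TTG": "L", "CCG": "P", "CAG": "Q", "GCT": "A", "GAA": "E",
    "ACC": "T", "CTG": "L", "CGC": "R", "TGA": "*",
}

def translate(nt):
    """Translate an in-frame nucleotide string codon by codon."""
    return "".join(CODONS[nt[i:i + 3]] for i in range(0, len(nt), 3))

print(translate(first_30))  # MSSPDLPQAE
print(translate(last_12))   # TLR*
```

The first result matches the opening ten residues of the protein sequence, and the second matches its final three residues followed by the TGA stop codon.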