Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3225 |
Symbol | |
ID | 5835538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 3578738 |
End bp | 3580528 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641369025 |
Product | glycosyl transferase family protein |
Protein accession | YP_001640683 |
Protein GI | 163852640 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0693647 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCCAC GGCCCTTCTC AACGAGGGGC TTCTCTCCGA GGAGGTCTTC TACTGTGCGC TCGCCCGCCG GCTCGGCGCT CCATTTCTCG TTGGGCCTCT CGCGATATGC GAAGGCGTTC CGTACGTGCG CTGTCTCATC TCCGGTGCCG CGCCGCTCGC GGGCTGCGAG GGGGGCGAAA ACCCTCGTCG CGGCACCGCG CGGAGCCGCG GTGGCGCGAC TGATCGCGGC GGCGGACCGG CTGGCGGAGC CTCCGGCCAT CACGACACCG ACGGCCCTGA GGCAGGCTCT GTTCGCCCGG TACGGCGCCG CCATTGCCGA GGACGCGTCC GAGACCCTGG GGCGTCTTCG TCCCGAATGG TCCTGCCGGC CCGGTCCGCT GGCGGTCGAC CTTGCCTTGG CCGGCGGCGT GCTCGCCCTC GTCGTCCTGC TCGCCCGCCT CCCGACGGCG GCGGGTTTCG TCCTGCTGCT GCTGGTGCAG GGTTTGATGC TGGCGCTGCT GACCTTCCGC CTCGCCGCGG CGGCGATCGG GGCGGCCGAG ATCGTGCGTG CCGATGCATC CGAGCCGAGA CCGGTTCTAC TGACGGACGA CGCGCTCCCG ACCTACACGA TTCTCGTCGC CCTCTACCGG GAGGCGCCGG TGGTGCCGCG TCTTCTGGGA GCCTTGCGCC GGCTCGACTA TCCGGCGGCC AAGCTCGACA TCAAGTTCCT GCTCGAGGCC GACGACGACG AGACCGCTGC GGCTTTCCGC GCGACACCGC TCCCGGCCCG CTTCGAGATC GTCACCGTGC CGGAGGGGAT GCCGCGCACC AAGCCGCGGG CGCTGAACGT GGCCCTGCCA CTCGCCCGCG GCGAGCACCT CGTGGTCTAC GACGCGGAGG ATGTGATCTC GCCTGAACAG CTTCGCCTCG CCGCGACGCT GTTCGCCCGC GCGCCGGACT CGACCGCCTG CCTCCAGGGT CGCCTCGTGA TCGACAATCA CGGCGATGGC TGGCTGCCGC GCCTGTTCGC CATCGAGTAC GCGGCTTTGT TCGATGTGCT CGGCCCGGCC CTGGCCGCGT GGCGGATGCC GACGCCGCTC GGCGGCACCT CGACGCATTT CCGCACCCGC GTCCTGCGCC AGCTCCACGG ATGGGACGCC TGGAACGTGA CCGAGGATGC CGATCTCGGC CTGCGCATGG CACTCGCGGG CTACCATGTC GGCGACCTGC CGAGCGCGAC CTTCGAGGAG GCGCTCGCCC ATCCGCGCAA GTGGCTGCGC CAGCGCACGC GCTGGATGAA GGGCTTCCTG CAGACGAGCT TCACCCACGG GCGGCGCCCG CTCGACCTCT ATCGCCGCCT CGGCGCGGCC GAGAGCCTGT GCGTGCTGGC GCTTCTGCCC GGCACGGTGG TCTCGGCCCT GTTCTATCCG TTCATGCTGC TCGGCGGCCT CGCCGACCTG ATCATCCCGG CCGAGGACGG CGACAGCCTC GCCATCGTCA AGCGGGCGGC CTCCACGACG GTGTTCCTCG GCGGGCTTGC GGCGATGTGC CTGCCCGGCC TCGTCGGCTG CCTCCGGCGC GGGTGGTTGG ATCTCGTGCC GCTCGTGCTG CTGCTGCCGG TCTACTTCCT TCTCGGCAGT CTCGCGGCGT GGCTCGCCGT GTTCGAGCTG GCGCGGCACC CGCACCGTTG GAACAAGACC GAGCACGGGC TGGCCCGCAC CTCCCGCACC GGAGCGCTCG CGCGGCGCCC CTCGGCGGGC ATCAGATCCG CTTCGACAGC TCCGCTGCCG TCTCCGCTGC CGGCCGGTTG A
|
Protein sequence | MPPRPFSTRG FSPRRSSTVR SPAGSALHFS LGLSRYAKAF RTCAVSSPVP RRSRAARGAK TLVAAPRGAA VARLIAAADR LAEPPAITTP TALRQALFAR YGAAIAEDAS ETLGRLRPEW SCRPGPLAVD LALAGGVLAL VVLLARLPTA AGFVLLLLVQ GLMLALLTFR LAAAAIGAAE IVRADASEPR PVLLTDDALP TYTILVALYR EAPVVPRLLG ALRRLDYPAA KLDIKFLLEA DDDETAAAFR ATPLPARFEI VTVPEGMPRT KPRALNVALP LARGEHLVVY DAEDVISPEQ LRLAATLFAR APDSTACLQG RLVIDNHGDG WLPRLFAIEY AALFDVLGPA LAAWRMPTPL GGTSTHFRTR VLRQLHGWDA WNVTEDADLG LRMALAGYHV GDLPSATFEE ALAHPRKWLR QRTRWMKGFL QTSFTHGRRP LDLYRRLGAA ESLCVLALLP GTVVSALFYP FMLLGGLADL IIPAEDGDSL AIVKRAASTT VFLGGLAAMC LPGLVGCLRR GWLDLVPLVL LLPVYFLLGS LAAWLAVFEL ARHPHRWNKT EHGLARTSRT GALARRPSAG IRSASTAPLP SPLPAG
|
| |