Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1164 |
Symbol | |
ID | 5833952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 1280056 |
End bp | 1282014 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641366957 |
Product | cellulose synthase (UDP-forming) |
Protein accession | YP_001638637 |
Protein GI | 163850594 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.614869 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGTTTC CCTCGCCCGG CGACGCGGCC TCCGTGCTCG CGCTCTCGTT CGGCATCGCG CTCGGGCTTT TCACGCTCGC GGGCCTGTTG CGGCCGGAGC GCGCCTTCGA TCGCCTTCTG TTCGGAGCCC TGACGGCTGC GCTCATCGCC ACCTACGCGC GCTGGCGCTG GAGCGACACA CTGCCGCCGC TGACCCCCGA GGCCGGTGCC CTCTGGTCCT ACCTGTTCTT CGCCGCTGAG ATGGTGGCGG TGGTCTATAC CCTGCTCTCG GTGATCATCC TGCTGCGCTT CAAGGACCGG TCAAGGGAGG CCGACGCGGC CCAAGCGCGC CGCGAGGCGA GTGGCGAACG GCCGGCCGTC GACATCTTCA TCTGCACCTA CAACGAGCCG CGCGAGGTTG TGGAGAAGTC GATCCTGCCA TCGCTCGCCA TCGATTACGA ACCCAAGACC GTCTGGGTCT GCGACGACAC CCGCCGCGAC TGGCTGCGCG ACTATTGCGA GGAGGTCGGC GCCCGCTACA TCACGCGCCC GGACAACAAG GGGGCCAAGG CCGGCAACCT CAACAACGCC CTGCGTCACA CCGCTGAGCG GACCGACGCC CCCCTGATTC TCGTGCTCGA TGCGGATTTC GCGCCGCAGC CGAACATCCT CAAGCGCATG GTCGGCCTGT TCGATGATCC GAAGACGGGC GTGGTGCAGT CGCCGCAATT CTTCTTCAAT GCCGATCCGA TCCAGCACAA TCTCGCCGCC TCCGACAGCT GGGTCGATGA CCAGCGCATC TTCTTCGACG TGTTCCAGCC GGCAAAGGAC GCCTGGAACG CGGCCTTCTG TGTCGGCACC TCCTTCATCG TGCGCCGCGA CCGGCTCGGC GAGATCGGTG GCTTCCCGGA TGCCGCGATC TGCGAGGATC TCAACCTGTC GCTCGGCATG TCGCGCCGAG GCTACGAGAC CCACTGGCTG AACGAGCGGC TGAGCATGGG CCTCTCGGCT GAGGGCCTCC CGGAATACAT CACGCAGCGT ACCCGCTGGT GTCTCGGAAC GATTCAGATC GCGTTGCTCG CCGACGGGCC GCTTCGCGGG CCGGGCTATA CCCTGGTCCA GCGGATCCAC TTCCTGCATG GCGTGCTGAA CTGGGCCTGC AAACCCTACA TCGTGCTGAT GCTGCTGGCC CCGGCGGTCT ACTGGATCGC AGGGCTGCCG GCCTTCGAGG CCGACGTGCT GTCCTTCCTG CGATACGGTG CGCCCCCGCT CTTCGCTCTG TGGGCCTATA GCGGCTGGGT TTCGCGTTCA CGCACGCTGC CGATCTTCAT GGAGGTGACC CACGCGATCA GTGCGCTCGC CGTCACCATG ACGCTCATCC AGGCGGCCGT CCGACCGTTC GGCCGTCCGT TCAAAGTCAC GGAAAAGGGC GGTGATCGCT CCCAGATGCG CGTCCGCTGG CGCATGGCCT CGGCCTTCGG CGGCCTCTCC CTGCTTTCGG CCTTCAGCAT CGTCTGGGCC TTCATCGCCC CGACCGCGCC GGCCGAGATC TCGGATATCG ACTACTTCAA CCTCGTCTGG GCCGGAGTGG CGATGGTGCT GACCTTCATC TGCTTCCTCG TCTGTTTCGA ATACCCGCGC GTCGATCTCG CATTCCGCTA CGACGCCGAC GCCCGGATCG AAGCCGGCGG GACCAGCCAC GCCTGCCGCA TCGCGACCCT TTCGCCCGGC CGGGCGACCC TCGCCGAGGC GGGGGAGCCG GTCTCTGCGC TGGGTGCGCC GCTGATCCTG CACCTTCCGG GCATCGGCGC GATCGACGCG GTTGCGGACC CCGCCGGCCT TTCACTCGAT CCGACGCCGG AGCAGTACCG GGCTCTGGTC GTGGCCCTCT ACTCCACCCC GCGCGACACC ATTGCCCGTG CCGCCCGCTT CACACCGGCC GTCGGTGGCC TGCTCCGCCG GAGCCTGGGC CTCGGCTGA
|
Protein sequence | MMFPSPGDAA SVLALSFGIA LGLFTLAGLL RPERAFDRLL FGALTAALIA TYARWRWSDT LPPLTPEAGA LWSYLFFAAE MVAVVYTLLS VIILLRFKDR SREADAAQAR REASGERPAV DIFICTYNEP REVVEKSILP SLAIDYEPKT VWVCDDTRRD WLRDYCEEVG ARYITRPDNK GAKAGNLNNA LRHTAERTDA PLILVLDADF APQPNILKRM VGLFDDPKTG VVQSPQFFFN ADPIQHNLAA SDSWVDDQRI FFDVFQPAKD AWNAAFCVGT SFIVRRDRLG EIGGFPDAAI CEDLNLSLGM SRRGYETHWL NERLSMGLSA EGLPEYITQR TRWCLGTIQI ALLADGPLRG PGYTLVQRIH FLHGVLNWAC KPYIVLMLLA PAVYWIAGLP AFEADVLSFL RYGAPPLFAL WAYSGWVSRS RTLPIFMEVT HAISALAVTM TLIQAAVRPF GRPFKVTEKG GDRSQMRVRW RMASAFGGLS LLSAFSIVWA FIAPTAPAEI SDIDYFNLVW AGVAMVLTFI CFLVCFEYPR VDLAFRYDAD ARIEAGGTSH ACRIATLSPG RATLAEAGEP VSALGAPLIL HLPGIGAIDA VADPAGLSLD PTPEQYRALV VALYSTPRDT IARAARFTPA VGGLLRRSLG LG
|
| |