Gene Mext_1164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1164 
Symbol 
ID5833952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1280056 
End bp1282014 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content67% 
IMG OID641366957 
Productcellulose synthase (UDP-forming) 
Protein accessionYP_001638637 
Protein GI163850594 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.614869 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTTTC CCTCGCCCGG CGACGCGGCC TCCGTGCTCG CGCTCTCGTT CGGCATCGCG 
CTCGGGCTTT TCACGCTCGC GGGCCTGTTG CGGCCGGAGC GCGCCTTCGA TCGCCTTCTG
TTCGGAGCCC TGACGGCTGC GCTCATCGCC ACCTACGCGC GCTGGCGCTG GAGCGACACA
CTGCCGCCGC TGACCCCCGA GGCCGGTGCC CTCTGGTCCT ACCTGTTCTT CGCCGCTGAG
ATGGTGGCGG TGGTCTATAC CCTGCTCTCG GTGATCATCC TGCTGCGCTT CAAGGACCGG
TCAAGGGAGG CCGACGCGGC CCAAGCGCGC CGCGAGGCGA GTGGCGAACG GCCGGCCGTC
GACATCTTCA TCTGCACCTA CAACGAGCCG CGCGAGGTTG TGGAGAAGTC GATCCTGCCA
TCGCTCGCCA TCGATTACGA ACCCAAGACC GTCTGGGTCT GCGACGACAC CCGCCGCGAC
TGGCTGCGCG ACTATTGCGA GGAGGTCGGC GCCCGCTACA TCACGCGCCC GGACAACAAG
GGGGCCAAGG CCGGCAACCT CAACAACGCC CTGCGTCACA CCGCTGAGCG GACCGACGCC
CCCCTGATTC TCGTGCTCGA TGCGGATTTC GCGCCGCAGC CGAACATCCT CAAGCGCATG
GTCGGCCTGT TCGATGATCC GAAGACGGGC GTGGTGCAGT CGCCGCAATT CTTCTTCAAT
GCCGATCCGA TCCAGCACAA TCTCGCCGCC TCCGACAGCT GGGTCGATGA CCAGCGCATC
TTCTTCGACG TGTTCCAGCC GGCAAAGGAC GCCTGGAACG CGGCCTTCTG TGTCGGCACC
TCCTTCATCG TGCGCCGCGA CCGGCTCGGC GAGATCGGTG GCTTCCCGGA TGCCGCGATC
TGCGAGGATC TCAACCTGTC GCTCGGCATG TCGCGCCGAG GCTACGAGAC CCACTGGCTG
AACGAGCGGC TGAGCATGGG CCTCTCGGCT GAGGGCCTCC CGGAATACAT CACGCAGCGT
ACCCGCTGGT GTCTCGGAAC GATTCAGATC GCGTTGCTCG CCGACGGGCC GCTTCGCGGG
CCGGGCTATA CCCTGGTCCA GCGGATCCAC TTCCTGCATG GCGTGCTGAA CTGGGCCTGC
AAACCCTACA TCGTGCTGAT GCTGCTGGCC CCGGCGGTCT ACTGGATCGC AGGGCTGCCG
GCCTTCGAGG CCGACGTGCT GTCCTTCCTG CGATACGGTG CGCCCCCGCT CTTCGCTCTG
TGGGCCTATA GCGGCTGGGT TTCGCGTTCA CGCACGCTGC CGATCTTCAT GGAGGTGACC
CACGCGATCA GTGCGCTCGC CGTCACCATG ACGCTCATCC AGGCGGCCGT CCGACCGTTC
GGCCGTCCGT TCAAAGTCAC GGAAAAGGGC GGTGATCGCT CCCAGATGCG CGTCCGCTGG
CGCATGGCCT CGGCCTTCGG CGGCCTCTCC CTGCTTTCGG CCTTCAGCAT CGTCTGGGCC
TTCATCGCCC CGACCGCGCC GGCCGAGATC TCGGATATCG ACTACTTCAA CCTCGTCTGG
GCCGGAGTGG CGATGGTGCT GACCTTCATC TGCTTCCTCG TCTGTTTCGA ATACCCGCGC
GTCGATCTCG CATTCCGCTA CGACGCCGAC GCCCGGATCG AAGCCGGCGG GACCAGCCAC
GCCTGCCGCA TCGCGACCCT TTCGCCCGGC CGGGCGACCC TCGCCGAGGC GGGGGAGCCG
GTCTCTGCGC TGGGTGCGCC GCTGATCCTG CACCTTCCGG GCATCGGCGC GATCGACGCG
GTTGCGGACC CCGCCGGCCT TTCACTCGAT CCGACGCCGG AGCAGTACCG GGCTCTGGTC
GTGGCCCTCT ACTCCACCCC GCGCGACACC ATTGCCCGTG CCGCCCGCTT CACACCGGCC
GTCGGTGGCC TGCTCCGCCG GAGCCTGGGC CTCGGCTGA
 
Protein sequence
MMFPSPGDAA SVLALSFGIA LGLFTLAGLL RPERAFDRLL FGALTAALIA TYARWRWSDT 
LPPLTPEAGA LWSYLFFAAE MVAVVYTLLS VIILLRFKDR SREADAAQAR REASGERPAV
DIFICTYNEP REVVEKSILP SLAIDYEPKT VWVCDDTRRD WLRDYCEEVG ARYITRPDNK
GAKAGNLNNA LRHTAERTDA PLILVLDADF APQPNILKRM VGLFDDPKTG VVQSPQFFFN
ADPIQHNLAA SDSWVDDQRI FFDVFQPAKD AWNAAFCVGT SFIVRRDRLG EIGGFPDAAI
CEDLNLSLGM SRRGYETHWL NERLSMGLSA EGLPEYITQR TRWCLGTIQI ALLADGPLRG
PGYTLVQRIH FLHGVLNWAC KPYIVLMLLA PAVYWIAGLP AFEADVLSFL RYGAPPLFAL
WAYSGWVSRS RTLPIFMEVT HAISALAVTM TLIQAAVRPF GRPFKVTEKG GDRSQMRVRW
RMASAFGGLS LLSAFSIVWA FIAPTAPAEI SDIDYFNLVW AGVAMVLTFI CFLVCFEYPR
VDLAFRYDAD ARIEAGGTSH ACRIATLSPG RATLAEAGEP VSALGAPLIL HLPGIGAIDA
VADPAGLSLD PTPEQYRALV VALYSTPRDT IARAARFTPA VGGLLRRSLG LG