Gene Mext_1133 details


Gene Information

Locus tag: Mext_1133
Symbol:
ID: 5834408
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 1238255
End bp: 1240273
Gene Length: 2019 bp
Protein Length: 672 aa
Translation table: 11
GC content: 64%
IMG OID: 641366928
Product: cellulose synthase (UDP-forming)
Protein accession: YP_001638608
Protein GI: 163850565
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
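
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields.
start_bp = 1238255
end_bp = 1240273
protein_length_aa = 672

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that encodes no residue.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(gene_length_bp)                            # 2019
print(expected_protein_aa)                       # 672
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the listed gene length (2019 bp) and protein length (672 aa) agree with the coordinates.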


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0362124
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.973174
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTTCAGC ATCTCCTAGA GCTCAACCTC GCCCCACTCG CGCCGACCTG CCTCGTCGGG 
GCTTTCTTTT ATCTTTTCGT GACGAACTGG CCGCGGCAGA AGACCTGGGC GCGGAGCATC
GCCTGCGCCT TCGTTCTCGC GGTGGCCGTG CGCTACCTCC TCTGGCGCTT CTTTCACACG
GTGCTGCCGC ATCCTGTGGA CGGCAGCTTC GCCCTCATCT GGACCTGGGC CGTCTTCTTC
GTGGAACTCG GCGCCTTCGC CGACATCCTG CTCTTCCTTG TGGTGATGAG CCGCTCGGTC
GACCGAAGCG CCGAGGCCAG CCGGTTGGAG CGAGCCTTCT TCGCACGCCC GGAGGCGGAG
TTGCCGACCG TCGATGTCTT CATCCCAACC TACAACGAGC CGCTCGACGT GCTGGAGCGG
ACCATCGTCG GGGCTCTCGC GCTCGATTAC CCGCGGGACA AGTTCAAGGT CTATGTCCTC
GACGACAAGA AGCGCGACTG GTTGAAGGCG TATTGTGAGG AGAAGGGCGC GATCCACGTC
ACCCGGCCGG ACAATTCCCA CGCCAAGGCC GGCAACATGA ACAACGGCCT CAAGGTCTCC
TCAGGCGACT TCATTGCGAT CTTCGACGCC GATTTCGTGC CCTACCGCAG TTTTCTGCGC
CGCACGCTGC CCTTCTTCAC GGACCCGACG ATTGGCATCG TTCAGACCCC GCAACATTTC
TTCAACAAGG ATCCGGTCCA GTCGAACCTC TCGCTCGAAA AGGTCTGGCC CGACGAGCAG
AGATTGTTCT TCGATGAGAT GGCCGCGAGC CGCGATGCCT GGGACGTGAG CTTCTGCTGC
GGTTCCTGCT CGATCGCCCG CCGCGCGGCC CTCGATGTAA TCGGCGGCTT CCCGCACGAC
TCGATCACCG AGGATCTCCT CACGACCCTC GCGATGCTCA ACAAGGGATA CAAGACCCGC
TACCTGAACG AGCGTCTGTC GATGGGTCTC GCGGCGGAGA ACCTGAAGGG ATACTTCGTT
CAGCGGGGCC GCTGGTGCCG CGGCGGCATC CAGACGATCT ACCTGCATAA CGGCCCGCTG
CGGGGGGCGG GCTTGAACCT GTTCCAGCGG GTGATGTTCC TGCCCCTGTC GTGGCTGATG
CAGTACACGA CGCGGTTCGT GATCCTCGTG GTACCGGCCG TCTACCTCTG GACCGGCGCA
GCACCGCTGT ACTTCACCGG CTCGCAGGAC ATCGTCTATT ACCAGTTGCC CGTGCTCACG
GCCTACTTCC TGCTGATGGG CTGGTTGACG CCGACCCGCT ACTTGCCACT GGTGTCGAGC
GCGGTGGGCA CCTTCGCGAC GTTCCGCATG CTGCCCGTTG TGGTCTCCAG CGTGATCAAG
CCGTTCGGCG TGCCGTTTCG CGTGACGCCG AAGGGCAGCG GCAACGAGGA GAACGCCTTC
GACGCCTACA CCTTCTTCTC GATCGCCTTC TGGATCGCCG TCACCGCGCT CGGCCTCGTC
ATCAACATCG TGCCGGAATG GTCGCGCATC GGCGAGGGTG AGTTCTCGCT CGTCTCGGCC
TACTGGGCCG CCCTCAACAT CCTCGTCCTC TTGGTCGCGG CCCTGATCTG CTTCGAGAAA
TCGCGTCCGC TCCTTGACAG TTTCGCAACA GACGAGCCGG CCCGCATCGT CGCAGCGGAC
CGCGCCTTGG ACGCGCGGAT CGTCAATCTG TCGCTCGATC GAGGGATCGC ATCCTTCCCG
GCCGATCCCG AGCTATGCCC CGGCGACCAG ATCTGGATCG AGATGGAGCG CTTCCCGCAC
CTTGAAGCCA CGGTCGAAGG CGTGACGCCG GGCAGGCGTC GGAGCCCCGC TTGCGTCCGA
TTTTCCTACA ATCTCGAGGG CGCCTGCCGC GACGTGATGA TTGTCCGCCT CTACACCGGC
CAGTACTCGC AGGACATCCG TGACATCGAC AAGTCGGCCG TCGTCGGGGG CCTCTGGAGC
CGCCTGTTCG GACGAGGTAG CACCTATGGC CCGGCTTGA
 
Protein sequence
MVQHLLELNL APLAPTCLVG AFFYLFVTNW PRQKTWARSI ACAFVLAVAV RYLLWRFFHT 
VLPHPVDGSF ALIWTWAVFF VELGAFADIL LFLVVMSRSV DRSAEASRLE RAFFARPEAE
LPTVDVFIPT YNEPLDVLER TIVGALALDY PRDKFKVYVL DDKKRDWLKA YCEEKGAIHV
TRPDNSHAKA GNMNNGLKVS SGDFIAIFDA DFVPYRSFLR RTLPFFTDPT IGIVQTPQHF
FNKDPVQSNL SLEKVWPDEQ RLFFDEMAAS RDAWDVSFCC GSCSIARRAA LDVIGGFPHD
SITEDLLTTL AMLNKGYKTR YLNERLSMGL AAENLKGYFV QRGRWCRGGI QTIYLHNGPL
RGAGLNLFQR VMFLPLSWLM QYTTRFVILV VPAVYLWTGA APLYFTGSQD IVYYQLPVLT
AYFLLMGWLT PTRYLPLVSS AVGTFATFRM LPVVVSSVIK PFGVPFRVTP KGSGNEENAF
DAYTFFSIAF WIAVTALGLV INIVPEWSRI GEGEFSLVSA YWAALNILVL LVAALICFEK
SRPLLDSFAT DEPARIVAAD RALDARIVNL SLDRGIASFP ADPELCPGDQ IWIEMERFPH
LEATVEGVTP GRRRSPACVR FSYNLEGACR DVMIVRLYTG QYSQDIRDID KSAVVGGLWS
RLFGRGSTYG PA
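
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It assumes that translation table 11 uses the standard amino-acid assignments, and that an initiator codon at position 0 is read as methionine, which is why the record's GTG start appears as M. The helper names (`clean`, `translate_cds`, `gc_percent`) and the reduced start-codon set are hypothetical choices made for this example.

```python
from itertools import product

# Standard amino-acid assignments, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a small subset of bacterial initiator codons is enough here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(dna):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiator codon at the start is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence listed above.
first_line = "GTGGTTCAGC ATCTCCTAGA GCTCAACCTC GCCCCACTCG CGCCGACCTG CCTCGTCGGG"
print(translate_cds(first_line))  # MVQHLLELNLAPLAPTCLVG
```

The 20 residues printed for the first sequence line match the start of the listed protein sequence. Passing the full gene sequence to `translate_cds` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.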