Gene Mext_1367 details


Gene Information

Locus tag: Mext_1367
Symbol:
ID: 5831337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 1520215
End bp: 1522710
Gene Length: 2496 bp
Protein Length: 831 aa
Translation table: 11
GC content: 68%
IMG OID: 641367160
Product: cellulose synthase catalytic subunit (UDP-forming)
Protein accession: YP_001638839
Protein GI: 163850796
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID: [TIGR03030] cellulose synthase catalytic subunit (UDP-forming)
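
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is 3 bp; the terminal stop codon is not translated.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1520215, 1522710)
print(length_bp, protein_length(length_bp))
```

With the coordinates of this record, the two values agree with the listed gene length (2496 bp) and protein length (831 aa).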


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.393143
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.0964081
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATACGTG CGCTGCGGTG GCTGGCCTGG ATGGGGACGA CGGTTGCCGG CCTCATCCTG 
CTGAGCCAAC CGGTCGGGAC GCAGAACCAG CTCGCCATGA GTCTTGCCGC CATGGCGGCG
ATGATCGTGC TGTGGCTGTT CCTCGACGGG CCGCGCACCC GCTTCGTGTT CCTGGCGCTC
GGGAGCCTCG TGGTGCTCCG CTACATCCTG TGGCGGGTCA CCGATACGCT GCCCTCCCCC
GGCGATCCGG TCAGCTTCGG CTTCGGCCTG CTGCTGCTCG TGGGCGAATT GTACTGCGTC
TTCATCCTGT TCGTGAGCCT GATCATCAAC GCCGACCCGC TCAAGCGGGC CCCGCCCCCT
GTCGCGCGCG CGGCCGAGCT GCCGACGGTC GACGTGTTCG TGCCGAGCTA CAACGAGGAC
GCCGCCATCC TGGCGATGAC GCTGGCAGCC GCGCGCCAGA TGAATTACCC GCCCGACAAG
CTTAACGTCT GGCTTCTCGA CGATGGCGGG TCGGACCAGA AATGCGCCGA CCCCAACCCG
GAGAAGGCCA AGGCTGCCCG CGACCGGCGG CGGGAGCTGA CGGTGCTGGC CGAGGAACTC
GGCTGCCGCT ACCTCACCCG CGCCCGCAAC GAGCACGCCA AGGCGGGCAA CCTCAACAAC
GGGCTGGCCT TCGCCAGCGG TGAGATCGTC GTCGTGCTCG ATGCCGACCA CGTCCCGTTC
CGCTCGTTCC TGAGCGAGAC GGTCGGCTAC TTCGCCGAAG ACCCGAAGCT GTTCCTCGTC
CAGACCCCGC ACGCCTTCCT CAACCCCGAT CCGATCGAGC GGAACCTGAA GACCTTCGAG
CGGATGCCGT CGGAGAACGA GATGTTCTAC GCGGTGACGC AGCGCGGGCT CGACAAGTGG
AACGGCTCGT TCTTCTGCGG CTCCGCCGCC TTGCTGCGAC GCACTGCCCT GGACGAGGCC
GGGGGATTCT CCGGCATCAC CATCACCGAG GATTGCGAGA CCGCGTTCGA GCTGCATTCC
CGCGGCTGGA CCAGCGCCTA TGTCGACAAG CCCCTGATCG CCGGTCTCCA GCCCGAGACG
CTCTCGGCCT TCATCGGCCA GCGCTCGCGC TGGTGCCAGG GCATGTTCCA GATCCTGCTC
CTGAAGAACC CCGCTTTGCA GAAGGGGCTC AAGCCGATCC AGAAGATCGC CTACCTCTCG
AGCATGACGT TCTGGTTCTT CCCCGTGCCG CGGCTGATCT TCATGTTCGC GCCGCTCTTG
CACATTTTCT TCGATCTGAA GATCTTCGTT GCCAGCGTCG ATGAATCGAT TGCCTATACG
GCGACCTACA TCGTCATCAA CCTGATGATG CAGAACTACG TCTACGGCAA GTTCCGCTGG
CCCTTCGTCT CGGAACTCTA CGAATACGTT CAGGGCCTCT ATCTGTCGAA GGCCATCGTC
TCGGTGATCT GGTCGCCGCG AAAGCCGACC TTCAACGTCA CCGACAAGGG CATCAGCCTC
GACCACGACC ATCTCTCGTC GGCCTCGCTC CCGTTCTTCG CCGTCTATGG GTTGCTGGCG
GCGGGCTGCG CGGTGGCGAC GTGGCGCTAC CTGTTCGAGC CGGGCGTGAC CAACCTGATG
CTGGTGGTCG GGCTGTGGAA CTTCTTCAAC CTGCTGACCG CGGGCGCGGC GCTCGGGGTC
TGCGCCGAGC GCCGACAGCT GGAGCGGACG CCCTCGCTCG CCATCAACCG GCGCGGCCAG
ATCACCTTGG GCGGCCGGGC CATCGACGTG TCGATCGAGC GCGTCTCGGC CGAGGCCTGC
ACGGTGCGCC TGCCCGCCGC CCTGCTGCCG ACGGGGGGTG GACACCGCAA GCTCACCGGC
GCGCTCACCG TCGTCCCGGT GGCGGGCGCG CGCCCGGCCG GCGCCCTGCC GGTGACCCTG
GAGCGGATCG ACCGCACCAA GGACGAGGCC TTCGCCCGCC TGAGCTTCGG CCGCCTGCGC
CCGCAGGATT ACGTGGCGCT CGCCGGCCTG ATGTACGGCG ACGCGGAGGC GATGCGGCGC
TTCCAGATGC GCCGCCGCCG CCACAAGGAC ATCCTGACCG GCACGCTGCA ATTCATCTGG
TGGGGTCTGT CCGAGCCGTT CCGCGCCGTG CGCTACGCCT TCGCCGCCGA TGCGCGACCG
GCGGTGGCGC CGGTGATCGA CGGCACGTCG CCGATCTACG ATGCGCCGGA ACAGGCGCCG
ACCGGGGCGA ACCTGCCCGA GCCGCTGCTT GCGCCAGCGG TGCCGGCTCA CGCGCTCCAG
GCCGCCCCGC AGGCGCCGAG GGCGCCTGTG CCGGCCGAGG CGCGGAACCC GGTCCCGACG
GCTCAGGCCC CGGCCAACTC CTCTGCCAAC CCTTCGGCCA ACCCCTCTGC CAACCCGCCG
GCCCGCGCCT CGAACGATTG GGTGCGCATG ATGCTCGATT ACGAGAACGA GCGCGCGCTC
GCCGGTCGGA CCGGCCGCGG CACCGGCACG GCATGA
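
The GC content field in this record is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as text with the spacing and line breaks shown above. `gc_content` is a hypothetical helper, not a function of the source database.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy inputs; paste the full gene sequence to compare against the record.
print(gc_content("ATGC"))
print(gc_content("GGCC ggcc"))
```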
 
Protein sequence
MIRALRWLAW MGTTVAGLIL LSQPVGTQNQ LAMSLAAMAA MIVLWLFLDG PRTRFVFLAL 
GSLVVLRYIL WRVTDTLPSP GDPVSFGFGL LLLVGELYCV FILFVSLIIN ADPLKRAPPP
VARAAELPTV DVFVPSYNED AAILAMTLAA ARQMNYPPDK LNVWLLDDGG SDQKCADPNP
EKAKAARDRR RELTVLAEEL GCRYLTRARN EHAKAGNLNN GLAFASGEIV VVLDADHVPF
RSFLSETVGY FAEDPKLFLV QTPHAFLNPD PIERNLKTFE RMPSENEMFY AVTQRGLDKW
NGSFFCGSAA LLRRTALDEA GGFSGITITE DCETAFELHS RGWTSAYVDK PLIAGLQPET
LSAFIGQRSR WCQGMFQILL LKNPALQKGL KPIQKIAYLS SMTFWFFPVP RLIFMFAPLL
HIFFDLKIFV ASVDESIAYT ATYIVINLMM QNYVYGKFRW PFVSELYEYV QGLYLSKAIV
SVIWSPRKPT FNVTDKGISL DHDHLSSASL PFFAVYGLLA AGCAVATWRY LFEPGVTNLM
LVVGLWNFFN LLTAGAALGV CAERRQLERT PSLAINRRGQ ITLGGRAIDV SIERVSAEAC
TVRLPAALLP TGGGHRKLTG ALTVVPVAGA RPAGALPVTL ERIDRTKDEA FARLSFGRLR
PQDYVALAGL MYGDAEAMRR FQMRRRRHKD ILTGTLQFIW WGLSEPFRAV RYAFAADARP
AVAPVIDGTS PIYDAPEQAP TGANLPEPLL APAVPAHALQ AAPQAPRAPV PAEARNPVPT
AQAPANSSAN PSANPSANPP ARASNDWVRM MLDYENERAL AGRTGRGTGT A
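
The protein sequence begins with M although the gene sequence begins with GTG. This fits translation table 11, in which GTG can serve as a start codon and is then read as methionine. The sketch below illustrates the idea on the first and last lines of the gene sequence. It assumes the standard amino acid assignments, which table 11 shares, and the table 11 start codon set as listed by NCBI. `translate` is an illustrative helper, not the database's own method.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Assumed start codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; an initial start codon is read as M."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        if i == 0 and codon in START_CODONS:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


first_line = "GTGATACGTG CGCTGCGGTG GCTGGCCTGG ATGGGGACGA CGGTTGCCGG CCTCATCCTG"
last_line = "GCCGGTCGGA CCGGCCGCGG CACCGGCACG GCATGA"
print(translate(first_line))  # MIRALRWLAWMGTTVAGLIL
print(translate(last_line))   # AGRTGRGTGTA
```

Both outputs match the corresponding ends of the protein sequence above. The last line can be translated on its own because every preceding line holds 60 bases, so it starts in frame.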