Gene Mchl_1324 details

Gene Information

Locus tag: Mchl_1324
Symbol:
ID: 7115587
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 1359067
End bp: 1361025
Gene Length: 1959 bp
Protein Length: 652 aa
Translation table: 11
GC content: 67%
IMG OID: 643524101
Product: Cellulose synthase (UDP-forming)
Protein accession: YP_002420136
Protein GI: 218529320
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
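
The coordinate and length fields in this record are mutually consistent, which a short sketch can check. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed for Mchl_1324.
length = gene_length_bp(1359067, 1361025)
print(length, protein_length_aa(length))
```

With the values from this record the sketch reports 1959 bp and 652 aa, matching the listed Gene Length and Protein Length.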


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.531441
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.84698
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGTTTC CCTCGCCCGG CGACGCGGCC TCCGTGCTCG CGCTCTCGTT CGGCATCGCG 
CTCGGGCTTT TCGCTCTCGC GGGCCTGTTG CGGCCGGAGC GTGCCTTCGA TCGCCTTCTG
TTCGGAGCGC TGACGGCCGC GCTCATCGCC ACCTACGCGC GCTGGCGCTG GAGCGACACG
CTGCCGCCGC TGACCCCCGA GGCCGGTGCC GTCTGGTCCT ACCTGTTCTT CGCCGCCGAG
ATGGCGGCGG TGGTCTATAC CCTGCTCTCG GTGATCATCC TGCTGCGCTT CAAGGACCGG
TCAAGGGAGG CCGACGCGGC CCAGGCTCGC CGCGAGGCGA GTGGCGAATG GCCGGCCGTC
GACATCTTCA TCTGCACCTA CAACGAGCCG CGCGAGGTTG TGGAGAAGTC GATCCTGCCA
TCGCTCGCCA TCGATTACGA ACCCAAGACC GTCTGGGTCT GCGACGACAC CCGCCGCGAC
TGGCTGCGCG ACTATTGCGA GGAGGTCGGC GCCCGCTACA TCACGCGCCC GGACAACAAG
GGGGCCAAGG CCGGCAACCT CAACAACGCC CTGCGTCACA CCGCTGAGCG GACCGACGCC
CCCCTGATTC TCGTGCTCGA TGCGGATTTC GCGCCGCAGC CGAACATCCT CAAGCGCATG
GTCGGCCTGT TCGATGATCC GAAGACGGGC GTGGTGCAGT CGCCGCAATT CTTCTTCAAT
GCCGATCCGA TCCAGCACAA TCTCGCCGCC TCCGACAGCT GGGTCGATGA CCAGCGCATC
TTCTTCGACG TGTTCCAGCC GGCAAAGGAC GCCTGGAACG CGGCCTTCTG TGTCGGTACC
TCGTTCATCG TGCGCCGCGA CCGGCTCGGC GAGATCGGTG GCTTCCCGGA TGCCGCGATC
TGCGAGGATC TCAATCTGTC GCTCGGCATG TCACGCCGAG GCTACGAGAC TCACTGGCTG
AACGAGCGGC TAAGCATGGG CCTCTCGGCT GAGGGCCTCC CGGAATACAT CACGCAGCGT
ACCCGCTGGT GTCTCGGGAC GATTCAGATC GCGTTGCTCG CCGACGGGCC GCTTCGCGGG
CCGGGCTATA CCCTGGTCCA GCGGATCCAC TTCCTGCATG GCGTGCTGAA CTGGGCCTGC
AAACCCTACA TCGTGCTGAT GCTGCTGGCC CCGGCGGTCT ACTGGATCGC CGGACTGCCG
GCCTTCGAGG CCGACGTGCT GTCCTTCCTG CGATACGGTG CGCCCCCGCT CTTCGCTCTG
TGGGCCTATA GCGGGTGGGT TTCGCGTTCG CGCACGCTGC CGATCTTCAT GGAGGTGACA
CACGCGATCA GTGCGCTCGC CGTCACCATG ACGCTCATCC AGGCGGCCGT CCGACCCTTC
GGCCGTCCGT TCAAAGTCAC GGAAAAGGGC GGTGATCGCT CCCAGATGCG CGTCCGCTGG
CGCATGGCCT CGGCCTTCGG CGGCCTCTCC CTGCTTTCGG CCTTCAGCAT CGTCTGGGCC
TTCATCGCCC CGACTGCGCC GGCCGAGATC TCGGATATCG ACTACTTCAA CCTCGTCTGG
GCCGGAGTGG CCATGGTGCT GACCTTCATC TGCTTCCTCG TCTGTTTCGA ATACCCGCGC
GTCGATCTCG CATTCCGCTA CGACGCCGAC GCCCGGATCG AAGCCGGCGG GACCAGTCAC
GCCTGCCGCA TCGCGACCCT TTCGCCCGGC CGGGCGACCC TCGCTGAGGC GGGGGAGCCG
GTCTCCGCGC TGGGTGCGCC GCTGATCCTG CACCTTCCGG GCATCGGCGC GATCGACGCG
GTTGCGGACC CCGCCGGCCT TTCGCTCGAT CCGACCCCGG AGCAGTACCG GGCTCTGGTC
GTGGCCCTCT ACTCCACCCC GCGCGACACC ATCGCCCGTG CCGCCCGGTT CACACCGGCC
GTCGGTGGCC TGCTTCGCCG GAGCCTGGGC CTCGGCTGA
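
The gene sequence is laid out in blocks of 10 bases, so it needs to be joined before any computation such as the GC content reported in the record. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example uses only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, used as a small example.
example = clean_sequence("ATGATGTTTC CCTCGCCCGG")
print(len(example), gc_fraction(example))
```

Passing the full blocked sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field.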
 
Protein sequence
MMFPSPGDAA SVLALSFGIA LGLFALAGLL RPERAFDRLL FGALTAALIA TYARWRWSDT 
LPPLTPEAGA VWSYLFFAAE MAAVVYTLLS VIILLRFKDR SREADAAQAR REASGEWPAV
DIFICTYNEP REVVEKSILP SLAIDYEPKT VWVCDDTRRD WLRDYCEEVG ARYITRPDNK
GAKAGNLNNA LRHTAERTDA PLILVLDADF APQPNILKRM VGLFDDPKTG VVQSPQFFFN
ADPIQHNLAA SDSWVDDQRI FFDVFQPAKD AWNAAFCVGT SFIVRRDRLG EIGGFPDAAI
CEDLNLSLGM SRRGYETHWL NERLSMGLSA EGLPEYITQR TRWCLGTIQI ALLADGPLRG
PGYTLVQRIH FLHGVLNWAC KPYIVLMLLA PAVYWIAGLP AFEADVLSFL RYGAPPLFAL
WAYSGWVSRS RTLPIFMEVT HAISALAVTM TLIQAAVRPF GRPFKVTEKG GDRSQMRVRW
RMASAFGGLS LLSAFSIVWA FIAPTAPAEI SDIDYFNLVW AGVAMVLTFI CFLVCFEYPR
VDLAFRYDAD ARIEAGGTSH ACRIATLSPG RATLAEAGEP VSALGAPLIL HLPGIGAIDA
VADPAGLSLD PTPEQYRALV VALYSTPRDT IARAARFTPA VGGLLRRSLG LG
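
The protein sequence follows from the gene sequence under translation table 11. As a rough sketch of that relationship, the code below builds a codon table and translates a CDS up to its first stop codon. Table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it allows; this sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The name `translate_cds` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (third base varies fastest).
# These assignments are shared by the standard code and table 11.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate_cds("ATGATGTTTC CCTCGCCCGG CGACGCGGCC"))  # prints MMFPSPGDAA
```

The printed residues match the first block of the protein sequence listed above.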