Gene ECH74115_4898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4898 
SymbolbcsA 
ID6967926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4537869 
End bp4540487 
Gene Length2619 bp 
Protein Length872 aa 
Translation table11 
GC content55% 
IMG OID643388586 
Productcellulose synthase catalytic subunit 
Protein accessionYP_002273014 
Protein GI209397409 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID[TIGR03030] cellulose synthase catalytic subunit (UDP-forming) 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTATCC TGACCCGGTG GTTGCTTATC CCGCCGGTCA ACGCGCGGCT TATCGGGCGT 
TATCGCGATT ATCGTCGTCA CGGTGCGTCG GCTTTCAGCG CGACGCTCGG CTGTTTCTGG
ATGATCCTGG CCTGGATTTT TATTCCACTG GAGCACCCGC GCTGGCAGCG TATTCGCGCA
GAACATAAAA ACCTGTATCC GCATATCAAC GCCTCGCGTC CGCGTCCGCT GGACCCGGTC
CGTTATCTCA TTCAAACATG CTGGTTACTG ATCGGTGCAT CGCGCAAAGA AACGCCGAAA
CCGCGCAGGC GGGCATTTTC AGGTCTGCAG AATATTCGTG GACGTTACCA TCAATGGATG
AACGAGCTGC CTGAGCGCGT TAGCCATAAA ACACAGCATC TTGATGAGAA AAAAGAGCTC
GGTCATTTGA GTGCCGGGGC GCGGCGGTTG ATCCTCGGTA TCATCGTCAC CTTCTCGCTG
ATTCTGGCGT TAATCTGCGT TACTCAGCCG TTTAACCCGC TGGCGCAGTT TATCTTCCTG
ATGCTGCTGT GGGGGGGAGC GCTGATCGTA CGGCGGATGC CGGGGCGCTT CTCAGCGCTA
ATGTTGATTG TGCTGTCGCT GACTGTTTCT TGCCGTTATA TCTGGTGGCG ATATACCTCT
ACGCTGAACT GGGACGATCC GGTCAGCCTG GTGTGCGGGC TTATTCTGCT CTTCGCTGAA
ACGTACGCGT GGATTGTGCT GGTGCTCGGC TACTTCCAGG TAGTATGGCC GCTGAATCGT
CAGCCGGTGC CATTGCCGAA AGATATGTCG CTGTGGCCGT CGGTGGATAT CTTTGTCCCG
ACTTACAACG AAGATCTCAA CGTGGTGAAA AATACCATTT ACGCCTCGCT GGGTATCGAC
TGGCCGAAAG ACAAGCTGAA CATCTGGATC CTCGATGATG GCGGCAGGGA AGAGTTTCGC
CAGTTTGCGC AAAACGTGGG GGTGAAGTAT ATCGCCCGTA CCACTCATGA ACATGCGAAA
GCGGGCAACA TCAACAATGC GCTGAAATAT GCCAAAGGCG AGTTCGTGTC GATTTTCGAC
TGCGACCACG TACCAACGCG ATCGTTCCTG CAAATGACCG TGGGCTGGTT CCTGAAAGAG
AAACAGCTGG CGATGATGCA GACCCCACAC CATTTCTTCT CGCCGGACCC GTTTGAACGC
AACCTGGGGC GTTTTCGTAA AACACCGAAC GAAGGCACGC TGTTCTATGG TCTGGTGCAG
GATGGCAACG ATATGTGGGA CGCCACTTTC TTCTGCGGTT CCTGTGCGGT GATTCGCCGT
AAGCCGCTGG ATGAAATTGG CGGCATTGCT GTCGAAACTG TGACTGAAGA TGCGCATACT
TCTCTGCGTT TGCACCGTCG TGGCTATACC TCCGCGTATA TGCGTATTCC GCAGGCGGCG
GGGCTGGCGA CCGAAAGTCT GTCGGCGCAT ATCGGTCAGC GTATTCGCTG GGCGCGCGGG
ATGGTGCAAA TCTTCCGTCT CGATAACCCG CTCACCGGTA AAGGGCTGAA GTTTGCTCAG
CGGCTGTGCT ACGTCAACGC CATGTTCCAC TTCTTGTCGG GCATTCCACG GCTGATCTTC
CTGACTGCGC CGCTGGCGTT CCTGCTGCTT CATGCCTACA TCATCTATGC GCCAGCGTTG
ATGATCGCCC TGTTCGTGCT GCCGCATATG ATCCATGCCA GCCTGACCAA CTCCAAGATC
CAGGGCAAAT ATCGCCACTC TTTCTGGAGT GAAATCTACG AAACGGTGCT GGCGTGGTAT
ATCGCACCAC CGACGCTGGT GGCGCTGATT AACCCGCACA AAGGCAAATT TAACGTCACC
GCCAAAGGTG GACTGGTGGA AGAAGAGTAC GTCGACTGGG TGATCTCGCG GCCCTACATC
TTCCTTGTTC TGCTCAACCT GGTGGGCGTT GCGGTAGGCA TCTGGCGCTA CTTCTATGGC
CCGCCAACCG AGATGCTCAC CGTGGTCGTC AGTATGGTGT GGGTATTCTA CAACCTGATT
GTTCTTGGCG GCGCAGTTGC GGTATCGGTA GAAAGCAAAC AGGTACGCCG ATCGCACCGC
GTGGAGATGA CGATGCCCGC GGCAATTGCC CGCGAAGATG GTCACCTCTT CTCGTGTACC
GTTCAGGATT TCTCCGACGG TGGTTTGGGG ATCAAGATCA ACGGTCAGGC GCAGATTCTG
GAAGGGCAGA AAGTGAATCT GTTGCTTAAA CGCGGTCAGC AGGAATACGT CTTCCCGACC
CAGGTGGCGC GCGTGATGGG TAATGAAGTT GGGCTGAAAT TAATGCCGCT CACCACCCAG
CAACATATCG ATTTTGTGCA GTGTACGTTT GCCCGTGCGG ATACATGGGC GCTCTGGCAG
GACAGCTACC CGGAAGATAA GCCGCTGGAA AGTCTGCTGG ATATTCTGAA GCTCGGCTTC
CGTGGCTACC GCCATCTGGC GGAGTTTGCG CCTTCTTCGG TGAAGGGCAT ATTCCGTGTG
CTGACTTCTC TGGTTTCCTG GGTTGTATCG TTTATTCCGC GCCGCCCGGA GCGGAGCGAA
ACGGCACAAC CATCGGATCA GGCTTTGGCT CAACAATGA
 
Protein sequence
MSILTRWLLI PPVNARLIGR YRDYRRHGAS AFSATLGCFW MILAWIFIPL EHPRWQRIRA 
EHKNLYPHIN ASRPRPLDPV RYLIQTCWLL IGASRKETPK PRRRAFSGLQ NIRGRYHQWM
NELPERVSHK TQHLDEKKEL GHLSAGARRL ILGIIVTFSL ILALICVTQP FNPLAQFIFL
MLLWGGALIV RRMPGRFSAL MLIVLSLTVS CRYIWWRYTS TLNWDDPVSL VCGLILLFAE
TYAWIVLVLG YFQVVWPLNR QPVPLPKDMS LWPSVDIFVP TYNEDLNVVK NTIYASLGID
WPKDKLNIWI LDDGGREEFR QFAQNVGVKY IARTTHEHAK AGNINNALKY AKGEFVSIFD
CDHVPTRSFL QMTVGWFLKE KQLAMMQTPH HFFSPDPFER NLGRFRKTPN EGTLFYGLVQ
DGNDMWDATF FCGSCAVIRR KPLDEIGGIA VETVTEDAHT SLRLHRRGYT SAYMRIPQAA
GLATESLSAH IGQRIRWARG MVQIFRLDNP LTGKGLKFAQ RLCYVNAMFH FLSGIPRLIF
LTAPLAFLLL HAYIIYAPAL MIALFVLPHM IHASLTNSKI QGKYRHSFWS EIYETVLAWY
IAPPTLVALI NPHKGKFNVT AKGGLVEEEY VDWVISRPYI FLVLLNLVGV AVGIWRYFYG
PPTEMLTVVV SMVWVFYNLI VLGGAVAVSV ESKQVRRSHR VEMTMPAAIA REDGHLFSCT
VQDFSDGGLG IKINGQAQIL EGQKVNLLLK RGQQEYVFPT QVARVMGNEV GLKLMPLTTQ
QHIDFVQCTF ARADTWALWQ DSYPEDKPLE SLLDILKLGF RGYRHLAEFA PSSVKGIFRV
LTSLVSWVVS FIPRRPERSE TAQPSDQALA QQ