Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4898 |
Symbol | bcsA |
ID | 6967926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4537869 |
End bp | 4540487 |
Gene Length | 2619 bp |
Protein Length | 872 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643388586 |
Product | cellulose synthase catalytic subunit |
Protein accession | YP_002273014 |
Protein GI | 209397409 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | [TIGR03030] cellulose synthase catalytic subunit (UDP-forming) |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTATCC TGACCCGGTG GTTGCTTATC CCGCCGGTCA ACGCGCGGCT TATCGGGCGT TATCGCGATT ATCGTCGTCA CGGTGCGTCG GCTTTCAGCG CGACGCTCGG CTGTTTCTGG ATGATCCTGG CCTGGATTTT TATTCCACTG GAGCACCCGC GCTGGCAGCG TATTCGCGCA GAACATAAAA ACCTGTATCC GCATATCAAC GCCTCGCGTC CGCGTCCGCT GGACCCGGTC CGTTATCTCA TTCAAACATG CTGGTTACTG ATCGGTGCAT CGCGCAAAGA AACGCCGAAA CCGCGCAGGC GGGCATTTTC AGGTCTGCAG AATATTCGTG GACGTTACCA TCAATGGATG AACGAGCTGC CTGAGCGCGT TAGCCATAAA ACACAGCATC TTGATGAGAA AAAAGAGCTC GGTCATTTGA GTGCCGGGGC GCGGCGGTTG ATCCTCGGTA TCATCGTCAC CTTCTCGCTG ATTCTGGCGT TAATCTGCGT TACTCAGCCG TTTAACCCGC TGGCGCAGTT TATCTTCCTG ATGCTGCTGT GGGGGGGAGC GCTGATCGTA CGGCGGATGC CGGGGCGCTT CTCAGCGCTA ATGTTGATTG TGCTGTCGCT GACTGTTTCT TGCCGTTATA TCTGGTGGCG ATATACCTCT ACGCTGAACT GGGACGATCC GGTCAGCCTG GTGTGCGGGC TTATTCTGCT CTTCGCTGAA ACGTACGCGT GGATTGTGCT GGTGCTCGGC TACTTCCAGG TAGTATGGCC GCTGAATCGT CAGCCGGTGC CATTGCCGAA AGATATGTCG CTGTGGCCGT CGGTGGATAT CTTTGTCCCG ACTTACAACG AAGATCTCAA CGTGGTGAAA AATACCATTT ACGCCTCGCT GGGTATCGAC TGGCCGAAAG ACAAGCTGAA CATCTGGATC CTCGATGATG GCGGCAGGGA AGAGTTTCGC CAGTTTGCGC AAAACGTGGG GGTGAAGTAT ATCGCCCGTA CCACTCATGA ACATGCGAAA GCGGGCAACA TCAACAATGC GCTGAAATAT GCCAAAGGCG AGTTCGTGTC GATTTTCGAC TGCGACCACG TACCAACGCG ATCGTTCCTG CAAATGACCG TGGGCTGGTT CCTGAAAGAG AAACAGCTGG CGATGATGCA GACCCCACAC CATTTCTTCT CGCCGGACCC GTTTGAACGC AACCTGGGGC GTTTTCGTAA AACACCGAAC GAAGGCACGC TGTTCTATGG TCTGGTGCAG GATGGCAACG ATATGTGGGA CGCCACTTTC TTCTGCGGTT CCTGTGCGGT GATTCGCCGT AAGCCGCTGG ATGAAATTGG CGGCATTGCT GTCGAAACTG TGACTGAAGA TGCGCATACT TCTCTGCGTT TGCACCGTCG TGGCTATACC TCCGCGTATA TGCGTATTCC GCAGGCGGCG GGGCTGGCGA CCGAAAGTCT GTCGGCGCAT ATCGGTCAGC GTATTCGCTG GGCGCGCGGG ATGGTGCAAA TCTTCCGTCT CGATAACCCG CTCACCGGTA AAGGGCTGAA GTTTGCTCAG CGGCTGTGCT ACGTCAACGC CATGTTCCAC TTCTTGTCGG GCATTCCACG GCTGATCTTC CTGACTGCGC CGCTGGCGTT CCTGCTGCTT CATGCCTACA TCATCTATGC GCCAGCGTTG ATGATCGCCC TGTTCGTGCT GCCGCATATG ATCCATGCCA GCCTGACCAA CTCCAAGATC CAGGGCAAAT ATCGCCACTC TTTCTGGAGT GAAATCTACG AAACGGTGCT GGCGTGGTAT ATCGCACCAC CGACGCTGGT GGCGCTGATT AACCCGCACA AAGGCAAATT TAACGTCACC GCCAAAGGTG GACTGGTGGA AGAAGAGTAC GTCGACTGGG TGATCTCGCG GCCCTACATC TTCCTTGTTC TGCTCAACCT GGTGGGCGTT GCGGTAGGCA TCTGGCGCTA CTTCTATGGC CCGCCAACCG AGATGCTCAC CGTGGTCGTC AGTATGGTGT GGGTATTCTA CAACCTGATT GTTCTTGGCG GCGCAGTTGC GGTATCGGTA GAAAGCAAAC AGGTACGCCG ATCGCACCGC GTGGAGATGA CGATGCCCGC GGCAATTGCC CGCGAAGATG GTCACCTCTT CTCGTGTACC GTTCAGGATT TCTCCGACGG TGGTTTGGGG ATCAAGATCA ACGGTCAGGC GCAGATTCTG GAAGGGCAGA AAGTGAATCT GTTGCTTAAA CGCGGTCAGC AGGAATACGT CTTCCCGACC CAGGTGGCGC GCGTGATGGG TAATGAAGTT GGGCTGAAAT TAATGCCGCT CACCACCCAG CAACATATCG ATTTTGTGCA GTGTACGTTT GCCCGTGCGG ATACATGGGC GCTCTGGCAG GACAGCTACC CGGAAGATAA GCCGCTGGAA AGTCTGCTGG ATATTCTGAA GCTCGGCTTC CGTGGCTACC GCCATCTGGC GGAGTTTGCG CCTTCTTCGG TGAAGGGCAT ATTCCGTGTG CTGACTTCTC TGGTTTCCTG GGTTGTATCG TTTATTCCGC GCCGCCCGGA GCGGAGCGAA ACGGCACAAC CATCGGATCA GGCTTTGGCT CAACAATGA
|
Protein sequence | MSILTRWLLI PPVNARLIGR YRDYRRHGAS AFSATLGCFW MILAWIFIPL EHPRWQRIRA EHKNLYPHIN ASRPRPLDPV RYLIQTCWLL IGASRKETPK PRRRAFSGLQ NIRGRYHQWM NELPERVSHK TQHLDEKKEL GHLSAGARRL ILGIIVTFSL ILALICVTQP FNPLAQFIFL MLLWGGALIV RRMPGRFSAL MLIVLSLTVS CRYIWWRYTS TLNWDDPVSL VCGLILLFAE TYAWIVLVLG YFQVVWPLNR QPVPLPKDMS LWPSVDIFVP TYNEDLNVVK NTIYASLGID WPKDKLNIWI LDDGGREEFR QFAQNVGVKY IARTTHEHAK AGNINNALKY AKGEFVSIFD CDHVPTRSFL QMTVGWFLKE KQLAMMQTPH HFFSPDPFER NLGRFRKTPN EGTLFYGLVQ DGNDMWDATF FCGSCAVIRR KPLDEIGGIA VETVTEDAHT SLRLHRRGYT SAYMRIPQAA GLATESLSAH IGQRIRWARG MVQIFRLDNP LTGKGLKFAQ RLCYVNAMFH FLSGIPRLIF LTAPLAFLLL HAYIIYAPAL MIALFVLPHM IHASLTNSKI QGKYRHSFWS EIYETVLAWY IAPPTLVALI NPHKGKFNVT AKGGLVEEEY VDWVISRPYI FLVLLNLVGV AVGIWRYFYG PPTEMLTVVV SMVWVFYNLI VLGGAVAVSV ESKQVRRSHR VEMTMPAAIA REDGHLFSCT VQDFSDGGLG IKINGQAQIL EGQKVNLLLK RGQQEYVFPT QVARVMGNEV GLKLMPLTTQ QHIDFVQCTF ARADTWALWQ DSYPEDKPLE SLLDILKLGF RGYRHLAEFA PSSVKGIFRV LTSLVSWVVS FIPRRPERSE TAQPSDQALA QQ
|
| |