Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2985 |
Symbol | wcaI |
ID | 6967214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2764730 |
End bp | 2765953 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643386825 |
Product | putative glycosyl transferase |
Protein accession | YP_002271293 |
Protein GI | 209396871 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.00135892 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAATAC TGGTCTACGG CATTAACTAC TCGCCGGAGT TAACCGGCAT CGGCAAATAC ACCGGCGAGA TGGTGGAATG GCTGGCGGCG CAAGGTCATG AGGTGCGGGT TATTACCGCA CCGCCTTACT ACCCGCAGTG GCAAGTGGGC GAGAACTATT CCGCCTGGCG GTACAAACGA GAAGAGGGGG CCGCTACGGT GTGGCGCTGC CCGCTGTACG TGCCAAAACA GCCGAGCACC CTGAAACGCC TGTTGCATCT CGGCAGTTTT GCCGTCAGCA GTTTTTTCCC GTTGATGGCG CAACGTCGCT GGAAGCCGGA TCGCATTATC GGCGTAGTGC CAACGCTGTT TTGCACGCCG GGAATGCGCC TGCTGGCGAA ACTCTCTGGC GCGCGTACCG TGCTGCATAT TCAGGATTAC GAAGTAGATG CCATGCTGGG GCTGGGCCTT GCCGGAAAAG GCAAAGGCGG CAAAGTGGCA CAGCTGGCAA CGGCGTTCGA ACGTAGCGGA CTGCATAACG TCGATAACGT TTCCACGATT TCGCGTTCGA TGATGAATAA AGCCATCGAA AAAGGCGTGG CGGCGGAAAA CGTCATCTTC TTCCCCAACT GGTCGGAAAT CGCCCGTTTT CAGCATGTTG CAGACGCCGA TGTTGATGCC CTTCGTAACC AGCTTGGCCT GCCGGATAAC AAAAAAATCA TTCTTTACTC CGGCAATATT GGTGAAAAGC AGGGGCTGGA AAACGTTATT GAAGCAGCCG ATCGCCTGCG CGATGAACCG CTGATTTTTG CCATTGTCGG GCAGGGCGGC GGCAAAGCGC GGCTGGAAAA AATGGCGCAG CAGCGTGGAC TGCGCAACAT GCAATTTTTC CCGCTGCAAT CGTATGACGC TTTACCCGCA CTGCTGAAGA TGGGCGATTG CCATCTGGTG GTGCAAAAAC GCGGCGCGGC AGATGCCGTA TTGCCGTCGA AACTGACCAA TATTCTGGCA GTAGGCGGTA ACGCGGTGAT TACTGCTGAA GCCCACACAG AACTGGGACA GCTTTGCGAA ACCTTTCCGG GCATTGCGGT TTGCGTAGAA CCGGAATCGG TTGAGGCGCT GGTGGCGGGG ATTCGTCAGG CGCTCCTGCT GCCCAAACAC AACACGGTGG CACGTGAATA TGCCGAACGC ACGCTTGATA AAGAGAACGT GTTACGTCAA TTTATAAATG ATATTCGGGG ATAA
|
Protein sequence | MKILVYGINY SPELTGIGKY TGEMVEWLAA QGHEVRVITA PPYYPQWQVG ENYSAWRYKR EEGAATVWRC PLYVPKQPST LKRLLHLGSF AVSSFFPLMA QRRWKPDRII GVVPTLFCTP GMRLLAKLSG ARTVLHIQDY EVDAMLGLGL AGKGKGGKVA QLATAFERSG LHNVDNVSTI SRSMMNKAIE KGVAAENVIF FPNWSEIARF QHVADADVDA LRNQLGLPDN KKIILYSGNI GEKQGLENVI EAADRLRDEP LIFAIVGQGG GKARLEKMAQ QRGLRNMQFF PLQSYDALPA LLKMGDCHLV VQKRGAADAV LPSKLTNILA VGGNAVITAE AHTELGQLCE TFPGIAVCVE PESVEALVAG IRQALLLPKH NTVAREYAER TLDKENVLRQ FINDIRG
|
| |