Gene Information |
Locus tag | ECH74115_2978 |
Symbol | wcaL |
ID | 6968361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2756122 |
End bp | 2757342 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643386818 |
Product | colanic acid biosynthesis glycosyl transferase WcaL |
Protein accession | YP_002271286 |
Protein GI | 209399030 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
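The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(2756122, 2757342)
print(length)                     # 1221
print(protein_length_aa(length))  # 406
```

Under these assumptions the computed values agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields.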
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0458092 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.000111113 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGAAGGTCG GCTTCTTCTT ATTGAAATTT CCGCTGTCGT CAGAAACCTT CGTTCTTAAC CAAATCACTG CGTTTATTGA TATGGGCTTT GAAGTGGAGA TTGTCGCGCT GCAAAAAGGC GACACACAAA ACACCCACGT GGCATGGACT AAATACAACC TTGCCGCCAG AACCCGCTGG TTACAGGACG AACCTACGGG CAAAGTGGCG AAACTGAGCC ACCGAGCCAG TCAGACCTTG CGCGGCATTC ATCGTAAAAA TACCTGGCAG GCGCTCAACC TCAAACGCTA TGGTGCCGAG TCGTGGAACC TGATTTTGTC TGCCATTTGC GGCCAGGTCG CAACACCGTT TCGCGCCGAT GTGTTCATCG CTCATTTTGG CCCTGCGGGG GTAACCGCAG CAAAACTACG CGAACTGGGT GTCATTCACG GCAAAATTGC CACTATTTTC CACGGCATTG ATATTTCCAG TCGGGAAGTG CTCAACCATT ACACTCCCGA ATATCAACAA CTGTTTCGAC GTGGCGACCT GATGCTACCG ATAAGCGATC TGTGGGCCGG AAGGCTGCAA AAAATGGGCT GCCCGAGGGA AAAAATCGCC GTATCGCGCA TGGGCGTAGA TATGACGCGC TTTAGCCCGC GTCCCGTGAA AGCGCCCGCA ACACCGCTGG AGATTATTTC CGTCGCACGC TTAACCGAGA AAAAAGGTCT GCATGTGGCG ATTGAAGCCT GCCGGCAGTT GAAAGAGCAG GGCGTGGCAT TTCGCTATCG CATCCTCGGC ATTGGCCCGT GGGAACGACG CCTGCGCACG CTCATCGAAC AATATCAACT GGAAGATGTG GTGGAGATGC CGGGCTTTAA ACCGAGCCAT GAAGTGAAAG CGATGCTCGA CGACGCGGAT GTCTTCCTGT TGCCATCGGT TACAGGTGCG GATGGCGATA TGGAAGGCAT TCCGGTGGCG CTGATGGAGG CGATGGCGGT CGGCATTCCG GTGGTTTCTA CTCTGCATAG CGGAATACCG GAACTGGTGG AGGCTGACAA ATCCGGCTGG CTGGTGCCTG AGAACGATGC TCGCGCACTG GCGCAACGAC TGGCGGCGTT TAGCCAACTG GACACCGACG AATTGGCTCC GGTCGTCAAA CGCGCGCGCG AAAAAGTTGA ACACGATTTT AACCAGCAGG TGATCAATCG AGAACTCGCC AGCTTGCTGC AGGCTTTATA G
Protein sequence | MKVGFFLLKF PLSSETFVLN QITAFIDMGF EVEIVALQKG DTQNTHVAWT KYNLAARTRW LQDEPTGKVA KLSHRASQTL RGIHRKNTWQ ALNLKRYGAE SWNLILSAIC GQVATPFRAD VFIAHFGPAG VTAAKLRELG VIHGKIATIF HGIDISSREV LNHYTPEYQQ LFRRGDLMLP ISDLWAGRLQ KMGCPREKIA VSRMGVDMTR FSPRPVKAPA TPLEIISVAR LTEKKGLHVA IEACRQLKEQ GVAFRYRILG IGPWERRLRT LIEQYQLEDV VEMPGFKPSH EVKAMLDDAD VFLLPSVTGA DGDMEGIPVA LMEAMAVGIP VVSTLHSGIP ELVEADKSGW LVPENDARAL AQRLAAFSQL DTDELAPVVK RAREKVEHDF NQQVINRELA SLLQAL
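The relationship between the gene sequence and the protein sequence can be illustrated with a small sketch. It strips the 10-base grouping, translates codon by codon, and stops at the first stop codon. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation; the extra start codons that table 11 permits are not handled here, which does not matter for this record because the gene begins with ATG. The function names are hypothetical, and this is not the pipeline that produced the record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for 10-base grouping and upper-case the result."""
    return "".join(raw.split()).upper()


def translate_cds(raw: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence in this record.
print(translate_cds("ATGAAGGTCG GCTTCTTCTT ATTGAAATTT"))  # MKVGFFLLKF
```

The printed residues match the first ten residues of the protein sequence listed above. Passing the full gene sequence to `translate_cds` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.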