Gene Information |
Locus tag | EcE24377A_2336 |
Symbol | wcaL |
ID | 5588258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 2298830 |
End bp | 2300050 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640926001 |
Product | colanic acid biosynthesis glycosyl transferase WcaL |
Protein accession | YP_001463396 |
Protein GI | 157158914 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
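The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon. Both assumptions match the values in this record but are not stated by it. The function names are hypothetical and exist only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Length in aa of the product, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


# Values taken from the record: Start bp 2298830, End bp 2300050.
gene_len = feature_length(2298830, 2300050)
print(gene_len, protein_length(gene_len))  # prints "1221 406"
```

The strand field does not affect this calculation, since the length depends only on the span between the two coordinates.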
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000162166 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGTCG GCTTCTTTTT ACTGAAATTT CCGCTGTCGT CGGAAACCTT CGTTCTTAAC CAAATTACCG CGTTTATTGA TATGGGATTC GAGGTGGAGA TTGTTGCGCT GCAAAAAGGC GACACCCAGA ACACCCACGC GGCATGGACG AAATACAACC TTGCCGCAAG AACCCGCTGG TTACAGGACG AACCACAAGG CAAAGTGGCG AAACTGCGCC ACCGCGCCAG CCAGACCTTA CGCGGCATTC ATCGTAAAAA TACCTGGCAG GCGCTCAATC TCAAACGCTA TGGTGCCGAG TCGCGGAACC TGATTTTGTC TGCCATTTGC GGTCAGGTCG CAACACCGTT TCATGCCGAT GTCTTTATCG CTCATTTTGG TCCTGCGGGG GTAGCCGCAG CAAAACTACG CGAACTGGGT GTCATTCGCG GCAAAATTGC CACTATCTTC CACGGTATTG ATATCTCCAG TCGGGAAGTG CTCAACCACT ACACTCCCGA ATATCAGCAA CTGTTTTGCC GTGGCGACCT GATGTTACCG ATAAGCGATC TGTGGGCCGG AAGGCTGCAA AAAATGGGCT GCCCGAGGGA AAAAATCGCC GTATCGCGCA TGGGCGTAGA TATGACGCGC TTTAGCCCGC GTCCCGTGAA AGCGCCCGCA ACGCCGCTGG AAATCATCTC CGTCGCACGT TTAACCGAGA AAAAAGGCCT GCATGTGGCG ATTGAAGCCT GCCGTCAGTT GAAAGAGCAG GGCGTGGCAT TTCGCTATCG CATCCTCGGC ATTGGCCCGT GGGAACGACG CCTGCGCACG CTCATCGAAC AATATCAACT GGAAGATGTG GTGGAGATGC CTGGCTTTAA ACCGAGCCAT GAAGTGAAAG CGATGCTCGA CGACGCGGAT GTCTTCCTGT TGCCATCGGT TACAGGTGCG GATGGTGATA TGGAAGGCAT TCCGGTGGCG CTAATGGAAG CGATGGCGGT CGGCATTCCG GTGGTTTCTA CTCTGCATAG CGGAATACCG GAACTGGTGG AGGCTGACAA ATCCGGCTGG CTGGTGCCTG AGAACGATGC TCGCGCACTG GCGCAACGAC TGGCGGCGTT TAGCCAACTG GACACCGACG AACTGGCTCC GGTTGTCAAA CGTGCGCGCG AAAAAGTCGA ACACGATTTT AACCAGCAGG TGATTAATCG AGAACTCGCC AGCTTGTTAC AGGCTTTATA G
|
Protein sequence | MKVGFFLLKF PLSSETFVLN QITAFIDMGF EVEIVALQKG DTQNTHAAWT KYNLAARTRW LQDEPQGKVA KLRHRASQTL RGIHRKNTWQ ALNLKRYGAE SRNLILSAIC GQVATPFHAD VFIAHFGPAG VAAAKLRELG VIRGKIATIF HGIDISSREV LNHYTPEYQQ LFCRGDLMLP ISDLWAGRLQ KMGCPREKIA VSRMGVDMTR FSPRPVKAPA TPLEIISVAR LTEKKGLHVA IEACRQLKEQ GVAFRYRILG IGPWERRLRT LIEQYQLEDV VEMPGFKPSH EVKAMLDDAD VFLLPSVTGA DGDMEGIPVA LMEAMAVGIP VVSTLHSGIP ELVEADKSGW LVPENDARAL AQRLAAFSQL DTDELAPVVK RAREKVEHDF NQQVINRELA SLLQAL
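The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a minimal sketch, not the pipeline used to produce this record. It assumes the gene sequence is given in the coding (5' to 3') orientation, as the leading ATG and trailing TAG suggest, and it uses the amino-acid assignments of translation table 11. It does not handle alternative start codons, which table 11 permits but this gene does not use. The helper names (`clean_sequence`, `translate`, `gc_percent`) are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order. The amino-acid assignments
# of translation table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the spacing between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def translate(raw: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from this record.
print(translate("ATGAAGGTCG GCTTCTTTTT ACTGAAATTT"))  # prints "MKVGFFLLKF"
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in the record, and `gc_percent` on the same input can be compared against the GC content field.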