Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1018 |
Symbol | wcaL |
ID | 6142712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1038233 |
End bp | 1039453 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641615905 |
Product | colanic acid biosynthesis glycosyl transferase WcaL |
Protein accession | YP_001743097 |
Protein GI | 170682752 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGTCG GCTTCTTTTT ACTGAAATTT CCGCTGTCGT CGGAAACCTT CGTCCTCAAT CAAATTACCG CGTTTATTGA TATGGGCTTT GAGGTGGAGA TTGTCGCGCT GCAAAAAGGC GACACCCAGA ACACCCACGC GGCATGGACG AAATACAACC TTGCTGCCAG AACCCGCTGG TTACAGGACG AACCTACGGG CAAAGTGGCG AAACTGCGCC ACCGCGCCAG CCAGACGTTA CGCGGCATTC ATCGTAAAAA TACCTGGCAG GCGCTCAACC TCAAACGCTA TGGCGCCGAG TCGCGGAACC TGATTTTGTC TGCCATTTGT GGTCAGGTCG CAACACCGTT TCGCGCCGAT GTGTTCATCG CTCATTTTGG CCCTGCGGGG GTAACCGCAG CAAAACTACG CGAACTGGGT GTCATTCGCG GCAAAATTGC CACTATCTTC CACGGTATTG ATATCTCCAG TCGGGACGTG CTCAACCACT ACACTCCCGA ATATCAACAA CTGTTTCGCC GTGGCGACCT GATGTTACCG ATAAGCGATT TGTGGGCCGG AAGGCTGCAA AAAATGGGCT GTCCGAGGGA AAAAATCGCC GTATCGCGTA TGGGCGTGGA CATGACGCGC TTTAGCCCGC GTCCGGTGAA AGCGCCCGCA ACGCCGCTGG AAATCATCTC CGTCGCACGC TTAACCGAGA AAAAAGGCCT GCATGTGGCG ATCGAAGCCT GCCGGCAGTT GAAAGAGCAG GGCGTGGCAT TTCGCTATCG CATCCTCGGC ATTGGCCCGT GGGAACGACG CCTGCGCACG CTCATCGAAC AATATCAACT GGAAGATGTG GTAGAGATGC CGGGCTTTAA ACCGAGCCAT GAAGTGAAAG CGATGCTCGA CGACGCGGAT GTCTTCCTGT TGCCATCGAT AACGGGTGCG GATGGCGATA TGGAAGGTAT TCCGGTGGCG CTAATGGAAG CGATGGCGGT CGGCATTCCG GTTGTTTCAA CTCTGCATAG CGGAATACCA GAACTGGTGG AGGCCGATAA ATCCGGCTGG CTGGTGCCTG AGAACGATGC TCGCGCACTG GCGCAACGAC TGGCGGCGTT TAGCCAACTG AACACCGACG AACTGGCTCC GGTCGTCAAA CGTGCGCGCG AAAAAGTCGA ACACGATTTT AACCAGCAGG TGATTAATCG AGAACTCGCC AGCTTGCTGC AGGCTTTATA G
|
Protein sequence | MKVGFFLLKF PLSSETFVLN QITAFIDMGF EVEIVALQKG DTQNTHAAWT KYNLAARTRW LQDEPTGKVA KLRHRASQTL RGIHRKNTWQ ALNLKRYGAE SRNLILSAIC GQVATPFRAD VFIAHFGPAG VTAAKLRELG VIRGKIATIF HGIDISSRDV LNHYTPEYQQ LFRRGDLMLP ISDLWAGRLQ KMGCPREKIA VSRMGVDMTR FSPRPVKAPA TPLEIISVAR LTEKKGLHVA IEACRQLKEQ GVAFRYRILG IGPWERRLRT LIEQYQLEDV VEMPGFKPSH EVKAMLDDAD VFLLPSITGA DGDMEGIPVA LMEAMAVGIP VVSTLHSGIP ELVEADKSGW LVPENDARAL AQRLAAFSQL NTDELAPVVK RAREKVEHDF NQQVINRELA SLLQAL
|
| |