Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1188 |
Symbol | wcaL |
ID | 6271131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1093135 |
End bp | 1094355 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641725320 |
Product | colanic acid biosynthesis glycosyl transferase WcaL |
Protein accession | YP_001879834 |
Protein GI | 187730685 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGTCG GCTTCTTTTT ACTGAAATTT CCGCTGTCGT CGGAAACCTT CGTCCTCAAT CAAATTACCG CGTTTATTGA TATGGGCTTT GAGGTGGAGA TTGTCGCGCT GCAAAAAGGC GACACCCAGA ATACCCACGC GGCATGGACG AAATACAACC TTGCCGCCAG AACCCGCTGG TTACAGGACG AACCACAAGG CAAAGTTGCG AAACTGCGCC ACCGCGCCAG CCAGACCTTA CGCGGCATTC ATCGTAAAAA TACCTGGCAG GCGCTTAACC TCAAACGCTA TGGTGCCGAG TCGCGGAACC TGATTTTGTC TGCCATTTGC GGCCAGGTCG CAACACCATT TTATGCCGAT GTCTTTATCG CTCATTTTGG CCCTGCGGGG GTAACCGCAG CAAAACTACG CGAACTGGGT GTGATTCGCG GCAAAATTGC CACCATCTTC CACGGTATTG ATATCTCCAG TCGGGAAGTG CTCAACCACT ACACTCCCGA ATATCAACAA CTGTTTCGCC GTGGCGACCT GATGTTACCG ATAAGCGATC TGTGGGCCGG AAGGCTGCAA AAAATGGGCT GCCCGAGGGA AAAAATCGCC GTATCGCGCA TGGGCGTGGA CATGACGCGT TTTAGCCCGC GTCCGGTGAA AGCGCCCGCA ACGCCGCTGG AAATCATCTC CGTCGCACGC TTAACCGAAA AAAAAGGCCT GCATGTGGCG ATCGAAGCCT GCCGTCAGTT GAAAGAGCAG GGCATGACAT TTCGCTATCG CATCCTCGGC ATTGGCCCGT GGGAACGACG CCTGCGTACC CTCATCGAAC AATATCAACT GGAAGATGTG GTAGAGATGC CGGGCTTTAA ACCGAGCCAC GAAGTGAAAG CGATGCTCGA CGACGCGGAT GTCTTCCTGT TGCCATCGGT AACGGGCGCG GATGGCGATA TGGAAGGCAT TCCGGTAGCG CTGATGGAAG CGATGGCGGT CGGCATTCCG GTGGTTTCTA CTCTGCATAG CGGAATACCG GAACTGGTGG AGGCTGACAA ATCCGGCTGG CTGGTGCCTG AGAACGATGC TCGCGCACTG GCGCAACGCT TGGCGGCATT TAGCCAACTG GACACCGACG AACTGGCTCC GGTCGTCAAA CGCGCGCGCG AAAAAGTCGA ACACGATTTT AACCAGCAGG TGATTAATCG AGAACTCGCC AGCTTGTTGC AGGCTTTATA G
|
Protein sequence | MKVGFFLLKF PLSSETFVLN QITAFIDMGF EVEIVALQKG DTQNTHAAWT KYNLAARTRW LQDEPQGKVA KLRHRASQTL RGIHRKNTWQ ALNLKRYGAE SRNLILSAIC GQVATPFYAD VFIAHFGPAG VTAAKLRELG VIRGKIATIF HGIDISSREV LNHYTPEYQQ LFRRGDLMLP ISDLWAGRLQ KMGCPREKIA VSRMGVDMTR FSPRPVKAPA TPLEIISVAR LTEKKGLHVA IEACRQLKEQ GMTFRYRILG IGPWERRLRT LIEQYQLEDV VEMPGFKPSH EVKAMLDDAD VFLLPSVTGA DGDMEGIPVA LMEAMAVGIP VVSTLHSGIP ELVEADKSGW LVPENDARAL AQRLAAFSQL DTDELAPVVK RAREKVEHDF NQQVINRELA SLLQAL
|
| |