Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1182 |
Symbol | wcaI |
ID | 6272035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1084428 |
End bp | 1085651 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641725314 |
Product | putative glycosyl transferase |
Protein accession | YP_001879828 |
Protein GI | 187730929 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAC TGGTCTACGG CATTAACTAC TCGCCGGAGT TAACCGGCAT CGGCAAATAC ACCGGAGAGA TGGTTGAATG GCTGGCGGCG CAAGGTCATG AGGTGCGGGT TATTACCGCA CCGCCTTACT ACCCGCAGTG GCAGGTGGGC GAGAACTATT CCGCCTGGCG GTACAAACGA GAAGAGGGGG CCGCTACGGT GTGGCGCTGC CCGCTGTACG TGCCAAAACA GCCGAGCACC CTGAAACGCC TGTTGCATCT CGGCAGTTTT GCCGTCAGCA GTTTTTTCCC GTTGATGGCG CAACGTCGCT GGAAGCCGGA TCGCATTATC GGCGTAGTGC CAACGCTGTT TTGCACGCCG GGAATGCGCC TGCTGGCGAA ACTCTCTGGC GCGCGTACCG TGCTGCATAT TCAGGATTAC GAAGTGGACG CCATGCTGGG GCTGGGCCTT GCCGGAAAAG GCAAAGGCGG CAAAGTGGCA CAGCTGGCAA CGGCGTTCGA ACGTAGCGGA CTGCATAACG TCGATAACGT CTCCACGATT TCGCGTTCGA TGATGAATAA AGCCATCGAA AAAGGCGTGG CGGCAGAAAA CGTCATCTTC TTCCCCAACT GGTCAGAAAT TGCCCGTTTT CAGCATGTCG CAGATGTCGA TGTTGATGCC CTTCGTAACC AGCTTGGCCT GCCGGATAAC AAAAAAATCA TTCTTTACTC CGGCAATATT GGTGAAAAGC AGGGGCTGGA AAACGTTATT GAAGCTGCCG ATCGCCTGCG TGATGAACCG CTGATTTTTG CCATTGTCGG GCAGGGCGGC GGCAAAGCGC GGCTGGAAAA AATGGCGCAA CAGCGTGGTC TGCGCAACAT GCAATTTTTT CCGCTGCAAT CGTATGACGC TTTACCCGCA CTGCTGAAGA TGGGCGATTG CCATCTGGTG GTGCAAAAAC GCGGCGCGGC AGATGCCGTA TTGCCGTCGA AACTGACCAA TATTCTGGCG GTAGGCGGTA ACGCGGTGAT TACTGCAGAA GCCCACACAG AACTGGGGCA GCTTTGCGAA ACCTTTCCGG GCATTGCGGT TTGCGTAGAA CCGGAATCGG TCGAGGCGCT GGTGGCGGGG ATTCGTCAGG CGCTCCTGCT GCCCAAACAC AACACGGTGG CACGTGAATA TGCCGAACGC ACGCTCGATA AAGAGAACGT GTTACGTCAA TTTATAAATG ATATTCGGGG ATAA
|
Protein sequence | MKILVYGINY SPELTGIGKY TGEMVEWLAA QGHEVRVITA PPYYPQWQVG ENYSAWRYKR EEGAATVWRC PLYVPKQPST LKRLLHLGSF AVSSFFPLMA QRRWKPDRII GVVPTLFCTP GMRLLAKLSG ARTVLHIQDY EVDAMLGLGL AGKGKGGKVA QLATAFERSG LHNVDNVSTI SRSMMNKAIE KGVAAENVIF FPNWSEIARF QHVADVDVDA LRNQLGLPDN KKIILYSGNI GEKQGLENVI EAADRLRDEP LIFAIVGQGG GKARLEKMAQ QRGLRNMQFF PLQSYDALPA LLKMGDCHLV VQKRGAADAV LPSKLTNILA VGGNAVITAE AHTELGQLCE TFPGIAVCVE PESVEALVAG IRQALLLPKH NTVAREYAER TLDKENVLRQ FINDIRG
|
| |