Gene Information |
Locus tag | SbBS512_E4058 |
Symbol | waaA |
ID | 6269421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3791362 |
End bp | 3792639 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641727898 |
Product | 3-deoxy-D-manno-octulosonic-acid transferase |
Protein accession | YP_001882330 |
Protein GI | 187730590 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1519] 3-deoxy-D-manno-octulosonic-acid transferase |
TIGRFAM ID | |
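The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for SbBS512_E4058 (waaA).
length = gene_length_bp(3791362, 3792639)
print(length)                              # 1278
print(expected_protein_length_aa(length))  # 425
```

Both results agree with the Gene Length (1278 bp) and Protein Length (425 aa) rows of the record.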
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.000579677 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGAAT TGCTTTACAC CGCCCTTCTC TACCTTATTC AGCCGCTGAT CTGGATACGG CTCTGGGTGC GCGGACGTAA GGCTCCGGCC TATCGAAAAC GCTGGGGTGA ACGTTACGGT TTTTACCGCC ATCCGCTAAA ACCAGGCGGC ATTATGCTGC ACTCCGTCTC CGTCGGTGAA ACTCTGGCGG TGATCCCGTT GGTGCGCGCG CTGCGTCATC GTTATCCTGA TTTACCGATT ACCGTTACAA CCATGACGCC AACCGGTTCG GAGCGCGTAC AATCGGCTTT CGGGAAGGAT GTTCAGCACG TTTATCTGCC GTATGACCTG CCCGATGCAC TTAATCGTTT CCTGAATAAA GTCGACCCTA AACTGGTGTT GATTATGGAA ACCGAACTAT GGCCTAACCT GATTGCGGCG CTACATAAAC GTAAAATTCC GCTGGTGATC GCGAACGCGC GACTCTCTGC CCGCTCGGCC GCAGGTTATG CCAAACTGGG TAAATTCGTC CGTCGCTTGC TGCGTCGTAT TACGCTGATT GCCGCCCAAA ATGAAGAAGA TGGTGCACGT TTTGTGGCGC TGGGCGCAAA AAATAACCAG GTAACCGTTA CCGGTAGCCT GAAATTCGAT ATTTCTGTAA CGCCGCAGTT GGCTGCTAAA GCCGTGACGC TGCGCCGCCA GTGGGCACCA CACCGCCCGG TATGGATTGC CACCAGCACT CACGAAGGTG AAGAGAGTGT GGTGATCGCC GCACATCAGG TATTGTTACA GCAATTCCCG AATTTATTGC TCATCCTGGT ACCCCGTCAT CCAGAACGCT TCCCGGATGC GATTAACCTT GTCCGCCAGG CTGGACTAAG CTATATCACA CGCTCTTCAG GGGAAGTCCC CTCCACCAGC ACGCAGGTTG TGGTTGGCGA TACGATGGGC GAGTTGATGT TACTGTACGG CATTGCCGAT CTCGCCTTTG TTGGCGGTTC ACTGGTTGAA CGTGGTGGGC ATAATCCGCT GGAAGCTGCC GCACACGCTA TTCCGGTATT GATGGGGCCG CATACTTTTA ATTTTAAAGA CATTTGCGCG CGGCTGGAGC AGGCAAGCGG GCTGATTACC GTTACCGATG CCACTACGCT TGCAAAAGAG GTTTCCTCTT TACTCACCGA CGCCGATTAC CGTAGTTTCT ATGGCCGTCA TGCCGTTGAA GTACTGTATC AAAACCAGGG CGCGCTACAG CGCCTGCTTC AACTGCTGGA ACCTTACCTG CCACCGAAAA CGCATTGA
|
Protein sequence | MLELLYTALL YLIQPLIWIR LWVRGRKAPA YRKRWGERYG FYRHPLKPGG IMLHSVSVGE TLAVIPLVRA LRHRYPDLPI TVTTMTPTGS ERVQSAFGKD VQHVYLPYDL PDALNRFLNK VDPKLVLIME TELWPNLIAA LHKRKIPLVI ANARLSARSA AGYAKLGKFV RRLLRRITLI AAQNEEDGAR FVALGAKNNQ VTVTGSLKFD ISVTPQLAAK AVTLRRQWAP HRPVWIATST HEGEESVVIA AHQVLLQQFP NLLLILVPRH PERFPDAINL VRQAGLSYIT RSSGEVPSTS TQVVVGDTMG ELMLLYGIAD LAFVGGSLVE RGGHNPLEAA AHAIPVLMGP HTFNFKDICA RLEQASGLIT VTDATTLAKE VSSLLTDADY RSFYGRHAVE VLYQNQGALQ RLLQLLEPYL PPKTH
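The sequences above are printed in whitespace-separated blocks of 10, so they need cleaning before any analysis. The sketch below is illustrative only: it strips the spacing, computes a GC fraction, and translates a CDS using the codon assignments of translation table 11, which are the same as those of the standard code for elongation. It assumes the CDS starts with ATG (as this gene does) and does not handle alternative start codons; all names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon and drop any trailing stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein.rstrip("*")


# First three blocks of the gene sequence in this record.
head = "ATGCTCGAAT TGCTTTACAC CGCCCTTCTC"
print(translate(head))    # MLELLYTALL
print(gc_fraction(head))  # 0.5
```

The translated fragment matches the first block of the Protein sequence row. Passing the full Gene sequence value to `translate` and `gc_fraction` would be one way to check the record against its Protein sequence and GC content rows.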
| |