Gene Information |
Locus tag | EcE24377A_4134 |
Symbol | waaA |
ID | 5586758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 4124476 |
End bp | 4125753 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640927753 |
Product | 3-deoxy-D-manno-octulosonic-acid transferase |
Protein accession | YP_001465113 |
Protein GI | 157157302 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1519] 3-deoxy-D-manno-octulosonic-acid transferase |
TIGRFAM ID | |
|
|
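The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the numbers in this record, but the page does not state them. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 4124476
END_BP = 4125753
GENE_LENGTH_BP = 1278
PROTEIN_LENGTH_AA = 425


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Each comparison is True when the record's fields agree with each other.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With the values in this record, 4125753 - 4124476 + 1 = 1278 bp, and 1278 / 3 - 1 = 425 aa.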
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00000304372 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGAAT TGCTTTACAC CGCCCTTCTC TACCTTATTC AGCCGCTGAT CTGGATACGG CTCTGGGTGC GCGGACGTAA GGCTCCGGCC TATCGAAAAC GCTGGGGTGA ACGTTACGGT TTTTACCGCC ATCCGCTAAA ACCAGGCGGC ATTATGCTGC ACTCCGTCTC CGTCGGTGAA ACTCTGGCGG CGATCCCGTT GGTGCGCGCG CTGCGTCATC GTTATCCTGA TTTACCGATT ACCGTTACAA CCATGACGCC AACCGGTTCG GAGCGCGTAC AATCGGCTTT CGGGAAGGAT GTTCAGCACG TTTATCTGCC GTATGACCTG CCCGATGCAC TCAACCGTTT CCTGAATAAA GTCGACCCTA AACTGGTGTT GATTATGGAA ACCGAACTAT GGCCTAACCT GATTGCGGCG CTACATAAAC GTAAAATTCC GCTGGTGATC GCTAACGCGC GACTCTCTGC CCGCTCGGCC GCAGGTTATG CCAAACTGGG TAAATTCGTC CGCCGCTTGC TGCGTCGTAT TACGCTGATT GCTGCGCAAA ATGAAGAAGA TGGTGCACGT TTTGTGGCGC TGGGCGCAAA AAATAACCAG GTAACCGTTA CCGGTAGCCT GAAATTCGAT ATTTCTGTAA CGCCGCAGTT GGCTGCTAAA GCCGTGACGC TACGCCGCCA ATGGGCACCA CACCGCCCGG TATGGATTGC CACCAGCACT CACGAAGGTG AAGAGAGCGT GGTCATTGCC GCACATCAGG CATTGTTACA GCAATTCCCG AATTTATTGC TCATCCTGGT ACCCCGTCAT CCAGAACGCT TCCCGGATGC GATTAACCTT GTCCGCCAGG CTGGACTAAG CTATATCACA CGCTCTTCAG GGGAAGTCCC CTCCACCAGC ACGCAGGTTG TAGTTGGCGA TACGATGGGC GAGTTGATGT TACTGTACGG CATTGCCGAT CTCGCCTTTG TTGGCGGTTC ACTGGTTGAA CGTGGTGGGC ATAATCCGCT GGAAGCTGCC GCACACGCTA TTCCGGTATT GATGGGGCCG CATACTTTTA ACTTTAAAGA CATTTGCGCG CGGCTGGAGC AGGCAAGCGG GCTGATTACC GTTACCGATG CCACTACGCT TGCAAAAGAG GTGTCCTCTT TACTCACCGA CGCCGATTAC CGTAGTTTCT ATGGCCGTCA TGCCGTTGAA GTACTGTATC AAAACCAGGG CGCGCTACAG CGTCTGCTTC AACTGCTGGA ACCTTACCTG CCACCGAAAA CGCATTGA
|
Protein sequence | MLELLYTALL YLIQPLIWIR LWVRGRKAPA YRKRWGERYG FYRHPLKPGG IMLHSVSVGE TLAAIPLVRA LRHRYPDLPI TVTTMTPTGS ERVQSAFGKD VQHVYLPYDL PDALNRFLNK VDPKLVLIME TELWPNLIAA LHKRKIPLVI ANARLSARSA AGYAKLGKFV RRLLRRITLI AAQNEEDGAR FVALGAKNNQ VTVTGSLKFD ISVTPQLAAK AVTLRRQWAP HRPVWIATST HEGEESVVIA AHQALLQQFP NLLLILVPRH PERFPDAINL VRQAGLSYIT RSSGEVPSTS TQVVVGDTMG ELMLLYGIAD LAFVGGSLVE RGGHNPLEAA AHAIPVLMGP HTFNFKDICA RLEQASGLIT VTDATTLAKE VSSLLTDADY RSFYGRHAVE VLYQNQGALQ RLLQLLEPYL PPKTH
|
| |
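The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is a minimal illustration, not the pipeline that produced this record. It builds the codon table from the conventional TCAG ordering, which gives the same amino acid assignments for translation table 11 as for the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First seven codons and last six codons of the gene sequence above.
    print(translate("ATGCTCGAAT TGCTTTACAC C"))  # MLELLYT
    print(translate("CCACCGAAAA CGCATTGA"))      # PPKTH
```

The two fragments translate to the first seven and last five residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be checked in the same way.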