Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4535 |
Symbol | |
ID | 5594050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4542662 |
End bp | 4543768 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640923631 |
Product | N-acetylneuraminic acid mutarotase |
Protein accession | YP_001461071 |
Protein GI | 157163753 |
COG category | [S] Function unknown |
COG ID | [COG3055] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03547] mutatrotase, YjhT family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.0287347 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAA CAATAACGGC GCTTGCTATC ATGATGGCTT CATTTGCCGC AAACGCGTCT GTATTACCTG AAACTCCTGT GCCATTTAAA AGTGGTACCG GAGCAATTGA TAACGACACT GTCTACATTG GTTTAGGTAG CGCAGGTACG GCATGGTACA AGCTGGATAC ACAGGCCAAA GATAAAAAAT GGACAGCGTT AGCTGCATTC CCTGGTGGAC CAAGAGATCA AGCAACCTCG GCATTTATTG ATGGCAATCT GTATGTGTTT GGCGGCATTG GCAAAAATAG CAAGGGCTTG ACTCAGGTAT TTAATGACGT ACACAAATAC AACCCCAAAA CCAATAGTTG GGTTAAATTG ATGTCGCATG CACCGATGGG CATGGCGGGT CATGTGACTT TTGTACACAA CGGCAAGGCT TATGTTACTG GCGGTGTTAA CCAGAATATC TTCAATGGCT ATTTTGAAGA TCTCAACGAG GCTGGAAAAG ATTCAGCAAC TATAGACAAG ATCAATGCCC ATTATTTTGA CAAAAAAGCA GAAGATTATT TCTTCAATAA GTTTCTGTTG TCTTTTGATC CCTCAACACA GCAATGGAGT TACGCTGGCG AATCTCCCTG GTACGGAACG GCTGGTGCGG CGGTTGTGAA TAAAGGTGAT AAAACCTGGC TTATTAATGG CGAAGCCAAA CCAGGATTGC GAACGGATGC CGTATTTGAA CTTGATTTCA CCGGTAATAA TTTAAAATGG AATAAGCTTG CTCCCGTCTC ATCACCAGAT GGCGTCGCTG GCGGTTTTGC GGGGATAAGC AATGATTCTC TTATATTTGC CGGAGGGGCC GGATTCAAAG GTTCACGAGA AAATTACCAG AACGGTAAGA ACTATGCGCA TGAAGGCCTA AAAAAATCAT ATAGCACTGA TATTCATCTT TGGCATAACG GGAAATGGGA TAAATCGGGT GAATTATCGC AAGGTCGGGC CTACGGAGTA TCATTGCCCT GGAATAATAG TCTATTGATT ATTGGCGGTG AAACTGCAGG CGGCAAAGCG GTGACGGATT CAGTTTTGAT CTCTGTGAAG GATAATAAAG TCACAGTACA AAATTAA
|
Protein sequence | MNKTITALAI MMASFAANAS VLPETPVPFK SGTGAIDNDT VYIGLGSAGT AWYKLDTQAK DKKWTALAAF PGGPRDQATS AFIDGNLYVF GGIGKNSKGL TQVFNDVHKY NPKTNSWVKL MSHAPMGMAG HVTFVHNGKA YVTGGVNQNI FNGYFEDLNE AGKDSATIDK INAHYFDKKA EDYFFNKFLL SFDPSTQQWS YAGESPWYGT AGAAVVNKGD KTWLINGEAK PGLRTDAVFE LDFTGNNLKW NKLAPVSSPD GVAGGFAGIS NDSLIFAGGA GFKGSRENYQ NGKNYAHEGL KKSYSTDIHL WHNGKWDKSG ELSQGRAYGV SLPWNNSLLI IGGETAGGKA VTDSVLISVK DNKVTVQN
|
| |