Gene Information |
Locus tag | EcSMS35_3520 |
Symbol | nanA |
ID | 6142606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3599608 |
End bp | 3600501 |
Gene Length | 894 bp |
Protein Length | 297 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641618349 |
Product | N-acetylneuraminate lyase |
Protein accession | YP_001745496 |
Protein GI | 170680563 |
COG category | [E] Amino acid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | [TIGR00683] N-acetylneuraminate lyase |
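
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: the helper name `cds_lengths` is hypothetical, and it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes the stop codon (the gene sequence in this record ends in TGA).

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a gene span that
    includes the terminal stop codon (hypothetical helper).
    """
    gene_len = end_bp - start_bp + 1  # inclusive span on the replicon
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # last codon is the stop, not an amino acid
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(3599608, 3600501)
print(gene_len, protein_len)  # 894 297
```

The result matches the Gene Length (894 bp) and Protein Length (297 aa) fields. Because the Strand field is "-", the coding sequence corresponds to the reverse complement of this span on the replicon; the span arithmetic itself is unaffected.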
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0303093 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACGA ATTTACGTGG CGTAATGGCC GCACTCCTGA CTCCTTTTGA TCAACAACAA GCACTGGATA AAGCGAGTCT GCGTCGCCTG GTTCAGTTCA ATATTCAGCA GGGCATCGAC GGTTTATACG TGGGTGGTTC TACCGGCGAA GCCTTTGTAC AAAGCCTTTC CGAGCGTGAA CAGGTACTGG AAATCGTCGC CGAAGAGGCG AAAGGCAAGA TTAAACTCAT CGCTCACGTA GGTTGCGTCA GCACCGCCGA AAGCCAACAA CTTGCGGCAT CGGCTAAACG TTATGGCTTC GATGCCGTCT CCGCCGTCAC ACCGTTCTAC TATCCTTTCA GCTTTGAAGA ACACTGCGAT CACTATCGGG CAATTATTGA TTCAGCGGAT GGTTTGCCGA TGGTGGTGTA CAACATTCCG GCGTTAAGTG GCGTTAAACT GAGCCTGGAT CAGATCAACA CTCTGGTTAC GTTGCCAGGT GTTGGCGCGC TGAAACAGAC CTCTGGCGAT CTCTATCAGA TGGAGCAGAT CCGTCGTGAA CATCCGGATC TGGTGCTCTA TAACGGTTAC GACGAAATCT TCGCCTCTGG TCTTCTGGCG GGCGCTGATG GTGGTATCGG TAGTACCTAC AACATCATGG GCTGGCGTTA TCAGGGCATT GTTAAGGCGC TGAAAGAAGG CGATATCCAG ACCGCGCAGA AGCTGCAAAC CGAATGTAAT AAAGTCATTG ATTTACTGAT CAAAACGGGC GTATTCCGCG GCCTGAAAAC GGTCCTCCAT TATATGGATG TCGTTTCTGT GCCGCTGTGC CGCAAACCGT TTGGTCCGGT AGATGAAAAA TATCTGCCAG AACTGAAGGC GCTGGCCCAG CAGTTGATGC AAGAGCGCGG GTGA
|
Protein sequence | MATNLRGVMA ALLTPFDQQQ ALDKASLRRL VQFNIQQGID GLYVGGSTGE AFVQSLSERE QVLEIVAEEA KGKIKLIAHV GCVSTAESQQ LAASAKRYGF DAVSAVTPFY YPFSFEEHCD HYRAIIDSAD GLPMVVYNIP ALSGVKLSLD QINTLVTLPG VGALKQTSGD LYQMEQIRRE HPDLVLYNGY DEIFASGLLA GADGGIGSTY NIMGWRYQGI VKALKEGDIQ TAQKLQTECN KVIDLLIKTG VFRGLKTVLH YMDVVSVPLC RKPFGPVDEK YLPELKALAQ QLMQERG