Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03036 |
Symbol | nanA |
ID | 8112700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 3233053 |
End bp | 3233946 |
Gene Length | 894 bp |
Protein Length | 297 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644849221 |
Product | hypothetical protein |
Protein accession | YP_003000794 |
Protein GI | 251786490 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | [TIGR00683] N-acetylneuraminate lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.84784 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACGA ATTTACGTGG CGTAATGGCT GCACTCCTGA CTCCTTTTGA TCAACAACAA GCACTGGATA AAGCGAGTCT GCGCCGCCTG GTTCAGTTCA ATATTCAGCA GGGCATCGAC GGTTTATACG TGGGTGGTTC GACCGGCGAG GCCTTTGTAC AAAGCCTTTC CGAGCGTGAA CAGGTACTGG AAATCGTCGC CGAAGAGGCG AAAGGTAAGA TTAAACTCAT CGCCCACGTC GGTTGCGTCA GCACCGCCGA AAGCCAACAA CTTGCGGCAT CGGCTAAACG TTATGGCTTC GATGCCGTCT CCGCCGTCAC GCCGTTCTAC TATCCTTTCA GCTTTGAAGA ACACTGCGAT CACTATCGGG CAATTATTGA TTCGGCGGAT GGTTTGCCGA TGGTGGTGTA CAACATTCCA GCCCTGAGTG GGGTAAAACT GACCCTGGAT CAGATCAACA CACTTGTTAC ATTGCCTGGC GTAGGTGCGC TGAAACAGAC CTCTGGCGAT CTCTATCAGA TGGAGCAGAT CCGTCGTGAA CATCCTGATC TTGTGCTCTA TAACGGTTAC GACGAAATCT TCGCCTCTGG TCTGCTGGCG GGCGCTGATG GTGGTATCGG TAGTACCTAC AACATCATGG GCTGGCGCTA TCAGGGGATC GTTAAGGCGC TGAAAGAAGG CGATATCCAG ACCGCGCAGA AACTGCAAAC TGAATGCAAT AAAGTCATTG ATTTACTGAT CAAAACGGGC GTATTCCGCG GCCTGAAAAC TGTCCTCCAT TATATGGATG TCGTTTCTGT GCCGCTGTGC CGCAAACCGT TTGGACCGGT AGATGAAAAA TATCTGCCAG AACTGAAGGC GCTGGCCCAG CAGTTGATGC AAGAGCGCGG GTGA
|
Protein sequence | MATNLRGVMA ALLTPFDQQQ ALDKASLRRL VQFNIQQGID GLYVGGSTGE AFVQSLSERE QVLEIVAEEA KGKIKLIAHV GCVSTAESQQ LAASAKRYGF DAVSAVTPFY YPFSFEEHCD HYRAIIDSAD GLPMVVYNIP ALSGVKLTLD QINTLVTLPG VGALKQTSGD LYQMEQIRRE HPDLVLYNGY DEIFASGLLA GADGGIGSTY NIMGWRYQGI VKALKEGDIQ TAQKLQTECN KVIDLLIKTG VFRGLKTVLH YMDVVSVPLC RKPFGPVDEK YLPELKALAQ QLMQERG
|
| |