Gene Information |
Locus tag | NATL1_21871 |
Symbol | aroE |
ID | 4779406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1848441 |
End bp | 1849376 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640085485 |
Product | shikimate 5-dehydrogenase |
Protein accession | YP_001016007 |
Protein GI | 124026892 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0169] Shikimate 5-dehydrogenase |
TIGRFAM ID | [TIGR00507] shikimate 5-dehydrogenase |
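
The coordinate and length fields above agree with each other, which a little arithmetic confirms. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 1848441, End bp 1849376.
length = gene_length_bp(1848441, 1849376)
print(length, protein_length_aa(length))  # prints "936 311"
```

The results match the Gene Length (936 bp) and Protein Length (311 aa) fields.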
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTTCCCG AACAGTCTCT TAGCAAATTT GAATCAACAT CATCAAAAAT GAGCACTATC ACTGGAAAAA CTAATCTTGT AGGGCTTCTT GGACAACCAG TAAATCATTC GCTGTCACCA ATTATGCACA ATGCCGCATA TGAAGAAATG GGACTTGACT GGTGTTATGT GGCGATGCCT TGCCACAGTC AAGATTTAGA GAAAGTCACA AAAGCATTAA GGTTTTTAGA CTTTAAAGGC TTGAATATAA CTATTCCCCA TAAGCAAGAG GTATTAAAAG CTTGCAACAA ATTAACTGAA ATTGCAAATG ATATCCAAGC AGTTAATACA CTTATACCTG AAAAAAATAA TCAATGGATA GGCGCAAACA CTGATGTACA AGGATTCTTG ACGCCATTAA AAGATCATAA TTTAAAAAAT AAAAGTGTAG TTGTAATAGG CTGCGGGGGC AGCGCTAGAG CGGTAGTAAT GGGATTAAGC AGTTTAAATA TTAAAAAAGT AACAATAATT AGCAGGAACG AAAAAACATT AAATATTTTT GTACAAAGCA CGAGTAATTT ATTATCAAAA AGAGATATAT CAATTGAAAG TATTAATAAT ACAAAATTAA ATGTTTTACC ATACATACAA GAAGCCGATT TAATTATTAA TACAACTCCA ATAGGGATGA ATGGCAGTCA AGCTAAACAA GAAAGTGTTC CTCTTGGTCA TGAAGTATGG GACTGTCTTT CTAACAAAAC TATTTTATAC GATTTGATTT ATACTCCTAG ACCAACAAAC TGGTTAAAAA TTGGTCAACA AAAAAATTGT TTTACCATTG ACGGATTGGA TATGCTTGTC GAACAAGGAG CTTTTTCAAT AAAACTTTGG ACTGGTTTTC ATGACGTACC GGTTCAGACA ATGAAATCAT CTGCAGAAAA ACATTTAATG GTTTAA
|
Protein sequence | MVPEQSLSKF ESTSSKMSTI TGKTNLVGLL GQPVNHSLSP IMHNAAYEEM GLDWCYVAMP CHSQDLEKVT KALRFLDFKG LNITIPHKQE VLKACNKLTE IANDIQAVNT LIPEKNNQWI GANTDVQGFL TPLKDHNLKN KSVVVIGCGG SARAVVMGLS SLNIKKVTII SRNEKTLNIF VQSTSNLLSK RDISIESINN TKLNVLPYIQ EADLIINTTP IGMNGSQAKQ ESVPLGHEVW DCLSNKTILY DLIYTPRPTN WLKIGQQKNC FTIDGLDMLV EQGAFSIKLW TGFHDVPVQT MKSSAEKHLM V
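
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG is one of the alternative initiation codons and is read as methionine at the start of a CDS. The sketch below is a minimal translator under that assumption. The function name, the start-codon set, and the whitespace handling are illustrative choices, not the database's own method.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same codon assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons assumed for table 11; each is read as Met in first position.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is translated as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from the record.
print(translate_cds("GTGGTTCCCG AACAGTCTCT TAGCAAATTT"))  # prints "MVPEQSLSKF"
```

The output matches the first block of the protein sequence. Passing the full gene sequence to the same function allows the whole record to be checked in the same way.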