Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_19091 |
Symbol | aroG |
ID | 4718648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1646278 |
End bp | 1647345 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640079644 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001010299 |
Protein GI | 123969441 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.352834 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACAT CATCCAATAA TTCAGCTTTA GAAAAGACAT CAGATTTACA TGTTGTTGAA ACACGTCCAT TAATACCTCC AAGCAGATTA CATAATGATA TACCTTTAGA TCACGCCTCT GCTAATACAG TATCTAAAAC AAGAAGATCG ATACAAAATA TTTTGCATCA TAATGATAAG AAGCTTCTAG TAATCGTGGG CCCATGTTCA ATTCATGATC TTGAGGCGGC AAAGGAATAT TCAAAATATA TTCAAAAATT CCGAGAAATG TATAACGATA AATTAGAAAT AATTATGAGA GTATATTTTG AAAAACCAAG GACAACTATT GGTTGGAAGG GATTGATAAA TGATCCTCAT CTAGATGATT CTTATGATAT TAATACTGGT TTAAGAAGGG CAAGAAGTTT GCTTTCATAT TTAGCAACTC GAGGCATACC TTCTGCTACA GAATTACTAG ATCCAATTGT TCCTCAATAC ATTGCCGATT TAATAAGTTG GACAGCCATA GGTGCGCGGA CCACGGAAAG TCAAACTCAT AGAGAAATGG CATCAGGATT ATCAATGCCT ATAGGCTTTA AAAATGGAAC GGATGGTTCT TTTACTACTG CAATTAATGC AATGCAGTCA GCTTCAAAAT CCCATCACTT CTTAGGTGTA AATGAAAATG GAATGGCTTC TATAGTTAAT ACTACAGGAA ATCCAGATGG ACATATAGTT TTAAGGGGCG GTTCAAAAGG CCCAAATTTT GAAAGTGATC ATGTACAAAG AATTTCAGCA GAATTGAGGC AGTATAATCT TCCCCATAAA GTGATGATTG ATTGTAGTCA TGGAAATTCC AATAAAGATT TCCGAAAACA GTCAGAAGTG CTAAAAAATG TAGCTTCTCA AATTAGTAAT GGTGAAAAAA ATATTTTAGG AGTTATGCTT GAAAGTCATT TGAAGGAAGG AAATCAAAAA CTTTTAAAAA AAGAAGATCT CCAGTTTGGA AGAAGCATTA CAGATGCATG TATAGATATA GAAACAACAA AAGAATTAAT CGCTATTTTA TACGATTCAC TTAGCTAG
|
Protein sequence | MTTSSNNSAL EKTSDLHVVE TRPLIPPSRL HNDIPLDHAS ANTVSKTRRS IQNILHHNDK KLLVIVGPCS IHDLEAAKEY SKYIQKFREM YNDKLEIIMR VYFEKPRTTI GWKGLINDPH LDDSYDINTG LRRARSLLSY LATRGIPSAT ELLDPIVPQY IADLISWTAI GARTTESQTH REMASGLSMP IGFKNGTDGS FTTAINAMQS ASKSHHFLGV NENGMASIVN TTGNPDGHIV LRGGSKGPNF ESDHVQRISA ELRQYNLPHK VMIDCSHGNS NKDFRKQSEV LKNVASQISN GEKNILGVML ESHLKEGNQK LLKKEDLQFG RSITDACIDI ETTKELIAIL YDSLS
|
| |