Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9515_18901 |
Symbol | aroG |
ID | 4720540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | - |
Start bp | 1680567 |
End bp | 1681634 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640081591 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001012204 |
Protein GI | 123967123 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACTT CATCAAATAA TCAATCTTTA GAAAAAACAT CTGATTTGCA TGTTGTTGAA ACACGTCCAT TGATACCTCC AAGCAAACTT CATAATGATA TACCTTTAGA TTATACCTCT GCTGATACTG TCTCCAATAC GAGGAGATCG ATACAAAATA TTTTGCATAA TAATGATCCT AGGCTATTAG TCATTGTGGG ACCATGCTCA ATCCACGATA TTAAAGCTGC TAAAGAGTAT TCAGAATATA TTCAGGAATT TAGAAAAATC TACAATGATA AATTGGAAAT TGTAATGAGA GTATATTTTG AAAAACCGAG AACTACAATC GGATGGAAAG GATTGATAAA TGACCCCCAT TTAGATGGTT CCTACGATAT TAATACAGGT TTACGTAGAG CTAGAAGCTT GCTCTCCTAT CTTGCGACTA GAGGGATCCC TTCAGCTACT GAGTTGTTGG ACCCCATTGT CCCTCAATAT ATTGCTGATT TAATCAGCTG GACAGCCATT GGTGCAAGGA CAACTGAAAG TCAAACTCAT AGAGAAATGG CTTCAGGATT ATCTATGCCA ATTGGTTTTA AAAATGGTAC AGATGGTTCT TTCAGTACAG CTATTAATGC GATGCAGTCT GCATCAAAAT CTCATCACTT TTTAGGCGTT AATGATCATG GTTATGCTTC TATTGTAAAT ACGACTGGCA ATCCCGATGG GCATATAGTT TTAAGGGGTG GGTCTAAAGG AGTTAATTTT GAAAATCAAC ATGTAAAAGG CATATCTTCT GAATTAAAAG CCAGTAATCT TCCTCATAAG GTTATGATCG ATTGTAGTCA TGGTAATTCT AATAAAGACT TTAGGAAGCA ATCTGATGTT CTAGAAAACG TAGCAACTCA AATTAAGAAT GGTGAAAAAA ATATTTTAGG AATTATGCTT GAAAGTCATC TTAAGGAAGG TAATCAAAAA CTTTCAAATA ATAAAGATCT TGAATATGGG AGAAGTATTA CTGATGCTTG CATTAATATA GACAAGACAA AAAATTTGCT AGAGAGTTTA TATGATTCAA TTTCTTAA
|
Protein sequence | MTTSSNNQSL EKTSDLHVVE TRPLIPPSKL HNDIPLDYTS ADTVSNTRRS IQNILHNNDP RLLVIVGPCS IHDIKAAKEY SEYIQEFRKI YNDKLEIVMR VYFEKPRTTI GWKGLINDPH LDGSYDINTG LRRARSLLSY LATRGIPSAT ELLDPIVPQY IADLISWTAI GARTTESQTH REMASGLSMP IGFKNGTDGS FSTAINAMQS ASKSHHFLGV NDHGYASIVN TTGNPDGHIV LRGGSKGVNF ENQHVKGISS ELKASNLPHK VMIDCSHGNS NKDFRKQSDV LENVATQIKN GEKNILGIML ESHLKEGNQK LSNNKDLEYG RSITDACINI DKTKNLLESL YDSIS
|
| |