Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_18331 |
Symbol | aroG |
ID | 5731648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1661513 |
End bp | 1662586 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641286220 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001551718 |
Protein GI | 159904374 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCATTT CTCCAGATCT TAATTTGTTC GATAAAACTT CTGATCTTCA TATTGTTGAG ACTAGACCTT TGGTCTCACC TTCTTTGTTG CATCATGACT TACCGCTGGA TTTAAAAGCT GCAGAAATTG TTGCTCAAAC TCGTAAACGG ATTCAAGCAA TCCTTGCTGG GGATGATTCA CGTTTATTAG TGATAGTAGG ACCTTGCTCA GTTCATGATG TAAGTGCAGC TAAGGAATAC GCTAAGAAAC TTATTCCTTT AAGAGAACGT TTTTCAGATG CACTGGAAAT TGTGATGAGA GTTTATTTTG AAAAACCTAG AACAACTATT GGATGGAAAG GGCTTATAAA TGATCCGCAT TTAGATGGAT CATACGATAT CAATACAGGC TTAAGACGGG CAAGAGCCTT ACTTTTAGAT TTAGCCCGGT CTGGGATGCC AGCAGCAACA GAACTATTAG ATCCCATAGT GCCTCAATAT ATTGCAGATT TGATTAGTTG GACTGCTATT GGTGCAAGAA CTACAGAAAG TCAAACTCAT AGAGAAATGG CTTCAGGTTT ATCCATGCCT ATAGGTTATA AGAATGGGAC GGATGGAACT GTAGCTATTG CAATTAATGC AATGCAGGCA GCATCAAGGG CTCATCATTT TTTAGGCATT AATCACAAAG GTTTTGCTTC AATAATCAGC ACAACAGGTA ATCCTGATGG TCATCTTGTT TTGCGAGGTG GTAGTAGTGG CACTAATTAT CACGTTGAAT CTGTTTTGAA TGCAGCGAAA GAGCTTTCTA AGGCATCTTT AGGTGACAAA GTAATGATTG ATTGCAGTCA TGGCAACTCT AATAAAGATT TTCGGCAACA ATCACATGTT CTTCGAGAAG TATCTAAGCA GATTAAAGAA GGTTTCTCTC ATGTAATGGG AGTTATGCTT GAAAGTCATC TTGTAGAAGG TAATCAAAAG TTAGTAGCAG ATCTTTCACA ATTGAAGTAT GGCCAAAGTA TTACTGATGC ATGTATAGAT ATAAAGGCTA CGGAAAGTCT TTTAGAAGAG CTTGCAGTTT CTATGAGGGC ATAG
|
Protein sequence | MVISPDLNLF DKTSDLHIVE TRPLVSPSLL HHDLPLDLKA AEIVAQTRKR IQAILAGDDS RLLVIVGPCS VHDVSAAKEY AKKLIPLRER FSDALEIVMR VYFEKPRTTI GWKGLINDPH LDGSYDINTG LRRARALLLD LARSGMPAAT ELLDPIVPQY IADLISWTAI GARTTESQTH REMASGLSMP IGYKNGTDGT VAIAINAMQA ASRAHHFLGI NHKGFASIIS TTGNPDGHLV LRGGSSGTNY HVESVLNAAK ELSKASLGDK VMIDCSHGNS NKDFRQQSHV LREVSKQIKE GFSHVMGVML ESHLVEGNQK LVADLSQLKY GQSITDACID IKATESLLEE LAVSMRA
|
| |