Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_29971 |
Symbol | aroG |
ID | 4778538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2647903 |
End bp | 2648985 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640088521 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001018992 |
Protein GI | 124024685 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.906124 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTCTG GCGAAATGAC CACCACCTCC GACTTGCATG TGGTGGATAC GCGGCCTTTG GTGTCACCCG TCTTGCTTCA TCAGGAGCTG CCCCTCGATC TAGTGGCTCT CAAAACCGTT GCAGACACTC GTAGACGCAT CCAGGCAATT CTGCGTGGTG AGGATCCCCG CTTGCTGGTG ATTGTTGGTC CCTGCTCGGT TCATGACATT GCCTCTGCAA GGGACTATGC CCGTCAGTTG GAGCCATTGC GACAGCGATA TGCCGCCCAG TTGGAGGTGG TTTTGCGGGT CTATTTCGAA AAACCTCGCA CCACCGTTGG CTGGAAAGGT CTCATCAATG ATCCCCATCT CGATGGTTCC TATGACATCA ACACTGGCTT GAGGCGTGCG CGGTCGTTGT TGCTCGACCT TGCCCGCGAG GGTATGCCGA CGGCCACTGA ATTGCTGGAT CCGGTTGTTC CTCAATACAT CGCTGATTTG ATCAGTTGGA CGGCGATTGG AGCCAGAACC ACTGAGAGTC AGACCCATCG GGAGATGGCT TCTGGATTGT CAATGCCCGT TGGTTACAAA AACGGTACCG ATGGCAGTGC CAAGATTGCG ATCCATGCGA TGCAGGCAGC ATCTAGGCCG CATCATTTTC TAGGGATCAA TCGGCAGGGT CAGGCTTCGA TTGTGCATAC CACTGGAAAC CCTGATGGCC ATCTCGTGTT GCGGGGAGGC AATGGCTGCA CCAATTACCA TCCCGAAGCT GTGGAAGGGG TTGCAAAAGA ATTAGTGAAG GCTGGCTTGG CTGATCGGTT GATGGTGGAT TGCAGCCATG ACAATTCGAA TAAAGATTTT CGGCGACAGT CAGAGGTGCT GCAGGCTGTT GCTACTCAGG TACGCCAAGG ATCAACCCAC CTGATGGGTG TGATGTTGGA AAGTCATCTT GTCGAGGGCA ATCAGAAGTT GCCTGAAGAC CTCTCTACTC TTGTCTATGG TCAAAGCATT ACGGATGCTT GTATCGATAT AGAGACAACG GCAACTCTCC TTGAGGATTT GGCGGCTGCA GTGGCTTCAG TGACGTTGTC ACCAATAACT TGA
|
Protein sequence | MNSGEMTTTS DLHVVDTRPL VSPVLLHQEL PLDLVALKTV ADTRRRIQAI LRGEDPRLLV IVGPCSVHDI ASARDYARQL EPLRQRYAAQ LEVVLRVYFE KPRTTVGWKG LINDPHLDGS YDINTGLRRA RSLLLDLARE GMPTATELLD PVVPQYIADL ISWTAIGART TESQTHREMA SGLSMPVGYK NGTDGSAKIA IHAMQAASRP HHFLGINRQG QASIVHTTGN PDGHLVLRGG NGCTNYHPEA VEGVAKELVK AGLADRLMVD CSHDNSNKDF RRQSEVLQAV ATQVRQGSTH LMGVMLESHL VEGNQKLPED LSTLVYGQSI TDACIDIETT ATLLEDLAAA VASVTLSPIT
|
| |