Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_21801 |
Symbol | aroG |
ID | 4780300 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1837917 |
End bp | 1838987 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640085478 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001016000 |
Protein GI | 124026885 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.299148 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACCT CATCATATTT TTCAATGGTG GATAGCACCT CCGACCTTCA TGTTGTCGAG ACTCGTCCAT TAATGTCACC AGCATTAATT CATAGAGATT TGCCTTTAGA TAAGGCATCC TCTGGAGTTG TCTCTACTAC TCGAAACAAG ATTCAATCAA TTCTTCATGG TAATGACCCA AGAATTTTAG TGATTGTTGG ACCGTGTTCG ATTCATGATG TTGATGCTGC TATTGAATAT GCAAATCGTT TAGCGCCATT GAGGGAGAGA TATAGTCAAA AGCTTGAGAT TGTTATGCGT GTTTATTTTG AGAAACCACG CACAACTGTT GGTTGGAAAG GACTTATTAA TGATCCTCAT CTTGATAATT CTTACGATAT TAATACTGGT TTAAGAAAGG CAAGAGGTCT ATTACTTGAT TTAGCCAAAG CAGGAATGCC GGCTGCAACT GAATTACTGG ATCCCGTTGT TCCTCAATAT ATTGCTGATT TAATTAGTTG GACTGCTATT GGAGCAAGAA CGACAGAGAG CCAGACTCAT CGTGAAATGG CGTCTGGATT ATCAATGCCT GTTGGTTATA AGAATGGTAC TGACGGGACA GCAACGATAG CGATTAATGC AATGCAAGCG GCTTCAAAAC CTCATCATTT TTTAGGAATT AATCATGATG GTCATGCCTC AATAGTGAGT ACTACAGGTA ATCCAAATGG TCATCTTGTT TTAAGAGGTG GTAAGAATGG GACTAATTAC CATTTTGATG CAATTAAGTT AATTACAGAT GAGTTAGAAC AATTTAAGAT GCCTGGAAAA GTTATGGTTG ATTGTAGTCA TGGTAATTCC AATAAAGATT TTCGTAGACA ATCAGAAGTT TTAAGAGATG TAGCATCACA GATAAAAGGT GGATCAAAGA ATTTAATGGG CGTAATGATA GAAAGTCATC TTGTTGAGGG TAATCAGAAA TTAAATTTAG ATGTGTCAAC ACTCACCTAT GGGCAAAGTG TTACGGATGC ATGCATAAAC TTTTCTACAA CTGAAATTTT ATTAGAGGAA CTAGCTGAAT CGGTTAAATA A
|
Protein sequence | MSTSSYFSMV DSTSDLHVVE TRPLMSPALI HRDLPLDKAS SGVVSTTRNK IQSILHGNDP RILVIVGPCS IHDVDAAIEY ANRLAPLRER YSQKLEIVMR VYFEKPRTTV GWKGLINDPH LDNSYDINTG LRKARGLLLD LAKAGMPAAT ELLDPVVPQY IADLISWTAI GARTTESQTH REMASGLSMP VGYKNGTDGT ATIAINAMQA ASKPHHFLGI NHDGHASIVS TTGNPNGHLV LRGGKNGTNY HFDAIKLITD ELEQFKMPGK VMVDCSHGNS NKDFRRQSEV LRDVASQIKG GSKNLMGVMI ESHLVEGNQK LNLDVSTLTY GQSVTDACIN FSTTEILLEE LAESVK
|
| |