Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_06701 |
Symbol | aroA |
ID | 4781288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 616159 |
End bp | 617493 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640083946 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_001014495 |
Protein GI | 229000869 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.36733 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAAGCTC CCACTAAAGA TCAGTCTTTA AGAAACCTTC AAAAAGGAGG GGAGCTCTAT GGAAAAGTGA AAGTACCTGG AGACAAGTCA ATCTCACACC GTGCACTACT TTTTGGGGCT ATTGCTAAGG GGAAAACACT AATTGAGGGC CTTTTACCTG CTGAAGATCC ATTAAGTACT GCTGAATGCC TTAGGTCAAT GGGCGTAAAG ATTAGTCCAA TCAAGAAAGG AGACATTATT GAAATTGAAG GCGTTGGATT AAATGGCCTC CAGGAGCCAC AAGATATTTT GAACTGCGGA AATTCAGGAA CAACTATGAG ATTAATAATG GGATTATTAG CCGGTCAAAA AGATCATCAT TTCATCCTCA CAGGCGATAA ATCACTTAGA AATAGGCCGA TGAAAAGAGT AGGACAGCCG TTAAAAATGA TGGGGGCTAA AGTTTTCGGA AGATGCGGTG GAGACTTGGC TCCTCTATCG ATTATTGGGA ATAAATTAAG AGGTGCCGTA ATTGGTACAC CAGTAGCAAG TGCTCAGATA AAATCTGCAA TCCTACTAGC TGCTCTTAAT GCAGAAGGCT CAACGACGGT TATTGAACCC GCCAGATCAA GAGATCATAG CGAGAGAATG CTAAAAGCCT TCGGAGCTAA TCTAGAGGTT GGTGGAGAGA TGGGTAGACA TATAACTGTA TCCCCTGGTA AAGATCTAAA AGGTCAATCA ATTATTGTTC CTGGAGATAT TAGTTCCGCT GCATTTTGGC TCGTTGCAGG TAGCATCATA CCCGGATCAG AGTTGGTTGT AGAAAATGTT GGTCTAAATC CAACAAGGAC TGGAATACTT GACGTATTAG AAGCAATGGA AGCAAATATC AACGTAATAA ACAAAAGAGA TGTAGCCGGT GAACCTGTCG GAGATATTGA AGTTTTCTAC AAAGAAAACT TAAAACCATT TAAAATTGAC GATGAGATAA TGCCACGGCT TGTTGACGAG ATACCCATTT TATCCGTAGG AGCATGTTTT TGTAATGGTA TCAGTCAAAT AAAAGGAGCA AGTGAGCTAA GAGTTAAAGA AACTGATCGA TTAGCTGTAA TGGCAAGGCA ATTAAAAAGG ATGGGAGCCA GCGTAGATGA GCATCAAGAT GGTCTAACTA TCTATGGAGG AAAAAGCTTA GAAGGATGCG AACTTGATAG CGAGGATGAT CACCGTATAG CCATGAGTTT AGCTATTGCA TCAATAATGG CTAATTCTAA TTCGACATTA CGACGTAGTG AGGCTGCAGC AATTTCATAT CCTGATTTTT GGAGTGATCT TAAGAGACTT CAACAAAAAA ATTAG
|
Protein sequence | MKAPTKDQSL RNLQKGGELY GKVKVPGDKS ISHRALLFGA IAKGKTLIEG LLPAEDPLST AECLRSMGVK ISPIKKGDII EIEGVGLNGL QEPQDILNCG NSGTTMRLIM GLLAGQKDHH FILTGDKSLR NRPMKRVGQP LKMMGAKVFG RCGGDLAPLS IIGNKLRGAV IGTPVASAQI KSAILLAALN AEGSTTVIEP ARSRDHSERM LKAFGANLEV GGEMGRHITV SPGKDLKGQS IIVPGDISSA AFWLVAGSII PGSELVVENV GLNPTRTGIL DVLEAMEANI NVINKRDVAG EPVGDIEVFY KENLKPFKID DEIMPRLVDE IPILSVGACF CNGISQIKGA SELRVKETDR LAVMARQLKR MGASVDEHQD GLTIYGGKSL EGCELDSEDD HRIAMSLAIA SIMANSNSTL RRSEAAAISY PDFWSDLKRL QQKN
|
| |