Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_02461 |
Symbol | aroC |
ID | 4911581 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 228980 |
End bp | 230077 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640159812 |
Product | chorismate synthase |
Protein accession | YP_001090470 |
Protein GI | 126695584 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAGTA GTTTTGGAAA AATTTTTCGT GTTAGTACTT TTGGAGAATC ACATGGTGGT GCAGTAGGAG TTATCCTTGA TGGATGTCCA CCTAAATTAA AAATAGATAT AAATCTGATA CAAAATGAAT TAGATAGGCG CAGACCTGGC CAAAGTGACA TTACAACACC CAGAAATGAA GAAGATAAAA TTGAAATATT AAGTGGAATA AAGGAAGGGT TAACACTTGG AACTCCAATA GCAATGTTGG TAAGAAATAA GGATCAAAGA CCAGCAGATT ATAATAATAT GGAGCAGGTA TTTAGACCTT CTCATGCAGA TGGTACATAT CATCTGAAAT ATGGAATTCA GGCAAGTTCT GGCGGTGGAA GAGCCTCTGC TAGAGAAACA ATTGGGAGAG TAGCTGCTGG TGCTGTAGCA AAACAATTAT TAAAAACCTT CTGTAACACT GAAATACTAT CTTGGGTAAA GCGTATACAT GATATTGATT CTGATATAAA TAAAGAGAAG ATTTCTCTCA AAAAAATAGA TTCAAATATT GTTAGATGTC CTGATGAAAA GGTATCAACA GAAATGATTG AGAGAATTAA GGAATTAAAG CGTCAAGGAG ACTCTTGCGG CGGTGTTATT GAATGTCTAG TAAGAAATGT TCCCTCTGGT CTTGGAATGC CTGTTTTTGA TAAATTAGAA GCTGATTTAG CAAAGGCTTT GATGTCTTTG CCTGCCACGA AAGGCTTTGA AATAGGTTCA GGTTTCTCTG GCACTTATTT AAAAGGAAGC GAGCATAATG ATTCGTTCAT TAAGTCTGAT GATTCTAGTA AATTAAGAAC AACATCAAAC AATTCAGGAG GTATACAGGG TGGAATAAGT AATGGTGAAA ATATTGAGAT GAAGATAGCT TTTAAACCTA CAGCAACTAT CGGGAAAGAA CAAAAAACAG TAAATGCTGA AGGGAAAGAA GTTTTGATGA AAGCAAAAGG GAGACATGAT CCATGCGTTT TACCAAGAGC AGTTCCCATG GTTGATGCTA TGGTTGCCTT AGTACTTGCT GATCATTTGC TTCTAAATCA TGCTCAATGT GACTTAATCA ATAACTAG
|
Protein sequence | MSSSFGKIFR VSTFGESHGG AVGVILDGCP PKLKIDINLI QNELDRRRPG QSDITTPRNE EDKIEILSGI KEGLTLGTPI AMLVRNKDQR PADYNNMEQV FRPSHADGTY HLKYGIQASS GGGRASARET IGRVAAGAVA KQLLKTFCNT EILSWVKRIH DIDSDINKEK ISLKKIDSNI VRCPDEKVST EMIERIKELK RQGDSCGGVI ECLVRNVPSG LGMPVFDKLE ADLAKALMSL PATKGFEIGS GFSGTYLKGS EHNDSFIKSD DSSKLRTTSN NSGGIQGGIS NGENIEMKIA FKPTATIGKE QKTVNAEGKE VLMKAKGRHD PCVLPRAVPM VDAMVALVLA DHLLLNHAQC DLINN
|
| |