Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_02451 |
Symbol | aroC |
ID | 4716929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 228016 |
End bp | 229113 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640077944 |
Product | chorismate synthase |
Protein accession | YP_001008640 |
Protein GI | 123967782 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.642724 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAGTA GTTTTGGAAA AATTTTTCGT GTTAGTACTT TTGGAGAATC ACATGGTGGT GCAGTAGGAG TTATCCTTGA TGGATGTCCC CCTAAGTTAA AAGTAGATAT AAATCTGATA CAAAATGAAT TAGATAGGCG AAGGCCTGGC CAAAGTGACA TTACAACGCC CAGAAATGAA GAAGATAAAA TTGAAATATT AAGTGGGATA AAGGAAGGGT TCACACTTGG AACTCCAATA GCGATGTTGG TAAGAAACAA GGATCAAAGA CCAGGAGACT ATGATAATTT GGAACAAGTA TTTAGGCCTT CTCATGCAGA TGGTACATAT CATCTGAAAT ATGGAATTCA GGCAAGTTCT GGCGGTGGAA GAGCCTCTGC TAGAGAAACA ATTGGGAGAG TAGCTGCTGG TGCTGTAGCA AAACAATTAT TAAAAACCTT CTGTAACACT GAAATACTAT CTTGGGTAAA GCGTATACAT GATATTGAGT CTGATATAAA TAAAGAGAAG ATTTCTCTCA AACAAATAGA TTCTAATATT GTTAGATGTC CAGATGAAAA GGTATCAACA GAAATGATCG AGAGAATTAA GGAATTAAAG CGTCAAGGAG ACTCTTGCGG CGGTGTTATT GAATGTCTAG TAAGGAATGT TCCCTCGGGT CTTGGAATGC CAGTTTTTGA TAAATTAGAA GCTGATTTAG CAAAGGCTTT GATGTCTTTG CCTGCCACGA AAGGCTTTGA AATAGGTTCA GGTTTCTCTG GAACTTATTT AAAAGGAAGC GAACATAATG ATGCATTCAT CAAGTCTGAT GATATTAGTA AGTTAAGAAC AACATCAAAC AATTCAGGAG GTATACAGGG CGGAATAAGT AATGGTGAAA ATATCGAGAT GAAGATAGCT TTTAAACCTA CAGCAACCAT CGGGAAAGAA CAGAAAACAG TAAATGCTGA AGGTAAAGAA GTACTTATGA AAGCAAAAGG GAGGCACGAT CCATGCGTTC TACCAAGAGC AGTTCCCATG GTTGATGCTA TGGTAGCTCT AGTACTTGCT GATCATTTGC TTCTAAATCA TGCTCAATGT GACTTAATAA ATAAGTAG
|
Protein sequence | MSSSFGKIFR VSTFGESHGG AVGVILDGCP PKLKVDINLI QNELDRRRPG QSDITTPRNE EDKIEILSGI KEGFTLGTPI AMLVRNKDQR PGDYDNLEQV FRPSHADGTY HLKYGIQASS GGGRASARET IGRVAAGAVA KQLLKTFCNT EILSWVKRIH DIESDINKEK ISLKQIDSNI VRCPDEKVST EMIERIKELK RQGDSCGGVI ECLVRNVPSG LGMPVFDKLE ADLAKALMSL PATKGFEIGS GFSGTYLKGS EHNDAFIKSD DISKLRTTSN NSGGIQGGIS NGENIEMKIA FKPTATIGKE QKTVNAEGKE VLMKAKGRHD PCVLPRAVPM VDAMVALVLA DHLLLNHAQC DLINK
|
| |