Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9515_02561 |
Symbol | aroC |
ID | 4720392 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | + |
Start bp | 237312 |
End bp | 238406 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640079919 |
Product | chorismate synthase |
Protein accession | YP_001010572 |
Protein GI | 123965491 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.46541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAGCA TTTTTGGTAA AATTTTCCGG GTCAGTACTT TTGGAGAATC TCATGGAGGT GCAGTTGGAG TTATTCTTGA TGGATGTCCA CCAAAGTTAA AAATAAATAT TGATCTCATA CAAAATGAAT TAGATAGAAG GAGGCCTGGT CAGAGTAAAA TTACTACCCC AAGAAAGGAA GATGACAAAT TAGAGATATT AAGTGGTTTA AAGGAAGGAA TAACACTTGG GACTCCTATA GCCATGTTGG TTCGAAATAA AGACCAAAGA CCAGAAGACT ATAATAATCT TGAGCAAGTA TTTAGACCAT CTCATGCAGA TGGTACATAT CATCTCAAAT ATGGGATTCA AGCTGGTTCA GGAGGTGGTA GGGCTTCGGC TAGAGAAACT ATAGGAAGAG TTGCTGCAGG CGCTATTGCA AAACAATTAT TAAAAACCTT ATTCAATACT GAGATTCTTT CTTGGGTAAA ACGTATACAT GATATCGACT CTCAAGTTAA TAAAAATAAA CTTACTCTAA GTAAAATAGA TTCAAATATT GTTAGATGTC CTGATGAAAA GGTGGCTACA AAAATGATTC AAAGAATAAA AGAATTACAG CAAGAGGGTG ATTCTTGCGG AGGTGTTATT GAATGTTTAG TGAAAAATGT TCCCTCAGGA TTAGGGATGC CTGTTTTTGA TAAATTGGAG GCTGATTTAG CAAAGGCTCT GATGTCCTTG CCTGCGACTA AAGGTTTTGA AATAGGTTCA GGTTTCTTAG GAACTTATTT AAGAGGTAGT GAACATAATG ATTCATTTGT TGAGTCTGAT GACATCAATA AGCTTAAAAC AAAATCAAAT AATTCTGGAG GTATTCAAGG AGGTATAAGT AATGGAGAGA ATATTGAGAT GAAAATAGCT TTTAAACCGA CAGCAACTAT TGGAAAGGAA CAGAAAACTG TTAACTCTGA TGGTAAGGAA ATTGTAATGA AGGCAAAAGG TCGACATGAT CCATGTGTTT TACCAAGAGC AGTACCAATG GTTGATTCAA TGGTTGCACT TGTTTTAGCA GACCATTTGC TTCTTCATCA AGCGCAATGT TCAATTATTA AATAG
|
Protein sequence | MSSIFGKIFR VSTFGESHGG AVGVILDGCP PKLKINIDLI QNELDRRRPG QSKITTPRKE DDKLEILSGL KEGITLGTPI AMLVRNKDQR PEDYNNLEQV FRPSHADGTY HLKYGIQAGS GGGRASARET IGRVAAGAIA KQLLKTLFNT EILSWVKRIH DIDSQVNKNK LTLSKIDSNI VRCPDEKVAT KMIQRIKELQ QEGDSCGGVI ECLVKNVPSG LGMPVFDKLE ADLAKALMSL PATKGFEIGS GFLGTYLRGS EHNDSFVESD DINKLKTKSN NSGGIQGGIS NGENIEMKIA FKPTATIGKE QKTVNSDGKE IVMKAKGRHD PCVLPRAVPM VDSMVALVLA DHLLLHQAQC SIIK
|
| |