Gene Information |
Locus tag | P9303_23791 |
Symbol | aroC |
ID | 4776524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2096588 |
End bp | 2097676 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640087899 |
Product | chorismate synthase |
Protein accession | YP_001018377 |
Protein GI | 124024070 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
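The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The Python sketch below assumes 1-based, inclusive coordinates and an unspliced CDS that ends in a single stop codon. The record does not state the coordinate convention, although its values are consistent with it. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, ASSUMING an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2096588, 2097676)
print(length, protein_length_aa(length))  # prints: 1089 362
```

Both results match the Gene Length (1089 bp) and Protein Length (362 aa) fields reported for P9303_23791.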
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0240204 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAGCA GCTTTGGGGA TCTATTCCGA ATCAGCACCT TCGGTGAATC TCATGGAGGG GGCGTGGGAG TCATCGTGGA AGGCTGCCCA CCAAGGCTTG AGCTTGACCT ACAGAAGATT CAGGCCGAGC TAGATCGACG CAAACCTGGC CAAAGCAAGA TCAGTACACC CCGCAAGGAA GAAGATCAAG TCGAAATCCT CAGCGGTCTA CTCAACAACA CAACCCTTGG AACACCAATC GCCATGGTGG TGCGCAACAA GGATCACAAA CCTGGCGACT ACAAGGAGAT GAATGTTGCA TTTCGGCCTT CCCATGCCGA TGCCACCTAT CAGGCGAAGT ACGGCATCCA AGCTCGAAGC GGTGGAGGGC GAGCCTCCGC AAGAGAGACG ATTGCGCGGG TAGCCGCTGG AGCGATTGCC AAGCAATTAC TGACCAAAGC CCATAACACC GAAGTACTGG CATGGGTCAA ACGCATTCAC ACCCTGGAGG CCGAAATCAA TGCCCAGGAC GTCAGCATTG ATGACGTCGA AGCAAACATC GTGCGTTGCC CGAACCAAGT CATGGCAGCG CAAATGGTGG AGCGTATTGA AGCCATCAGC CGTGAAGGCG ACTCATGCGG TGGTGTGATC GAGTGTGTTG TACGCAATGC CCCAATGGGT CTGGGGATGC CTGTGTTCGA CAAGCTTGAG GCAGACCTCG CCAAGGCCGT GATGTCACTA CCTGCCAGCA AGGGCTTTGA GATCGGCTCG GGGTTTGGCG GCACCCTGCT AAAAGGCAGC GAGCACAACG ACGCTTTCCT CCCCAGCAAT GATGGTCGCC TACGAACAGC CACCAACAAC TCTGGTGGCA TCCAGGGAGG GATCACTAAC GGTGAATCCA TCGTGATCCG AGTGGCGTTC AAGCCAACAG CCACCATCCG TAAAGATCAA CAAACAATTG ACGCTGATGG CAACACCACG ACACTGTCTG CCAAAGGTCG TCATGATCCC TGCGTCCTGC CTAGGGCCGT ACCAATAGTT GAAGCCATGG TGTCCCTTGT ACTCGCTGAT CACCTCTTAC GCCAACAAGG ACAGTGCAGT CTCTGGTAA
Protein sequence | MGSSFGDLFR ISTFGESHGG GVGVIVEGCP PRLELDLQKI QAELDRRKPG QSKISTPRKE EDQVEILSGL LNNTTLGTPI AMVVRNKDHK PGDYKEMNVA FRPSHADATY QAKYGIQARS GGGRASARET IARVAAGAIA KQLLTKAHNT EVLAWVKRIH TLEAEINAQD VSIDDVEANI VRCPNQVMAA QMVERIEAIS REGDSCGGVI ECVVRNAPMG LGMPVFDKLE ADLAKAVMSL PASKGFEIGS GFGGTLLKGS EHNDAFLPSN DGRLRTATNN SGGIQGGITN GESIVIRVAF KPTATIRKDQ QTIDADGNTT TLSAKGRHDP CVLPRAVPIV EAMVSLVLAD HLLRQQGQCS LW
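The gene and protein sequences above are shown in space-separated blocks of 10 characters. The Python sketch below is one possible way to strip that spacing and translate the nucleotide sequence so it can be compared with the listed protein. It is an illustration, not the database's own tooling, and the names `clean` and `translate` are hypothetical. The amino-acid assignments of translation table 11 are the same as the standard code. Table 11 differs in its permitted alternative start codons, which this sketch does not handle; this gene starts with ATG, so that does not matter here.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate("ATGGGCAGCA GCTTTGGGGA TCTATTCCGA"))  # prints: MGSSFGDLFR
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence extends the check to all 362 residues.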