Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_02471 |
Symbol | aroC |
ID | 5731734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 237674 |
End bp | 238765 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641284591 |
Product | chorismate synthase |
Protein accession | YP_001550132 |
Protein GI | 159902788 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAGCA GTTTTGGAGA TCTTTTTCGA ATAAGTACTT TTGGAGAGTC TCATGGAGGA GGGGTGGGTG TAATTTTGGA GGGTTGCCCT CCTAGGCTTG CAATAGATGT TGATGCAATT CAGGCAGAAC TGGATAGGAG GAGACCAGGA CAAAGCAAAA TTACAACCCC TAGAAATGAA GTTGACCAAG TAGAAATCTT GAGTGGGCTT GTTGACAACA AAACTTTAGG CACACCAATA TCGATGGTCG TTAGGAATAA AGATTTCCGA CCAAATGACT ATGGGGAAAT GCAAAATATT TTTAGGCCTT CCCATGCAGA TGGAACTTAT CATTTGAAAT ATGGAGTCCA AGCTGCTAGT GGTGGAGGGA GAGCTTCTGC TCGAGAAACG ATTGGTCGTG TAGCTGCTGG AGCAATTGCG AAACAATTAC TTAGAAAAGT CAATCAGACG GAAGTTTTGG CATGGGTGAA ACGAATTCAC ACTATTGAGG CTGATGTGGA TCCAAATTCT GTTCAGATTA AAGATATTGA ATCGAATATT GTTCGATGTC CTGATCCCAA AATTGCAAAA CTGATGGTTG AAAGAATTGA AGAAGTTAGT CGGGATGGAG ACTCTTGTGG TGGAGTTATT GAATGCATTG TGCGTAATCC TCCAGCAGGA TTGGGAATGC CTGTCTTTGA CAAATTAGAA GCTGATTTAG CCAAGGCATT GATGTCATTA CCTGCAAGCA AGGGCTTTGA AATTGGATCA GGATTTAGTG GCACTTTTCT GAAAGGAAGT GAACATAATG ACGCTTTTAT TCCTTCAGGT AAAGGCATTT TGAGAACAGC GACTAATAAT TCTGGAGGCA TACAAGGTGG GATTAGTAAT GGAGAGTTAA TTGTTTTAAG AGTTGCCTTT AAACCCACAG CCACAATTCG CAAAGATCAA AAGACTGTTG ATTCTGACGG GAAGGAAAGA ACATTGTCAG CCAAAGGAAG ACATGATCCA TGTGTTTTGC CAAGAGCAGT ACCTATGGTG GAATCTATGG TGGCATTAGT TTTGGCTGAT CATCTTTTAA GACAGCAAGG TCAATGCGGC CTTTGGCAGT AA
|
Protein sequence | MGSSFGDLFR ISTFGESHGG GVGVILEGCP PRLAIDVDAI QAELDRRRPG QSKITTPRNE VDQVEILSGL VDNKTLGTPI SMVVRNKDFR PNDYGEMQNI FRPSHADGTY HLKYGVQAAS GGGRASARET IGRVAAGAIA KQLLRKVNQT EVLAWVKRIH TIEADVDPNS VQIKDIESNI VRCPDPKIAK LMVERIEEVS RDGDSCGGVI ECIVRNPPAG LGMPVFDKLE ADLAKALMSL PASKGFEIGS GFSGTFLKGS EHNDAFIPSG KGILRTATNN SGGIQGGISN GELIVLRVAF KPTATIRKDQ KTVDSDGKER TLSAKGRHDP CVLPRAVPMV ESMVALVLAD HLLRQQGQCG LWQ
|
| |