Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_03041 |
Symbol | aroC |
ID | 4780216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 281855 |
End bp | 282940 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640083569 |
Product | chorismate synthase |
Protein accession | YP_001014133 |
Protein GI | 124025017 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.418255 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAGTA GTTTCGGGAA ACTTTTTACT ATCAGCACCT TTGGCGAATC TCACGGAGGA GGTGTTGGTG TAATTATTGA TGGATGTCCT CCAAGGTTGG AGCTCGACAT TAACGAAATA CAAAATGATC TCAATCGAAG GAGGCCAGGA CAAAGCAAAA TAACTACTCC AAGAAACGAA AGTGATGAAG TTGAAATTCT TAGTGGTCTT TTAGGTAATA AAACCTTAGG AACACCAATT GCCATGGTTG TAAGAAATAA AGATCATCGG CCTAAGGATT ATTCTGAAAT TAAAAAAACT TTTAGGCCAT CTCATGCTGA TGCTACATAT CAGAAAAAAT ACGGAATTCA GGCTTCAAGT GGCGGCGGGC GTGCATCAGC AAGAGAAACT ATAGGTAGAG TTGCTGCGGG TTCTGTTGCA AAGCAACTTC TAACTAAGTT TGCTAAAACG GAAATACTCG CTTGGGTAAA GAGAATTCAT GATATTGAGG CTGAGATTCA TCCGAGTGAA GTTACTTTTG ATGAGATTGA GAAAAATATT GTTCGATGTC CAAATCAGTC AGCTGCTGAT TTAATGATTC AGAGAGTGGA GGCTTTTGGT AAAGAAGGAG ACTCCTGTGG TGGAGTCATA GAATGTGTTG TTCGGAATCC GCCCATAGGA CTTGGTATGC CTGTTTTTGA TAAATTAGAA GCTGATTTAG CAAAGGCATT AATGTCTTTG CCTGCCACTA AAGGTTTTGA GGTGGGATCT GGTTTCGGAG GTACTTATTT GAAAGGCAGC GAACATAATG ATCCTTTTTT GCCATCCGAT TCCAATCAAT TGAAAACTGC CACTAACAAT TCAGGCGGAA TTCAAGGAGG TATCAGTAAT GGTGAGGATA TAGTACTAAG AGTAGGTTTT AAACCAACAG CAACTATTAG GAAAAGTCAA AAGACAATTG ATGAGGATGG TAATGCAATA ACTCTCAAGG CGACAGGAAG ACATGATCCT TGTGTTTTGC CAAGGGCAGT TCCAATGGTT GAAGCAATGG TTGCGCTAGT TTTAGCTGAT CATTTACTAA GGCAAAGAGG CCAATGTACT GACTAA
|
Protein sequence | MGSSFGKLFT ISTFGESHGG GVGVIIDGCP PRLELDINEI QNDLNRRRPG QSKITTPRNE SDEVEILSGL LGNKTLGTPI AMVVRNKDHR PKDYSEIKKT FRPSHADATY QKKYGIQASS GGGRASARET IGRVAAGSVA KQLLTKFAKT EILAWVKRIH DIEAEIHPSE VTFDEIEKNI VRCPNQSAAD LMIQRVEAFG KEGDSCGGVI ECVVRNPPIG LGMPVFDKLE ADLAKALMSL PATKGFEVGS GFGGTYLKGS EHNDPFLPSD SNQLKTATNN SGGIQGGISN GEDIVLRVGF KPTATIRKSQ KTIDEDGNAI TLKATGRHDP CVLPRAVPMV EAMVALVLAD HLLRQRGQCT D
|
| |