Gene Information |
Locus tag | A9601_14961 |
Symbol | trpC |
ID | 4718217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1277266 |
End bp | 1278153 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 640079217 |
Product | indole-3-glycerol-phosphate synthase |
Protein accession | YP_001009886 |
Protein GI | 123969028 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0134] Indole-3-glycerol phosphate synthase |
TIGRFAM ID | |
|
|
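The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1277266, 1278153)
print(length, protein_length_aa(length))  # 888 295
```

Both results agree with the Gene Length (888 bp) and Protein Length (295 aa) fields.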
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.147831 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGATAA GACGCAGGCC ACCAAATCCA ACAGTGAGGG TAGAAAACTT AGAATATGCT GTACCTCATA GAGAAGCACA AGCAAAAAAC ATTCTGGAAG AAATTGTATG GCACAAGGAT ATTGAAATTA AGAATTTTAA AAAAATAGTC TCTTTAGAAG ATCTCATCAA AAAAATTGAA AATCTTCCTA CTCCCAAAGA TTTTTATAAA AATATCTTGG AGTCAAAAAT AAAACCAGGA GTTATTGCTG AAATAAAAAA AGCTAGTCCG AGTAAAGGAG TTATTAGAAA AGATTTTAAC CCTGAAAACA TAGCAATTTG TTATGAAGGA TTAGGTGCAT CATGTATCTC AGTACTTACC GATAAAAAGT TTTTTCAAGG TAGTTATGAA ATACTCGAAA CTGTAAGGAA ATCTACTAAT CTCCCTCTAC TCTGCAAAGA TTTTATTATT TCTGCTTATC AGATTTATAA AGCAAGGGTA TCTGGTGCTG ATGCAATATT ATTAATCGCT GCGATTTTAA GTGATGATGA TTTAATTTAT TTAAAGAAAA TAGCTGATAA TTTAAAGATG AGTGTTCTTG TTGAAGTCCA TAACTCTTAT GAATTAGAAA GGATTCTAAA GTTAAAATCT TTTAATTTGA TTGGAATAAA TAATAGGGAC TTAAAGACTT TTAAAACGGA TTTAAAAACA TCAAAAGAAT TGATGAATAC ATATGCAGAT ATATTTTTAA AACAAAATAT TATTCCCATA AGTGAATCTG GAATTAATTG TGCCGAAGAT TTAGAATCGC TTAGATCTAT TGGAATCATG GGAGTATTAA TTGGTGAAAC TTTTATGAGA GAAACTGATA TTGAACAATC GTTCAAGAAA TTATTTAACT CAATTTAA
|
Protein sequence | MEIRRRPPNP TVRVENLEYA VPHREAQAKN ILEEIVWHKD IEIKNFKKIV SLEDLIKKIE NLPTPKDFYK NILESKIKPG VIAEIKKASP SKGVIRKDFN PENIAICYEG LGASCISVLT DKKFFQGSYE ILETVRKSTN LPLLCKDFII SAYQIYKARV SGADAILLIA AILSDDDLIY LKKIADNLKM SVLVEVHNSY ELERILKLKS FNLIGINNRD LKTFKTDLKT SKELMNTYAD IFLKQNIIPI SESGINCAED LESLRSIGIM GVLIGETFMR ETDIEQSFKK LFNSI
|
| |
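The relationship between the two sequences above can be sketched in plain Python. This is a minimal illustration, not the database's own method. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the gene is on the minus strand), and it uses the codon assignments that translation table 11 shares with the standard genetic code. Table 11 differs from the standard code only in its permitted start codons, which does not matter here because this gene starts with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (same for tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from this record.
print(translate("ATGGAGATAA GACGCAGGCC ACCAAATCCA"))  # MEIRRRPPNP
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_fraction` on the same input gives a value that can be compared with the GC content field.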