Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_13441 |
Symbol | trpC |
ID | 5730197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1211830 |
End bp | 1212717 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641285715 |
Product | indole-3-glycerol-phosphate synthase |
Protein accession | YP_001551229 |
Protein GI | 159903885 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0134] Indole-3-glycerol phosphate synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.847237 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGATTC GTCGACGTCC ACCAAACCCT AAAGTTAAGG TTGCAAACCT TGAATATGCT ATTCCGCATG AAGATGGTGA ACCCAGAAAT ATTCTTGAAA AAATCTTGTG GGAGAAAGAT CGTGAAGTAA AAGTTTCAAG AGAGAGAGTT CCTCTTTCGG AATTAAAGGC TCAGATCAAT AATTTGCCTC AAACTAAAGA TTTCTTAGGG GCTTTGAGAC AATCCTCTAC TTCACCAGCT GTTATAGCAG AAATAAAGAA AGCAAGTCCT AGCAAGGGAG TGATTCGCGA AAACTTTGAT CCAATAGAAA TAGCACTTGC TTATAAGTTA GGAGGTGCAA CATGCTTATC AGTGTTGACA GATAAAAGTT TTTTTCAAGG AGGCTTTGAG GTACTTGTTC AAGTCAGAAA GACTGTTGAT TTACCATTGT TATGCAAGGA ATTCATTATT CAGCCTTACC AGATTTATCA AGCAAGAGTT GCTGGTGCTG ATGCGGTTTT ATTGATTGCA GCCATACTTT CTGATCAAGA TCTTCTTTAT CTGAGAAAAG TTGCAATTAG CCTTGGATTA ACAATATTGG TTGAAGTGCA TGACTCTAAT GAGTTGAAAA GAGTACTAGA TTTAGAAGGA TTTCCTCTGG TTGGAATTAA TAATCGCGAC CTTAAGACTT TCAATACTGA TTTAAGAACG ACAAAAGAAG TAGTAAAAGA GCATAAAAAG AGAATTTCTG AACAAGAGGT TCTATTGGTA AGTGAGTCTG GTTTATTTAA CTCTGCAGAT CTAGAAGAAG TTAGTTCCTA TGGAGCAAAG GCTGTTCTTG TTGGTGAGTC TTTGATGAGA CAACCTGATA TTGGATTGGC GTTAAAAAAC TTGCAAGGAT TTAAGTAG
|
Protein sequence | MEIRRRPPNP KVKVANLEYA IPHEDGEPRN ILEKILWEKD REVKVSRERV PLSELKAQIN NLPQTKDFLG ALRQSSTSPA VIAEIKKASP SKGVIRENFD PIEIALAYKL GGATCLSVLT DKSFFQGGFE VLVQVRKTVD LPLLCKEFII QPYQIYQARV AGADAVLLIA AILSDQDLLY LRKVAISLGL TILVEVHDSN ELKRVLDLEG FPLVGINNRD LKTFNTDLRT TKEVVKEHKK RISEQEVLLV SESGLFNSAD LEEVSSYGAK AVLVGESLMR QPDIGLALKN LQGFK
|
| |