Gene Information |
Locus tag | P9303_19771 |
Symbol | trpC |
ID | 4777578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1738168 |
End bp | 1739073 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640087487 |
Product | indole-3-glycerol-phosphate synthase |
Protein accession | YP_001017984 |
Protein GI | 124023677 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0134] Indole-3-glycerol phosphate synthase |
TIGRFAM ID | |
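The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the 906 bp listed) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1738168
END_BP = 1739073


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 906
print(protein_length(length_bp))  # 301
```

Both results match the Gene Length and Protein Length fields in the record.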
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0661236 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGATCC GCAGACGTCC GCCCAATCCA AGCGTCAAGG TGGCGCATTT GCAGTACGCC ATTCCCCACG CCGATGCTGA GCCGCGCCAC ATTCTTGAGA AGATTGTTTG GGAGAAGGAC CGTGAAGTTG AGACGGCAAG ACAGCGCGTA CCTCTAGAGA CGCTTAAATC TCAGATCGCA GACTTGCCCA TCCCTCGAGA TTTCATTGCT GCCTTGCGTC AGGCTTCGGT GGCTCCGGCA GTGATTGCTG AAGTGAAGAA GGCCAGCCCA AGTCAAGGTG TGATCCGTGC GGACTTTGAC CCTGTTTTGA TCGCCAATGC CTATGCGGAA GGTGGTGCTA GTTGCTTGTC TGTGCTGACC GATAAGAGCT TCTTCCAGGG TGGATTTGAG GTGCTTGTTG AGGTGCGGCA GACGGTTGGA CTACCACTGC TATGTAAGGA CTTCATCCTG ACGCCATATC AGCTTTATCA GGCGCGTGCT GCAGGTGCAG ATGCTGCCTT GTTGATTATG GCGATCCTCT CTGATCAGGA TCTGACGTAT TTGAGCAAGG TGGCCAACAG TCTTGGACTC ACTGTCTTAG TGGAAGTGCA TGATGCTGAG GAACTTGAAC GGGTTCTCAA CCTTGGTGGT TTCCCACTGA TTGGCATCAA TAATCGTGAT CTCACCACCT TTGAGACAGA CCTTGAAACC ACTGAAACAC TGAGCAAGCA ATTCGCTACT CGATTGAAGC AGCAGGGTGT CTTGTTGGTG AGTGAATCGG GCTTGTTTAA CCGTGCTGAT TTGGACAGGG TTCAGGCAGT AGGCGCTGAA GCTGTGTTGG TAGGAGAAGC CCTGATGCGT CAGTCGGATG TTTGTGCTGG CTTAAAGCAG CTGATGATTG GAGACGAAGG AACGAGCAAC AGCTGA
|
Protein sequence | MEIRRRPPNP SVKVAHLQYA IPHADAEPRH ILEKIVWEKD REVETARQRV PLETLKSQIA DLPIPRDFIA ALRQASVAPA VIAEVKKASP SQGVIRADFD PVLIANAYAE GGASCLSVLT DKSFFQGGFE VLVEVRQTVG LPLLCKDFIL TPYQLYQARA AGADAALLIM AILSDQDLTY LSKVANSLGL TVLVEVHDAE ELERVLNLGG FPLIGINNRD LTTFETDLET TETLSKQFAT RLKQQGVLLV SESGLFNRAD LDRVQAVGAE AVLVGEALMR QSDVCAGLKQ LMIGDEGTSN S
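The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is a minimal illustration, not the pipeline used by the database. It assumes the gene sequence is listed in coding-strand orientation (it begins with ATG and ends with TGA as shown) and relies on translation table 11 assigning the same amino acids to codons as the standard code. The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; last position varies fastest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence, with the record's spacing kept.
print(translate("ATGGAGATCC GCAGACGT"))  # MEIRRR
```

The output agrees with the first six residues of the protein sequence above. Passing the full gene sequence string in the same way should reproduce the whole 301-residue protein under these assumptions.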