Gene Information |
Locus tag | P9301_14591 |
Symbol | thiE |
ID | 4911099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 1230114 |
End bp | 1231169 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640161051 |
Product | thiamine-phosphate pyrophosphorylase |
Protein accession | YP_001091683 |
Protein GI | 126696797 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0352] Thiamine monophosphate synthase |
TIGRFAM ID | [TIGR00693] thiamine-phosphate pyrophosphorylase |
|
|
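The coordinate and length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a stop codon. The function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 1230114
END_BP = 1231169


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1056 351
```

Under these assumptions the computed values agree with the Gene Length (1056 bp) and Protein Length (351 aa) fields.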
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAATT CTAATACTAC AAACACAGAA GATTTAAGAA TTTATCAAAT TATTGACGCT AATTTAGATA GAGCTAGAGA GGGACTTAGA GTACTAGAGG ATTGGGCTAG ATTTGGTCTA GGCAAAGAAA AATATGTTGA AAAGATTAAA AATTTTAGGC AAATTTTAGG AAAAAATCAT TTAGAAATTT ATAAACTATC TAGAAATCAC GTTGAGGACA AATGTAAAGG ACTGACACAT CAAGAGCAAA TCAACAGGAA AACTTCTGAG CAAATTATTA GTTCTAATTC AGCCCGAGTT CAAGAAGCAT TACGAGTCAT AGAAGAATTC TCAAGGCTTC AGAATCATGA GCTTTCAAAA ATCGCTTCCG AAATTAGATA TGAAATTTAT ACTATTGAAA TTGACTTATT GAGTTATAGC AAGTTTAAGA AGTCGGAGGA AATATTAAAA GAAAATGACT TATATGTAAT CACAGATCAA AAAGACAATT TATTAGAAAT AATAGAGGAA ATTTTAATTG CTGGAGTAAG AATTATTCAA CATAGATTTA AAACGGGAAC TGATCAAGAT CATCTTCAAG AAGCAATTCA GATTAAAAAT CTATGTAAAA GATATAATTC TCTTTTTATC GTTAACGATA GACTTGATAT AGCTCTAGCA TCTAACGCTG ATGGGATTCA TCTTGGTAAA GACGATTTAG ATTTTAAAAC CGCAAGGAGA CTATTAGGAT ATTCAAAAAT TATTGGTATA AGCGCAAATA ATGAAATTGA TATTTCTAAT GCTCTTAAAG AGGGTTGTGA TTACATAGGA ATAGGACCGG TATTTGAAAC TGCAACAAAG AAGGACAAAA AACCCATTGG TATTGAAAAA ATCAAAACAT TAACCAAAGA TTTAGATATT CCTTGGTTTG CCATCGGAGG AATTAAGTCA AATAATATTT CATATTTAAA AAGCAATGGG TTTAAGAAAG TTGCCTTAGT TTCGCAATTA ATGAATTCTG AAGATCCTAA AGAAGACGCT ATGATTATCC TAAAAAAGTT GTCTCATGAA AATTAG
|
Protein sequence | MLNSNTTNTE DLRIYQIIDA NLDRAREGLR VLEDWARFGL GKEKYVEKIK NFRQILGKNH LEIYKLSRNH VEDKCKGLTH QEQINRKTSE QIISSNSARV QEALRVIEEF SRLQNHELSK IASEIRYEIY TIEIDLLSYS KFKKSEEILK ENDLYVITDQ KDNLLEIIEE ILIAGVRIIQ HRFKTGTDQD HLQEAIQIKN LCKRYNSLFI VNDRLDIALA SNADGIHLGK DDLDFKTARR LLGYSKIIGI SANNEIDISN ALKEGCDYIG IGPVFETATK KDKKPIGIEK IKTLTKDLDI PWFAIGGIKS NNISYLKSNG FKKVALVSQL MNSEDPKEDA MIILKKLSHE N
|
| |
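The Sequence fields can be cross-checked against the Translation table and GC content fields. The following is a minimal sketch, not the database's own method: it builds the codon table by hand (translation table 11 uses the same codon-to-amino-acid assignments as the standard code) and runs on the first 30 bases of the gene sequence only. The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, third position varying fastest.
# Table 11 differs from the standard code only in its permitted start codons;
# this gene starts with ATG, so no special start handling is done here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the Gene sequence field.
GENE_PREFIX = "ATGCTGAATT CTAATACTAC AAACACAGAA"
print(translate(GENE_PREFIX))              # MLNSNTTNTE
print(round(gc_fraction(GENE_PREFIX), 2))  # 0.3
```

The translated prefix matches the first block of the Protein sequence field. Passing the full Gene sequence string to the same functions would allow the whole record to be compared against the Protein sequence and GC content fields.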