Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_13191 |
Symbol | thiE |
ID | 5731714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1190190 |
End bp | 1191221 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641285690 |
Product | thiamine-phosphate pyrophosphorylase |
Protein accession | YP_001551204 |
Protein GI | 159903860 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0352] Thiamine monophosphate synthase |
TIGRFAM ID | [TIGR00693] thiamine-phosphate pyrophosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.422887 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGTCA TCCCTACTCC TGAAAAACAT GTTGCTCAGC TGATAGATGC AAACCTTGAT AGAGCAAGAG AAGGTTTAAG AGTCATTGAA GATTGGTGCA GATATGGCCT GCAAAGAAAA GACTTGATAA TTATTATTAA GGATTATCGT CATCAATTAG GTCGCCTCCA TCAAAAGACT TATAAGCAAG CTCGCTCAGC ACAGACTGAC CAAGGTTCAG GATTAACTCA CGAAGCTCAA AATGATCGTA TAAGTCCTCT ACAAATCGTG AGTGCTAATT GTGCAAGAGT TCAAGAAGCT CTTCGCGTGA TAGAAGAATT TGCCAGAAAT ATTGATCCAG AGCTTACAAA AGCAGCCTCA AAAATTCGTT ATGAGATTTA CGATCTAGAA ATCAATATTC AAGAAGCAAC TTCTGGGAAA AAACGTCAAA AGGAACTCTC AGCTTGCAAA TTATGCTTAA TAACGACCCC CCACCAAGAG CTTATAGCAA AGGTTTCTGC AGGATTAAAG GCAGGAGTAG GGATGGTGCA ATATCGCTGT AAAAAGGGTA AAGACATTGA CAAGTTTTCT GAAGCTGAAA AACTTGCTGT GATTTGCAAA GATTATGGTG CTCTCTTTAT AGTCAATGAC CGAATCGATA TAGCTCTAGC CGTAGATGCT GACGGTATTC ATATTGGTCA AGAAGACTTA CCACTAGATA TAGCAAGAAA GCTTATAGGT CCCGAAAAGT TAATAGGAGT TAGTTGCCAT TCTCTTGAAG AAGCTCAAAA GGCAGATAAA AATGGTTCTG ACTATATAGG CTTTGGTCCT ATTTTTCGAA CTACTTCAAA GCCAGAGGTT GCTCCACTTG GTCTTGAGTG CTTAAAGCAA ATATCAAATT CAATTAATCA ACCTTGTTTT GCAATTGGTG GAATCAACCA TCTCAACAGA TCTAAATTGT TGTCAACTGG AGTCTCTCGA ATAGCAGTCA TTGATGCCAT CATGAAAGCA GAAGACCCTT TTCAAGCAAG CAAGCAGCTA CTTGAGATTT GA
|
Protein sequence | MAVIPTPEKH VAQLIDANLD RAREGLRVIE DWCRYGLQRK DLIIIIKDYR HQLGRLHQKT YKQARSAQTD QGSGLTHEAQ NDRISPLQIV SANCARVQEA LRVIEEFARN IDPELTKAAS KIRYEIYDLE INIQEATSGK KRQKELSACK LCLITTPHQE LIAKVSAGLK AGVGMVQYRC KKGKDIDKFS EAEKLAVICK DYGALFIVND RIDIALAVDA DGIHIGQEDL PLDIARKLIG PEKLIGVSCH SLEEAQKADK NGSDYIGFGP IFRTTSKPEV APLGLECLKQ ISNSINQPCF AIGGINHLNR SKLLSTGVSR IAVIDAIMKA EDPFQASKQL LEI
|
| |