Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_00241 |
Symbol | thiL |
ID | 5731846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 25876 |
End bp | 26886 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641284366 |
Product | putative thiamine-monophosphate kinase |
Protein accession | YP_001549909 |
Protein GI | 159902565 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.208498 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.107796 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAAT CTTCAGGATC AAGTGAAACG CTATTAGAAA TTGGTGAAAG AGAAATACTC AATCGCCTTA AAAAATATAT GGATGATGGG CAAATTGATA ATGACACAGC ATTAATAAAA AATTGCAAAA AGGACTTAAT TATAAATACA GACTTAATGG TGGAAAAAGT TCATTTCAGT TCGAGAACCA TGACTCCTGA AGATATTGGT TGGAAAGCAA TAACAAGTAA TTTCTCTGAT CTTGCCTCTA GTGGATTAGA TAAAGTCTTA TCAGTAACCA TTGGCCTAAT AGCACCACCT TCTACATCTT GGTCATGGGT AAATCGTTTA TACAAAGGAA TGACAAATGC ATTGGAGGTT TATGGTGGCA AGTTAATAGG TGGTGATATA TCTAAAGGAA ATGAAAAAGT AATTTCGATT ACCGCTATAG GTTCTCAAGG ACCATTAGAT CTGCATAGGT CTCATGCAAT CCCCGGAGAT TGTCTTGTGA CAAGTGGTCC TCATGGACTA AGTCGCTTAG GCCTCGCATT ACTTCTTGAG GATAAAGTCT TAGAAACTAA GCATGTAAGT GATGAATTAA AAGCAATTGC TATCGAAAGT CACCAACATC CTTCAGCGCC AATAAAAGCT CTGAAGTCAC TAATCAATTG CAAGCCTACA AAGATTCCTT GGAGAGCTGC AGGAACTGAT AGCAGTGATG GACTATTAGA AGCAATTAAA AGTATTTGCA TAAGCAGTAA CTGCAAAGCA ATTATAAGAA CTAATAATCT TCCTACTCAT AAAGATTGGC CTAAAGGGAA TCATTGGGAT AAATGGTGTC TTAATGGAGG TGAAGACTAT GAGCTTGTAA TCAGTTTGCC GCTGGAATGG GCAAATGAAT GGATAAAAAT AATGCCATTG AGTAAAATTA TTGGCGATGT AAAACAGGGT TCTCCCGAAA TTCTTTGGGA GAATGGAAAA GAAATCGATT CTAAAAATTA TTTAGACTTT GAACATTTTC AAAATAAGTA A
|
Protein sequence | MPESSGSSET LLEIGEREIL NRLKKYMDDG QIDNDTALIK NCKKDLIINT DLMVEKVHFS SRTMTPEDIG WKAITSNFSD LASSGLDKVL SVTIGLIAPP STSWSWVNRL YKGMTNALEV YGGKLIGGDI SKGNEKVISI TAIGSQGPLD LHRSHAIPGD CLVTSGPHGL SRLGLALLLE DKVLETKHVS DELKAIAIES HQHPSAPIKA LKSLINCKPT KIPWRAAGTD SSDGLLEAIK SICISSNCKA IIRTNNLPTH KDWPKGNHWD KWCLNGGEDY ELVISLPLEW ANEWIKIMPL SKIIGDVKQG SPEILWENGK EIDSKNYLDF EHFQNK
|
| |