Gene P9211_00241 details


Gene Information

Locus tag: P9211_00241
Symbol: thiL
ID: 5731846
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand
Start bp: 25876
End bp: 26886
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 36%
IMG OID: 641284366
Product: putative thiamine-monophosphate kinase
Protein accession: YP_001549909
Protein GI: 159902565
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0611] Thiamine monophosphate kinase
TIGRFAM ID: [TIGR01379] thiamine-monophosphate kinase
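
The coordinates and lengths listed above are mutually consistent if Start bp and End bp are read as 1-based, inclusive positions and the last codon of the gene is a stop codon. Both readings are assumptions, not statements made by the record. A minimal Python check under those assumptions (the function names are illustrative only):

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose final codon is a stop (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(25876, 26886)
print(length)                  # 1011
print(protein_length(length))  # 336
```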


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.208498
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.107796
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGAAT CTTCAGGATC AAGTGAAACG CTATTAGAAA TTGGTGAAAG AGAAATACTC 
AATCGCCTTA AAAAATATAT GGATGATGGG CAAATTGATA ATGACACAGC ATTAATAAAA
AATTGCAAAA AGGACTTAAT TATAAATACA GACTTAATGG TGGAAAAAGT TCATTTCAGT
TCGAGAACCA TGACTCCTGA AGATATTGGT TGGAAAGCAA TAACAAGTAA TTTCTCTGAT
CTTGCCTCTA GTGGATTAGA TAAAGTCTTA TCAGTAACCA TTGGCCTAAT AGCACCACCT
TCTACATCTT GGTCATGGGT AAATCGTTTA TACAAAGGAA TGACAAATGC ATTGGAGGTT
TATGGTGGCA AGTTAATAGG TGGTGATATA TCTAAAGGAA ATGAAAAAGT AATTTCGATT
ACCGCTATAG GTTCTCAAGG ACCATTAGAT CTGCATAGGT CTCATGCAAT CCCCGGAGAT
TGTCTTGTGA CAAGTGGTCC TCATGGACTA AGTCGCTTAG GCCTCGCATT ACTTCTTGAG
GATAAAGTCT TAGAAACTAA GCATGTAAGT GATGAATTAA AAGCAATTGC TATCGAAAGT
CACCAACATC CTTCAGCGCC AATAAAAGCT CTGAAGTCAC TAATCAATTG CAAGCCTACA
AAGATTCCTT GGAGAGCTGC AGGAACTGAT AGCAGTGATG GACTATTAGA AGCAATTAAA
AGTATTTGCA TAAGCAGTAA CTGCAAAGCA ATTATAAGAA CTAATAATCT TCCTACTCAT
AAAGATTGGC CTAAAGGGAA TCATTGGGAT AAATGGTGTC TTAATGGAGG TGAAGACTAT
GAGCTTGTAA TCAGTTTGCC GCTGGAATGG GCAAATGAAT GGATAAAAAT AATGCCATTG
AGTAAAATTA TTGGCGATGT AAAACAGGGT TCTCCCGAAA TTCTTTGGGA GAATGGAAAA
GAAATCGATT CTAAAAATTA TTTAGACTTT GAACATTTTC AAAATAAGTA A
 
Protein sequence
MPESSGSSET LLEIGEREIL NRLKKYMDDG QIDNDTALIK NCKKDLIINT DLMVEKVHFS 
SRTMTPEDIG WKAITSNFSD LASSGLDKVL SVTIGLIAPP STSWSWVNRL YKGMTNALEV
YGGKLIGGDI SKGNEKVISI TAIGSQGPLD LHRSHAIPGD CLVTSGPHGL SRLGLALLLE
DKVLETKHVS DELKAIAIES HQHPSAPIKA LKSLINCKPT KIPWRAAGTD SSDGLLEAIK
SICISSNCKA IIRTNNLPTH KDWPKGNHWD KWCLNGGEDY ELVISLPLEW ANEWIKIMPL
SKIIGDVKQG SPEILWENGK EIDSKNYLDF EHFQNK
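
The gene and protein sequences above can be cross-checked against each other, and against the listed GC content, with a short script. The following is a sketch only: it assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in their permitted start codons), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, not part of any tool referenced by this record.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence into 10-mers."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as listed in this record.
first_line = "ATGCCTGAAT CTTCAGGATC AAGTGAAACG CTATTAGAAA TTGGTGAAAG AGAAATACTC"
print(translate(first_line))  # MPESSGSSETLLEIGEREIL
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the result to be compared with the protein sequence and the GC content given in the record.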