Gene A9601_00231 details

Gene Information

Locus tag: A9601_00231
Symbol: thiL
ID: 4716705
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 25105
End bp: 26091
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 30%
IMG OID: 640077720
Product: putative thiamine-monophosphate kinase
Protein accession: YP_001008418
Protein GI: 123967560
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0611] Thiamine monophosphate kinase
TIGRFAM ID: [TIGR01379] thiamine-monophosphate kinase
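
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that the coordinates are 1-based and inclusive and that the CDS length includes the stop codon; neither convention is stated in the record itself. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming an unspliced CDS that includes its stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(25105, 26091)
print(length)                     # 987
print(protein_length_aa(length))  # 328
```

Under these assumptions the computed values agree with the 987 bp and 328 aa reported in the record.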


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.554976
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATAAAG AAATACTAGA AGATATTGGG GAAAAAGAAT TAATTAATAG GCTAGGAAAA 
TTTATGCCTA AAAATCAAGT TTCAGATGAT TGCGCTTTGA TTAAAACTAA AAATGAAAAT
TTACTTGTTA ATACTGATTC TTTGGTAGAA AATGTTCATT TTAATACAAT TTCTATTTGT
CCTCAAGACC TTGGTTGGAA AGCTGTTGTT AGCAACATCT CTGACTTATT ATCAAGTGGA
AGCAAAAAAA CTATAGGTAT TACAATAAGC CTAATTTTAC CTGCTAAAAC TGAGTGGATT
TGGATTGAAG AATTATACAA AGGGATAAAT AAAGCATTAA AAAAATATGG CGGGCTTATT
CTAGGGGGAG ATTGTTCAAA AGGGAATGAA AAAATCATTT CAATTACAGC CTTTGGGATT
CAAGGTGAAC TTGAATTACG AAGAAACGCA TGTAAAGCAG GAGATATTAT CTTAACGACA
GGAATGCATG GCCTTAGCAA GCTAGGCTTT ATGATACAAA ATAAAATAAA TTTCGATAAT
AATTTTTCTC TAAATGAAAG ATTAATCAGT AAGTCAATTA AACATTTTTG TCGCCCGAAA
GTTTACCCAA TTTTTCTCAC AAATCTCATT AAAACTCGAT CCAATAAAAA AATAAGAAGA
ATAGGGTGTA CTGATAGTAG CGACGGTCTT TTTCAATCTA TACAAGATTT AGCAATAGCT
AGCAACTGTA AAGCGATCAT GAATTATGAA AAAATGCCCA AAGATAAGAA TTGGCCAAAA
GGAGATAAAT GGGATGAATA TTATTTTTTT GGAGGTGAAG ATTACGAATT AGTATTCTCT
TTACCCAAAA AATGGGCCAA TAATTTATGC AAACTCGATA AAAATATTTA CGAGATTGGT
TATTTCGTTA ATGGTAAACC ATCAATAGAA TTTAAAGATA AAAATAAAAA TCATTTATTG
AAAAATATAC CTTTCAAGCA CTTTTAA
 
Protein sequence
MHKEILEDIG EKELINRLGK FMPKNQVSDD CALIKTKNEN LLVNTDSLVE NVHFNTISIC 
PQDLGWKAVV SNISDLLSSG SKKTIGITIS LILPAKTEWI WIEELYKGIN KALKKYGGLI
LGGDCSKGNE KIISITAFGI QGELELRRNA CKAGDIILTT GMHGLSKLGF MIQNKINFDN
NFSLNERLIS KSIKHFCRPK VYPIFLTNLI KTRSNKKIRR IGCTDSSDGL FQSIQDLAIA
SNCKAIMNYE KMPKDKNWPK GDKWDEYYFF GGEDYELVFS LPKKWANNLC KLDKNIYEIG
YFVNGKPSIE FKDKNKNHLL KNIPFKHF
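
The relationship between the gene sequence, the protein sequence and the GC content field can be sketched as follows. This is an illustrative sketch, not the method the database used. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in the permitted start codons); the helper names `clean`, `translate` and `gc_fraction` are hypothetical. Only the first 30 bases of the gene sequence are used in the example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above.
prefix = "ATGCATAAAG AAATACTAGA AGATATTGGG"
print(translate(prefix))  # MHKEILEDIG
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein sequence and the reported GC content to be compared against values derived directly from the nucleotides.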