Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_00231 |
Symbol | thiL |
ID | 4716705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 25105 |
End bp | 26091 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640077720 |
Product | putative thiamine-monophosphate kinase |
Protein accession | YP_001008418 |
Protein GI | 123967560 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.554976 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATAAAG AAATACTAGA AGATATTGGG GAAAAAGAAT TAATTAATAG GCTAGGAAAA TTTATGCCTA AAAATCAAGT TTCAGATGAT TGCGCTTTGA TTAAAACTAA AAATGAAAAT TTACTTGTTA ATACTGATTC TTTGGTAGAA AATGTTCATT TTAATACAAT TTCTATTTGT CCTCAAGACC TTGGTTGGAA AGCTGTTGTT AGCAACATCT CTGACTTATT ATCAAGTGGA AGCAAAAAAA CTATAGGTAT TACAATAAGC CTAATTTTAC CTGCTAAAAC TGAGTGGATT TGGATTGAAG AATTATACAA AGGGATAAAT AAAGCATTAA AAAAATATGG CGGGCTTATT CTAGGGGGAG ATTGTTCAAA AGGGAATGAA AAAATCATTT CAATTACAGC CTTTGGGATT CAAGGTGAAC TTGAATTACG AAGAAACGCA TGTAAAGCAG GAGATATTAT CTTAACGACA GGAATGCATG GCCTTAGCAA GCTAGGCTTT ATGATACAAA ATAAAATAAA TTTCGATAAT AATTTTTCTC TAAATGAAAG ATTAATCAGT AAGTCAATTA AACATTTTTG TCGCCCGAAA GTTTACCCAA TTTTTCTCAC AAATCTCATT AAAACTCGAT CCAATAAAAA AATAAGAAGA ATAGGGTGTA CTGATAGTAG CGACGGTCTT TTTCAATCTA TACAAGATTT AGCAATAGCT AGCAACTGTA AAGCGATCAT GAATTATGAA AAAATGCCCA AAGATAAGAA TTGGCCAAAA GGAGATAAAT GGGATGAATA TTATTTTTTT GGAGGTGAAG ATTACGAATT AGTATTCTCT TTACCCAAAA AATGGGCCAA TAATTTATGC AAACTCGATA AAAATATTTA CGAGATTGGT TATTTCGTTA ATGGTAAACC ATCAATAGAA TTTAAAGATA AAAATAAAAA TCATTTATTG AAAAATATAC CTTTCAAGCA CTTTTAA
|
Protein sequence | MHKEILEDIG EKELINRLGK FMPKNQVSDD CALIKTKNEN LLVNTDSLVE NVHFNTISIC PQDLGWKAVV SNISDLLSSG SKKTIGITIS LILPAKTEWI WIEELYKGIN KALKKYGGLI LGGDCSKGNE KIISITAFGI QGELELRRNA CKAGDIILTT GMHGLSKLGF MIQNKINFDN NFSLNERLIS KSIKHFCRPK VYPIFLTNLI KTRSNKKIRR IGCTDSSDGL FQSIQDLAIA SNCKAIMNYE KMPKDKNWPK GDKWDEYYFF GGEDYELVFS LPKKWANNLC KLDKNIYEIG YFVNGKPSIE FKDKNKNHLL KNIPFKHF
|
| |