Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_00281 |
Symbol | thiL |
ID | 4776285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 32042 |
End bp | 33025 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640085527 |
Product | putative thiamine-monophosphate kinase |
Protein accession | YP_001016050 |
Protein GI | 124021743 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGAAA CCCTGGGCCA ACTGGGTGAA AACGAACTAC TGCATCGACT AGCCCGCTTT GCTCCTGCTG GCCAACTTGA CGACGACACA GCGCAAGTCC ACACCAATGA GGGAGAACTC CTCATTAACA CCGATGTCAT GGTGGAAGGC ATTCACTTCA GCGAAGACAC CACGACTCCA AAAGATGTGG GTTGGCGTTG CGTAACCGCC AACCTTTCCG ACCTAGCAGC CAGTGGCGTT GATCAGATTC TGGGTATCAC TGTCGGACTT GTAGTCCCTC CAGAGACCCC CTGGAACTGG GTAGAAGGCG TTTATTTAGG CATAGAAGCC GCATTGAAAC AGTTTGGGGG GACCTTGCTA GGCGGCGACT GCTCCCGAGG GGATCAACGA CTACTCGCAA TCACCGCCCT GGGCACTCTC GGGCCATTAC GGTTGCACCG CTCCCAAGCC CAACCTGGAG ATTCTCTTGT CGTCAGTGGT CCCCATGGCC TCAGCCGCCT TGGCCTGGCC CTACTACGCT CAGATCCACT GATAAAAGCA GACCTGCTGC CAGACAAACT CAAACAAAAA GCGATAGAAG CTCATCAACA CCCTCAACCA TGTTTGAAAG CGCTCCACGC TCTTCAAACA TGTAAGCCTG AAGAACTTCC ATGGCGAGCT GGGGGCACCG ACAGTAGTGA CGGACTTCTG GCAGCGGTTC AAGGTCTTTG CAGAAGCAGC GGTTGCCGGG CGATTCTTGA TCCAACAGGC CTACCCAAAG ATCCTGACTG GCCACTAGGT CAACACTGGG ATAGCTGGTG TCTAAACGGC GGAGAAGACT TTGAACTTAT TCTCAGCCTG CCACCTCAAT GGGCCACCGC TTGGCTGCAA GTTCTTCCAT CAAGCCAAGT CATTGGGGTC ATGGAAAAAG GCCCACCTCG GGTGGAATGG GCGCATGGCA GAGGAGAAGT AAGCAACTTC TCAAGCTTCA AACATTTCCA ATAA
|
Protein sequence | MGETLGQLGE NELLHRLARF APAGQLDDDT AQVHTNEGEL LINTDVMVEG IHFSEDTTTP KDVGWRCVTA NLSDLAASGV DQILGITVGL VVPPETPWNW VEGVYLGIEA ALKQFGGTLL GGDCSRGDQR LLAITALGTL GPLRLHRSQA QPGDSLVVSG PHGLSRLGLA LLRSDPLIKA DLLPDKLKQK AIEAHQHPQP CLKALHALQT CKPEELPWRA GGTDSSDGLL AAVQGLCRSS GCRAILDPTG LPKDPDWPLG QHWDSWCLNG GEDFELILSL PPQWATAWLQ VLPSSQVIGV MEKGPPRVEW AHGRGEVSNF SSFKHFQ
|
| |