Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0832 |
Symbol | thiL |
ID | 4240324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 909740 |
End bp | 910747 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638104387 |
Product | thiamine-monophosphate kinase |
Protein accession | YP_719042 |
Protein GI | 113460975 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000411831 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAATG GCGAATTTGA TGTTATTCAA CGCTATTTCA TCGGATCAAA ACGTCCGCAA CGTAAAGATG TGATTGTTTC TATCGGTGAT GATTGTGCTA TTACTGAACA TTATCAAAAT CAGCGAATTG CTATTACGAC TGATACGATG GTAGAAAATG TCCATTTTCT GTCTTCTATT AATCCTTCGG ATTTAGCATA TAAAGCTGTT GCAACAAATT TAAGCGATCT TGCTGCAATG GGGGCGACTC CCGCATGGTT TTCTCTTGCA ATCACTTTAC CTTATGTTGA TGAAAAATGG CTCAATGAAT TTAGTCAAAG TTTATTTGAT GTATTGGATC ATTATAATGT GTCGCTTATT GGTGGAGATA CAACAAAAGG TCCCGTATAT ACAATAACTA TTACTGCACA AGGCATTGTA CCTAAAGGGA AAGCACTTTG TCGCCACAGT GCACAAGATG GTGACTGGAT CTATGTTTCC GGCACATTAG GAGACAGTGC GGCAGGATTG GAATTACTAT TAAAAAATAC AGGGGAAACT TATAGTAAAA GTCATCAAAG TGCGGTTGAT TCTGCACAAG AATATTTAAT TCAACGCCAT TTACGTCCAA CCCCCAGAGT TTTGCTTGGT TTGGAACTAG CTAGTGCTGA GTTGGCAAAT GCTGCGATTG ATATTTCTGA TGGATTTATT GCAGATTTAG GACATATTTT ACAGCGTAGT CAATGCGGTG CAGTGATCGA TTTAGATAAA TTACCTTTAT CTGAGCAATT AATCAAAACT GTTGGTATAG AGCGAGCGGA GCAATTTGCT TTAACTGGCG GAGAAGATTA TGAGTTGTGC TTTACAGTTC CTGATCGCAA TCTTGAAAAG TTAGAGCGAG CTTTAACCCA TATTGGTGTG AATTATAGCT GTGTTGGACA AATTCGTAAT AAAAAACGTA TCAGTTTTCA ACGCAGCGGT AAAACCGTTG AGATTGGTTC ATTACTCGGC TTTGATCATT TTAAATAA
|
Protein sequence | MSNGEFDVIQ RYFIGSKRPQ RKDVIVSIGD DCAITEHYQN QRIAITTDTM VENVHFLSSI NPSDLAYKAV ATNLSDLAAM GATPAWFSLA ITLPYVDEKW LNEFSQSLFD VLDHYNVSLI GGDTTKGPVY TITITAQGIV PKGKALCRHS AQDGDWIYVS GTLGDSAAGL ELLLKNTGET YSKSHQSAVD SAQEYLIQRH LRPTPRVLLG LELASAELAN AAIDISDGFI ADLGHILQRS QCGAVIDLDK LPLSEQLIKT VGIERAEQFA LTGGEDYELC FTVPDRNLEK LERALTHIGV NYSCVGQIRN KKRISFQRSG KTVEIGSLLG FDHFK
|
| |