Gene HS_0832 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0832 
SymbolthiL 
ID4240324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp909740 
End bp910747 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content38% 
IMG OID638104387 
Productthiamine-monophosphate kinase 
Protein accessionYP_719042 
Protein GI113460975 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000411831 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAATG GCGAATTTGA TGTTATTCAA CGCTATTTCA TCGGATCAAA ACGTCCGCAA 
CGTAAAGATG TGATTGTTTC TATCGGTGAT GATTGTGCTA TTACTGAACA TTATCAAAAT
CAGCGAATTG CTATTACGAC TGATACGATG GTAGAAAATG TCCATTTTCT GTCTTCTATT
AATCCTTCGG ATTTAGCATA TAAAGCTGTT GCAACAAATT TAAGCGATCT TGCTGCAATG
GGGGCGACTC CCGCATGGTT TTCTCTTGCA ATCACTTTAC CTTATGTTGA TGAAAAATGG
CTCAATGAAT TTAGTCAAAG TTTATTTGAT GTATTGGATC ATTATAATGT GTCGCTTATT
GGTGGAGATA CAACAAAAGG TCCCGTATAT ACAATAACTA TTACTGCACA AGGCATTGTA
CCTAAAGGGA AAGCACTTTG TCGCCACAGT GCACAAGATG GTGACTGGAT CTATGTTTCC
GGCACATTAG GAGACAGTGC GGCAGGATTG GAATTACTAT TAAAAAATAC AGGGGAAACT
TATAGTAAAA GTCATCAAAG TGCGGTTGAT TCTGCACAAG AATATTTAAT TCAACGCCAT
TTACGTCCAA CCCCCAGAGT TTTGCTTGGT TTGGAACTAG CTAGTGCTGA GTTGGCAAAT
GCTGCGATTG ATATTTCTGA TGGATTTATT GCAGATTTAG GACATATTTT ACAGCGTAGT
CAATGCGGTG CAGTGATCGA TTTAGATAAA TTACCTTTAT CTGAGCAATT AATCAAAACT
GTTGGTATAG AGCGAGCGGA GCAATTTGCT TTAACTGGCG GAGAAGATTA TGAGTTGTGC
TTTACAGTTC CTGATCGCAA TCTTGAAAAG TTAGAGCGAG CTTTAACCCA TATTGGTGTG
AATTATAGCT GTGTTGGACA AATTCGTAAT AAAAAACGTA TCAGTTTTCA ACGCAGCGGT
AAAACCGTTG AGATTGGTTC ATTACTCGGC TTTGATCATT TTAAATAA
 
Protein sequence
MSNGEFDVIQ RYFIGSKRPQ RKDVIVSIGD DCAITEHYQN QRIAITTDTM VENVHFLSSI 
NPSDLAYKAV ATNLSDLAAM GATPAWFSLA ITLPYVDEKW LNEFSQSLFD VLDHYNVSLI
GGDTTKGPVY TITITAQGIV PKGKALCRHS AQDGDWIYVS GTLGDSAAGL ELLLKNTGET
YSKSHQSAVD SAQEYLIQRH LRPTPRVLLG LELASAELAN AAIDISDGFI ADLGHILQRS
QCGAVIDLDK LPLSEQLIKT VGIERAEQFA LTGGEDYELC FTVPDRNLEK LERALTHIGV
NYSCVGQIRN KKRISFQRSG KTVEIGSLLG FDHFK