Gene HS_0021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0021 
Symbol 
ID4239529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp22388 
End bp23446 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content36% 
IMG OID638103552 
ProductZn-binding dehydrogenase 
Protein accessionYP_718227 
Protein GI113460170 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0554419 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGATTA ACTTTGTTAA AGAGGGAAAC ATTATGAGCA AATACGTAAG ATCAGTTTGT 
CTCGTGGAAC CTAAGAAGGT TGATATAAAG ACCGTGTTGT ATCCTAAAAA AGGTGAGTGT
GATGTTCTGA TCAAAGTAGA GAGTATAGGA ATCTGCGGAT CTGATATTGG TGCATTTAGA
GGTACTAATC CGCTGGTAAC TTACCCTAGA ATTTTAGGAC ATGAAATTGT TGGTACAGTC
ATTGAATCTG GTGTTGGTAT GCCAAAAAAT ATTAATATAG GTGATCGTGT AATTCTCGAA
CCTTACATTT ATTGTGGACA TTGCTATCCT TGTTCAATCA GTAGAACAAA TTGCTGTGAG
GCACTAAAAG TTTTGGGGGT ACATATTGAT GGAGCAATGC AAGAAATTGT TAGACATCCA
GCTCATATGC TTATTAAAGC ACCTGATATA CCGATACATG AACTGGCTTT AGCAGAACCT
TTAACTATTT CATTACATGC AATTCGTAGA ACCAAAGTAA AAGCCGGTGA ACACGTTGCT
ATCATCGGTG CTGGTGCGAT TGGACTGATG GCCGCATTAG TTGCCAAAGC TTATGGTGCG
ACACCAATTT TAATTGACAT TTTAGACAAG CGATTAGATT ACGCAAAATC TATTGGCATT
CCAAATATAA TTAACCCAGC AAAAGAAAAT GATCTTGAAG CTATTAAATC CATTACTAAC
GGAAGAATGG CTGAGGTTGT TATTGAAGCG TCAGGAGCAA ATATTGCTGT ACAAAATACG
CTTAAATATA CTTCTTTTGC AGGACGTATT GCTTTAACTG GATGGCCGAA AAACGAAACG
CCACTACCAA CTAATTTAAT TACCTTCAAA GAACTTAACA TTTATGGAGC AAGAACAAGC
AAGGGGGAAT TTGAAGAAGC ATTAAAACTT TTAGAATCGA GAAAGATTGA ACCGAAGAAT
ATTATTAGTA AGGTAATTAC ATTTGATGAA ATTCCTCACT ACATCGAAGA GCTTTCAGAA
AATCCTGATG ATTATTTAAA AATCATTGCT GTATTTTAA
 
Protein sequence
MSINFVKEGN IMSKYVRSVC LVEPKKVDIK TVLYPKKGEC DVLIKVESIG ICGSDIGAFR 
GTNPLVTYPR ILGHEIVGTV IESGVGMPKN INIGDRVILE PYIYCGHCYP CSISRTNCCE
ALKVLGVHID GAMQEIVRHP AHMLIKAPDI PIHELALAEP LTISLHAIRR TKVKAGEHVA
IIGAGAIGLM AALVAKAYGA TPILIDILDK RLDYAKSIGI PNIINPAKEN DLEAIKSITN
GRMAEVVIEA SGANIAVQNT LKYTSFAGRI ALTGWPKNET PLPTNLITFK ELNIYGARTS
KGEFEEALKL LESRKIEPKN IISKVITFDE IPHYIEELSE NPDDYLKIIA VF