Gene HS_0049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0049 
Symbol 
ID4239557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp53491 
End bp55044 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content40% 
IMG OID638103580 
Productautoinducer-2 (AI-2) kinase 
Protein accessionYP_718255 
Protein GI113460198 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1070] Sugar (pentulose and hexulose) kinases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATTAG ATGCCGGCAC AGGTAGTATT CGTGCAGTTA TTTTTGATCT TGAAGGAAAT 
CAAATCGCTA CGTCACAAAA AGAATGGACA CATATTTCCG ACCCAAATAT CCCAGGCTCA
ATGGGCTTTG ATTTACAAAA TAACTGGCAC CTTGCTTGCT TATGTATTCA AGAAGTTTTA
GCTACCAGCC AAATAGATGC TAAACAAATT ATAGCTATAT CGACTTGCTC TATGCGTGAA
GGCATTGTTT TATACGATGC CAATAAAAAC CCGATTTGGG CGTGCGGTAA TGTGGATGCC
AGATCTGTAG AAGAAGTTAT TCAATTAAAG TCTCTAAACC AGTATCAGTT TGAACAACAA
GTTTATCAAT CCTCCGGTCA AACATTGGCA TTAAGTGCAT TACCCCGTTT ACTTTGGCTT
GCACATCATC AACCTAATCT TTATGCTCAA GTCCATTTTC TCTCTATGAT TAGCGATTGG
TTAGGATTTA TGCTTAGCGG AGAACTGGCC GTCGAACCTT CAAATGCTGG CACAACCGGC
ATTCTCAACC TAAAAACCCG AAAATGGGAG CACACTTTAC TAGAGATGGC TGGACTCAAT
CCTGCTATTT TACCGAAAGT AAAAGAGACA GGTGAAATAC TCGGTCAAGT AACCGCCCAT
TCTGCACAGC AAACTGGGTT AATAGTCGGC ACGCCTGTTG TTGTTGGTGG TGGTGATGTG
CAATTAGGTT GTATCGGACT AGGGATTACA GAACCGGGGC AAGCTGCTAT TATTGGAGGA
ACTTTCTGGC AACAGGTCGT AAACTTACCA CAAGCAATGA CAGATCCTAA AATGAATATA
CGCATCAATC CGCATGTCAT TGCACCGATG GTACAAGCAG AATCTATCAG CTTCTTTACA
GGACTTACTA TGCGTTGGTT TAGAGATGCT TTCTGTGAAG AAGAAAAAGC CGTCGCTCAT
CGCTTAGGTG TTGATGCTTA CACATTACTG GAACAAATGG CAGAAAAGAT ACCCGTAGGT
TCAAATGATG TTATTCCTGT ATTCTCTGAT GCTATGCATT TCAAATCTTG GTATCACGCA
GCCCCATCAT TTATTAACCT TTCGATTGAT CCTGAAAAAT GTAACAAATC AGTCCTGTTT
AGGGCATTAC AAGAAAATGC AGCAATTGTA TCTTCATGTA ATCTTGATCA AGTCCAGCAA
TTCAGCCACG TTAATCTTAC CAGTATTGTT TTTGCCGGAG GTGGTGCAAA AGGGAAATTA
TGGAGCCAAA TTCTAGCTGA TGTAACAGGA TTGGTTGTTA ATGTACCTGT AGTAAAAGAA
GCAACTGCTC TAGGATGTGC CATTGCAGCT GGAGTAGGTG CTGGTATTTA TACTTCATTA
CATGAAGCAG GTAAAACATT AGTAAAATTT GAAAGACAAC ATCAACCAAA TGCAAGAAAT
CATAATTTAT ATCAAATACA TAAAGAAAAA TGGCAAGAAA TATACCAGCA GCAATTGAAA
TTGGTTGACA GAGGACTAAC CATTTCGCTT TGGAAAGCTC CTGGGATTAA ATAA
 
Protein sequence
MALDAGTGSI RAVIFDLEGN QIATSQKEWT HISDPNIPGS MGFDLQNNWH LACLCIQEVL 
ATSQIDAKQI IAISTCSMRE GIVLYDANKN PIWACGNVDA RSVEEVIQLK SLNQYQFEQQ
VYQSSGQTLA LSALPRLLWL AHHQPNLYAQ VHFLSMISDW LGFMLSGELA VEPSNAGTTG
ILNLKTRKWE HTLLEMAGLN PAILPKVKET GEILGQVTAH SAQQTGLIVG TPVVVGGGDV
QLGCIGLGIT EPGQAAIIGG TFWQQVVNLP QAMTDPKMNI RINPHVIAPM VQAESISFFT
GLTMRWFRDA FCEEEKAVAH RLGVDAYTLL EQMAEKIPVG SNDVIPVFSD AMHFKSWYHA
APSFINLSID PEKCNKSVLF RALQENAAIV SSCNLDQVQQ FSHVNLTSIV FAGGGAKGKL
WSQILADVTG LVVNVPVVKE ATALGCAIAA GVGAGIYTSL HEAGKTLVKF ERQHQPNARN
HNLYQIHKEK WQEIYQQQLK LVDRGLTISL WKAPGIK