Gene HS_0034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0034 
Symbol 
ID4239542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp35662 
End bp37254 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content36% 
IMG OID638103565 
Productsugar kinase 
Protein accessionYP_718240 
Protein GI113460183 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1070] Sugar (pentulose and hexulose) kinases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAATG AAAAACAGCT TATTGAACAA GGTAATATCG TTATTGGTAT TGAATTAGGT 
TCAACCCGAA TTAAGGCAGT ATTAATTACC TCAGATGGTA CTATTTTAGC AACGGGTGGA
GCAGATTGGG AAAATAGGTT AATTGAAGGA GTTTGGACAT ACCATCAGCA TGAGATATGG
GAAAAACTAC AAGCGGCGTA TGTAGATTTG AGTAAGACTG TAAAAGCAAA ATATCACACC
ACAATTCGTA CTGCAAAAGC ACTGGGAATT AGTGCAATGA TGCACGGTTA TTTGCCTTTT
GATAAACAGG GAAATCAGCT GGTGCCTTTT AGAACATGGC GAAACAATAT AACTTTAAAA
TCATCGGAAA AATTAACCGC ACTTTTTCAA TATCCAATTC CTCAACGTTG GAGTATTGCC
CATTTATACC AAGCATTACT CAATAAAGAA CCGCATTTAT CTGAAATTGA TTATATTACA
ACTCTCTCAG GTTATGTGCA TTGGCAACTA ACCGAGAACA AAGTTTTAGG AATTGGAGAC
GCTTCTGGTA TGTTCCCGAT AGATACTGAG ATGCAATCAT ACAATAAAAC TATGCTGACT
CAATTTGAGC AGGCTATTAT TGAATATCAG ATGCCTTGGA GATTAGATGC CATTTTGCCA
AAAGTATTAG TAGCTGGAGA ACATGCGGGT AAATTAACTT TTAAAGGGGC TAAATTACTT
GATCCAACAG GTAACTTACA GGCTGGTATT CCACTTTGTC CGCCAGAGGG TGATGCAGGC
ACAGGAATGA TTGCAACCAA TTGTATCAAA GAAAAAACAG GTAATATATC TGCTGGTACA
TCAGCGTTTG CTATGATTGT TTTGGAAAAA GCCTTATCGA AAGTCTATTC AGAACTGGAT
ATTGTAACTA CTCCGGCAGG TAAACTAGTC GCAATGGCAC ATGCAAATAA CTGTACTTCC
GATATTAATG CTTGGATAAA TTTATTCGGT GAATGTTTAG CTACTTTTGG TGTTTATGTG
TGTATGGAAG AGTTATATGA AATGCTTTTT CTGCATGCTT TGCAAGGTGA GCCGGATTGT
GGTGGATTAC TATCTTACGG TTTTTATTCT GGCGAACATA ATGTTGGTTT ATCGGAAGGT
TGTCCCGTTT TTTTGCATCC AACCAAAGCT CGTTTTAATT TGGCTAACTT TATGCGTACG
CATTTTTATA CTGCTTTTGG TGCAATGAAA TTAGGAATGG ATATTTTGAT TAATCAAGAG
AAAGTGGAAA TTTCTCGTAT TCTAGGGCAC GGAGGTATTT TTAAAACTGA AGGGGTCGCT
CAAAAAATAT TGTCTTCTGC ACTTAATATT CCTTTAGCAA CTGCAAGCAC AGCTTCAAAT
GGAGGAGCTT GGGGTATTGC ACTGCTGGCG AATTTTCTTA CCATTTCTAA AAGATATAAT
TTGGAAGAAT ATTTAGATAA TTGTATTTTC AACACAACTA AACTTAATCT TATTCAACCG
GATAAAATAA TGAGTGAAGG CTATGAACGA TTTATGCAAC GTTATAAACA AGGTATTTCA
ATTGTGGAAA GTGCACTTTT TTTGAATCAG TAA
 
Protein sequence
MQNEKQLIEQ GNIVIGIELG STRIKAVLIT SDGTILATGG ADWENRLIEG VWTYHQHEIW 
EKLQAAYVDL SKTVKAKYHT TIRTAKALGI SAMMHGYLPF DKQGNQLVPF RTWRNNITLK
SSEKLTALFQ YPIPQRWSIA HLYQALLNKE PHLSEIDYIT TLSGYVHWQL TENKVLGIGD
ASGMFPIDTE MQSYNKTMLT QFEQAIIEYQ MPWRLDAILP KVLVAGEHAG KLTFKGAKLL
DPTGNLQAGI PLCPPEGDAG TGMIATNCIK EKTGNISAGT SAFAMIVLEK ALSKVYSELD
IVTTPAGKLV AMAHANNCTS DINAWINLFG ECLATFGVYV CMEELYEMLF LHALQGEPDC
GGLLSYGFYS GEHNVGLSEG CPVFLHPTKA RFNLANFMRT HFYTAFGAMK LGMDILINQE
KVEISRILGH GGIFKTEGVA QKILSSALNI PLATASTASN GGAWGIALLA NFLTISKRYN
LEEYLDNCIF NTTKLNLIQP DKIMSEGYER FMQRYKQGIS IVESALFLNQ