Gene HS_0303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0303 
SymboliolC 
ID4239505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp302655 
End bp304565 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content41% 
IMG OID638103843 
Productmyo-inositol catabolism protein 
Protein accessionYP_718511 
Protein GI113460449 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0524] Sugar kinases, ribokinase family
[COG3892] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG AAAAAACATT AGATCTTATT TGTCTTGGAC GAGTTGCCGT AGATTTATAT 
GCTCAACAAA TCGGTTCACG CTTAGAGGAT GTGAGCTCTT TTGCTAAATA TCTCGGCGGC
TCATCAGGTA ATGTTGCTTA TGGAACGGCT GTACAAGGTG TAAAATCATC TATGTTGGCT
CGTGTAGGTG ATGAACATAT GGGACGTTTC TTACGTGAGG AATTGCAGCG TCTAGGTGTA
GATACGAGCC ATTTGATTAC GGATAAAGAG CGGCTGACTG CTCTAGTGAT TTTAGGAATA
AAAGATCAAG ATACTTTTCC ATTAATTTTC TATCGTGACA ATTGTGCAGA CATGGCAATC
ACCGCTGATG ATTTTGATGA AGAATATATT GCTTCGGCAA AAGCACTTGC TATTACCGGA
ACGCATTTAT CCCATCCAAA AACTCGCCAT GCGGTTTTAA CCGCACTTGA ATACGCAGGT
CGTAATGGAA CAAAACGTTT ACTGGATATT GACTATCGTC CTGTACTTTG GGGATTAACT
TCATTGGGTG ATGGTGAAAC TCGTTTTATT GACTCTGAAG CCGTTACACA ATCTTTACAA
GAAGTTTTAC ATCACTTTGA TGTCTTAGTT GGTACAGAAG AGGAGTTTCA TATTGCTGGT
GGGTCAACTG ATACGCTAAC TGCATTAAAA AATGTACGTA AAGTCAGTAA CGCTACGTTG
GTTTGCAAAC GTGGGGCGTT AGGTTGCTCT GTCTTTGAGG GCGATATTCC GAATCATATT
GACGGTGGTA TTAATGTATA TGGCGTACGT GTTGAAGTTT TGAATGTCTT AGGTGCAGGC
GATGCCTTTA TGTCAGGTTT ATTACGTGGC TATATTAACG GTGAAAGTTG GGAGCAATGT
TGTCGTTATG CAAATGCTTG CGGTGCGTTG GTCGTCTCTC GCCATGGTTG TGCCCCAGCA
ATGCCGACTA AAGAAGAATT AGATAATTAT CTCTCACGTG CGGAATCGGT ACCTCGACCG
GACTTAGACG AACAATTAAA CCATTTACAC CGTGTTACAA CACGCAAGCA ACAATGGCAT
GATTTATGTA TTTTTGCCTT TGATCATCGT AAACAATTGG TTGATATGGC TGAAAAGTGC
GGTGCAGATT TGAAACGTAT TCCAACACTC AAAACGTTAT TACTTAAAGC TGCAGAACAA
ACCGCTCAAA AAGAGGGGAT TTATAATGGC AATGCCGGCG TTTTAGCTGA TACCACATTT
GGTCAAGAAG CACTCAATGA AATTACTGGT AAAAAATGGT GGATTGGTCG TCCAATAGAA
ATGCCATCTT CTCGTCCATT GCGTTTAGAG CATGGTGATT TAGGTAGTCA ACTCATTAGT
TGGCCCGCAG AACATGTCGT TAAATGTTTA GCATTCTATC ATCCAGCAGA CAAGGCAGAG
TTAAAAGCAG ATCAAGATGC AACATTAAAA GAAGTTTATC GTGCTTGTTG TCGTACAGGT
CACGAATTAT TACTGGAGAT CATTTTGCCT GCGGATATGG AGCAAAAAGA AAGTTACTAC
ACCGATATGA TTGCCCACTT CTATTCTCTC GGCATTAAAC CGGATTGGTG GAAATTACCG
GGAGTATCCG CAAAAACTTG GGCAGACATC AGTCAAGTCA TTGAAAAAAA TGACAAACAT
TGTCGTGGTA TTTTAATCCT TGGCTTAGAC GCACCTGAAG CAGTCTTTGA GAATGTGTTT
AAAGCATCGG CAAACGCACC TTTAGTAAAA GGGTTTGCTG TAGGCAGAAC AATTTTTGGA
CAACCTTCGG CTGATTGGTT AGCTGGTAAA ATTGATGATG AGCAACTCAT CAAAGAAACC
TCAGAGCGTT ATAGAAGATT AATTAAATTA TGGAAAAATC GTAAAAACTA A
 
Protein sequence
MKKEKTLDLI CLGRVAVDLY AQQIGSRLED VSSFAKYLGG SSGNVAYGTA VQGVKSSMLA 
RVGDEHMGRF LREELQRLGV DTSHLITDKE RLTALVILGI KDQDTFPLIF YRDNCADMAI
TADDFDEEYI ASAKALAITG THLSHPKTRH AVLTALEYAG RNGTKRLLDI DYRPVLWGLT
SLGDGETRFI DSEAVTQSLQ EVLHHFDVLV GTEEEFHIAG GSTDTLTALK NVRKVSNATL
VCKRGALGCS VFEGDIPNHI DGGINVYGVR VEVLNVLGAG DAFMSGLLRG YINGESWEQC
CRYANACGAL VVSRHGCAPA MPTKEELDNY LSRAESVPRP DLDEQLNHLH RVTTRKQQWH
DLCIFAFDHR KQLVDMAEKC GADLKRIPTL KTLLLKAAEQ TAQKEGIYNG NAGVLADTTF
GQEALNEITG KKWWIGRPIE MPSSRPLRLE HGDLGSQLIS WPAEHVVKCL AFYHPADKAE
LKADQDATLK EVYRACCRTG HELLLEIILP ADMEQKESYY TDMIAHFYSL GIKPDWWKLP
GVSAKTWADI SQVIEKNDKH CRGILILGLD APEAVFENVF KASANAPLVK GFAVGRTIFG
QPSADWLAGK IDDEQLIKET SERYRRLIKL WKNRKN