Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0303 |
Symbol | iolC |
ID | 4239505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 302655 |
End bp | 304565 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638103843 |
Product | myo-inositol catabolism protein |
Protein accession | YP_718511 |
Protein GI | 113460449 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0524] Sugar kinases, ribokinase family [COG3892] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG AAAAAACATT AGATCTTATT TGTCTTGGAC GAGTTGCCGT AGATTTATAT GCTCAACAAA TCGGTTCACG CTTAGAGGAT GTGAGCTCTT TTGCTAAATA TCTCGGCGGC TCATCAGGTA ATGTTGCTTA TGGAACGGCT GTACAAGGTG TAAAATCATC TATGTTGGCT CGTGTAGGTG ATGAACATAT GGGACGTTTC TTACGTGAGG AATTGCAGCG TCTAGGTGTA GATACGAGCC ATTTGATTAC GGATAAAGAG CGGCTGACTG CTCTAGTGAT TTTAGGAATA AAAGATCAAG ATACTTTTCC ATTAATTTTC TATCGTGACA ATTGTGCAGA CATGGCAATC ACCGCTGATG ATTTTGATGA AGAATATATT GCTTCGGCAA AAGCACTTGC TATTACCGGA ACGCATTTAT CCCATCCAAA AACTCGCCAT GCGGTTTTAA CCGCACTTGA ATACGCAGGT CGTAATGGAA CAAAACGTTT ACTGGATATT GACTATCGTC CTGTACTTTG GGGATTAACT TCATTGGGTG ATGGTGAAAC TCGTTTTATT GACTCTGAAG CCGTTACACA ATCTTTACAA GAAGTTTTAC ATCACTTTGA TGTCTTAGTT GGTACAGAAG AGGAGTTTCA TATTGCTGGT GGGTCAACTG ATACGCTAAC TGCATTAAAA AATGTACGTA AAGTCAGTAA CGCTACGTTG GTTTGCAAAC GTGGGGCGTT AGGTTGCTCT GTCTTTGAGG GCGATATTCC GAATCATATT GACGGTGGTA TTAATGTATA TGGCGTACGT GTTGAAGTTT TGAATGTCTT AGGTGCAGGC GATGCCTTTA TGTCAGGTTT ATTACGTGGC TATATTAACG GTGAAAGTTG GGAGCAATGT TGTCGTTATG CAAATGCTTG CGGTGCGTTG GTCGTCTCTC GCCATGGTTG TGCCCCAGCA ATGCCGACTA AAGAAGAATT AGATAATTAT CTCTCACGTG CGGAATCGGT ACCTCGACCG GACTTAGACG AACAATTAAA CCATTTACAC CGTGTTACAA CACGCAAGCA ACAATGGCAT GATTTATGTA TTTTTGCCTT TGATCATCGT AAACAATTGG TTGATATGGC TGAAAAGTGC GGTGCAGATT TGAAACGTAT TCCAACACTC AAAACGTTAT TACTTAAAGC TGCAGAACAA ACCGCTCAAA AAGAGGGGAT TTATAATGGC AATGCCGGCG TTTTAGCTGA TACCACATTT GGTCAAGAAG CACTCAATGA AATTACTGGT AAAAAATGGT GGATTGGTCG TCCAATAGAA ATGCCATCTT CTCGTCCATT GCGTTTAGAG CATGGTGATT TAGGTAGTCA ACTCATTAGT TGGCCCGCAG AACATGTCGT TAAATGTTTA GCATTCTATC ATCCAGCAGA CAAGGCAGAG TTAAAAGCAG ATCAAGATGC AACATTAAAA GAAGTTTATC GTGCTTGTTG TCGTACAGGT CACGAATTAT TACTGGAGAT CATTTTGCCT GCGGATATGG AGCAAAAAGA AAGTTACTAC ACCGATATGA TTGCCCACTT CTATTCTCTC GGCATTAAAC CGGATTGGTG GAAATTACCG GGAGTATCCG CAAAAACTTG GGCAGACATC AGTCAAGTCA TTGAAAAAAA TGACAAACAT TGTCGTGGTA TTTTAATCCT TGGCTTAGAC GCACCTGAAG CAGTCTTTGA GAATGTGTTT AAAGCATCGG CAAACGCACC TTTAGTAAAA GGGTTTGCTG TAGGCAGAAC AATTTTTGGA CAACCTTCGG CTGATTGGTT AGCTGGTAAA ATTGATGATG AGCAACTCAT CAAAGAAACC TCAGAGCGTT ATAGAAGATT AATTAAATTA TGGAAAAATC GTAAAAACTA A
|
Protein sequence | MKKEKTLDLI CLGRVAVDLY AQQIGSRLED VSSFAKYLGG SSGNVAYGTA VQGVKSSMLA RVGDEHMGRF LREELQRLGV DTSHLITDKE RLTALVILGI KDQDTFPLIF YRDNCADMAI TADDFDEEYI ASAKALAITG THLSHPKTRH AVLTALEYAG RNGTKRLLDI DYRPVLWGLT SLGDGETRFI DSEAVTQSLQ EVLHHFDVLV GTEEEFHIAG GSTDTLTALK NVRKVSNATL VCKRGALGCS VFEGDIPNHI DGGINVYGVR VEVLNVLGAG DAFMSGLLRG YINGESWEQC CRYANACGAL VVSRHGCAPA MPTKEELDNY LSRAESVPRP DLDEQLNHLH RVTTRKQQWH DLCIFAFDHR KQLVDMAEKC GADLKRIPTL KTLLLKAAEQ TAQKEGIYNG NAGVLADTTF GQEALNEITG KKWWIGRPIE MPSSRPLRLE HGDLGSQLIS WPAEHVVKCL AFYHPADKAE LKADQDATLK EVYRACCRTG HELLLEIILP ADMEQKESYY TDMIAHFYSL GIKPDWWKLP GVSAKTWADI SQVIEKNDKH CRGILILGLD APEAVFENVF KASANAPLVK GFAVGRTIFG QPSADWLAGK IDDEQLIKET SERYRRLIKL WKNRKN
|
| |