Gene HS_1578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1578 
SymboliolD 
ID4241105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1790102 
End bp1792039 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content40% 
IMG OID638105164 
Productmyo-inositol catabolism protein 
Protein accessionYP_719783 
Protein GI113461714 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3962] Acetolactate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.505237 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGACAG TGAGATTAAC GGTAGCTCAA GCATTAGTAA AATTTCTTGA TAACCAATAT 
ATTGAGTTCG ATGGAAAGGT AACAAAATTC GTTGAAGGCA TTTTCGGGAT TTTTGGACAT
GGTAACGTGC TGGGTTTAGG TCAAGCACTG GAACAGGATA GTGGCGAATT GATTGTGCGT
CAAGGACGTA ACGAGCAGGG CATGGCTCAT GTAGCAATAG GGTTTGCGAA GCAAAAATTA
CGCAAACAAA TCTATGCCTG TACTTCATCT GTTGGTCCGG GGGCTGCAAA TATGATTACA
GCAGCTGCAA CAGCAACGGC AAACCGTATT CCACTTCTTT TATTGCCGGG TGATGTTTTT
GCTACTCGTC AACCAGACCC TGTGTTGCAA CAAATTGAAC AAACTTATGA TTTAAGTATT
AGCACCAATG ATGCTTTTCG TGCGGTAAGT AAATACTGGG ATCGTGTAAG TCGTCCTGAA
CAATTAATGA CAGCTTGTAT CAATGCAATG CGTGTATTGA CTGATCCTGC AGAAACCGGT
GCGGTGACAA TTGCATTGCC ACAAGATGTC CAGACTGAAG CTTATGATTT TCCTGAGTAT
TTTTTACAAA AACGTATTCA TCGCATGGAG CGTACGCCTC CAACGGAAGC GATGCTACAA
GATGCTTTGA CCTTAATTCA AAATGCGAAA AAACCACTAA TTATTTGTGG TGGTGGAGTG
CGGTACTCTG AGGCGGCAGA GCAATTAAAA GCCTTCGCTG AAATTTATAA TATTCCATTT
GCAGAGACTC AAGCGGGTAA AAGTGCGGTA GTTTCTGAGC ATTATTTGAA TGTTGGCGGT
ATAGGTGTAA CAGGTTGTGT AGCTGCAAAT TTATTAGCAC CGGAGGCTGA TTTAGTCATT
GGTATTGGTA CAAGATATAC TGATTTTACG ACGGGATCTA AATGGATTTT CAATAATGAC
GAAGTGAAAT TCTTAAATAT CAACGTTGCT CGCTTTGATG CTTATAAGTT AGATGGTGTA
CAAGTTACAG CTGATGCCAA AGAAACGCTG GAAAAATTAA CCGCACTTTT AGCTACAACA
GGTTATCAAG CCCACTGGGG TGATAAAGTG GCACAGGCAA AAAAAGCGTT GGAAACTGAA
TTAGAGCGTG TTCACAATAT AACTTATACC GAAGATTTTG TACCTGAAAT TGATGATCGT
TTAAATCGTG AGGCGGTTTA TGCTGAATTT ATGGAGATGA CCAAATCTTG TTTAGCTCAA
ACAAGGGTGT TAGGTATTTT AAATGAAACA TTAGGTGAAA ATGATGTCAT CGTGGGGGCT
GCGGGTAGTT TGCCAGGCGA TTTACAACGT ATTTGGCAAG CTAAAGGCGA AAATACCTAT
CATTTGGAAT ACGGTTATTC TTGTATGGGG TATGAAGTTA GTGCTGCTCT AGGGGTCAAA
ATTGCAGAAC CTCATCGTGA AGTTTATACC TTATTAGGTG ATGGATCTTA CATGATGTTG
CATTCTGAAC TGGTAACGTC AATTCAAGAA AATAGAAAAA TTAACGTAAT ATTATTCGAT
AATATGACAA ACGGTTGTAT CAATAACTTA CAAATTGGTC ATGGTATGGA CAGTTTTGCA
ACGGAATTTC GTTTTAGAAA CAAGAATACC AATAAATTAG ATGGGGGCTT TGTACCGGTT
AATTTTGCTA TGAATGCCGC ATCTTACGGT TGCAAAACCT ATCAAGTAAC TACCGAAGAA
GAGCTACGAT TGGCTTTGGC TGATGCAAAA AAACAACAGG TTTCTACTTT AATTGATATT
AAGGTTTTAC CAAAAACGAT GGCGAAGGGA TATGACAGCT GGTGGCACGT TGGTGTGGCT
GAGGTGTCAA ATAAAGCTGA AATTAATGCT GCCTATGAAG ATTCTATTGC TCATATTGAG
GTAGCAAGAC GTTACTAA
 
Protein sequence
MKTVRLTVAQ ALVKFLDNQY IEFDGKVTKF VEGIFGIFGH GNVLGLGQAL EQDSGELIVR 
QGRNEQGMAH VAIGFAKQKL RKQIYACTSS VGPGAANMIT AAATATANRI PLLLLPGDVF
ATRQPDPVLQ QIEQTYDLSI STNDAFRAVS KYWDRVSRPE QLMTACINAM RVLTDPAETG
AVTIALPQDV QTEAYDFPEY FLQKRIHRME RTPPTEAMLQ DALTLIQNAK KPLIICGGGV
RYSEAAEQLK AFAEIYNIPF AETQAGKSAV VSEHYLNVGG IGVTGCVAAN LLAPEADLVI
GIGTRYTDFT TGSKWIFNND EVKFLNINVA RFDAYKLDGV QVTADAKETL EKLTALLATT
GYQAHWGDKV AQAKKALETE LERVHNITYT EDFVPEIDDR LNREAVYAEF MEMTKSCLAQ
TRVLGILNET LGENDVIVGA AGSLPGDLQR IWQAKGENTY HLEYGYSCMG YEVSAALGVK
IAEPHREVYT LLGDGSYMML HSELVTSIQE NRKINVILFD NMTNGCINNL QIGHGMDSFA
TEFRFRNKNT NKLDGGFVPV NFAMNAASYG CKTYQVTTEE ELRLALADAK KQQVSTLIDI
KVLPKTMAKG YDSWWHVGVA EVSNKAEINA AYEDSIAHIE VARRY