Gene HS_0054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0054 
Symbol 
ID4239562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp59882 
End bp60901 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content40% 
IMG OID638103585 
ProductABC transporter, solute-binding, sugar transport 
Protein accessionYP_718260 
Protein GI113460203 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.279397 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATACT CACTTTTAAA AACAACCGCT CTTGCCGTAG CGTTAGGTTT TACCGGTTTT 
AACATAGCAC AGGCTATGGA TAAGGTGGCA TTTATCCCAA AATTAGTTGG CGTTGGTTTT
TTTACCAGTG GTGGGCAAGG TGCGGTCGAA ATGGGAAAAA AATTAGGTTT GGATGTAACC
TATGACGGTC CGGCAGAACC GAGCGTATCG AACCAAGTTC AGATGATCAA TAACTTTGTG
AACCAGGGTT ATAATGCCAT TATCGTCTCA GCAGTATCGC CTGATGGATT GTGCTCTACC
TTAAAAAGAG CGATGAAAAA AGGTGTAAAA GTATTAACTT GGGACTCCGA TACTCAGCCT
GAGTGCCGAA GCTACTATAT TAATCAAGGA ACACCTACTC AACTTGGCTC AATGCTAGTT
GAAATGGTAT CAAGTCAAAT TTCTAAACCA AAAGCAAAAG TTGCATTTTT CTATTCCAGT
CCAACAGTGA CTGACCAAAA CCAGTGGGTT AAAGAGGCAA AAGCAAAAAT TGAAAAAGAA
CATCCCAAAT GGGAAATTGT GACGACACAA TTTGGCTATA ACGATGCAAT TAAATCACTG
CAAACTGCCG AGGGGATCTT AAAAGCCTAT CCTGATTTAG ATGCGATTAT TGCTCCAGAT
GCCAATGCTT TGCCGGCTGC CGCTCAAGCA GTTGAGAACC TTAAACGACA AGGTACAATC
GTTGTCGGAT TCAGTACGCC GAATGTAATG CGTCCTTACG TAAAACGAGG CACAGTAAAT
CAGTTTGGTT TATGGGATGT TGTGAAGCAA GGTCAACTCT CTGTTGCAGT AGCTAATGAA
TTGTTAAAAG GTAATTCTCT TAAGGTTGGC GATAAATTGA ATGTTGATGG TATTGGTGAA
GTAGAAGTAT CAGCAAATAA AGTACAAGGC TATGAGTTTG AAGCAAAGGG AAACGGTATT
GTGTTACTAC CTGAGCGTGT TGTATTCACT AAAGATAATA TTGATAACTA TGATTTCTAA
 
Protein sequence
MKYSLLKTTA LAVALGFTGF NIAQAMDKVA FIPKLVGVGF FTSGGQGAVE MGKKLGLDVT 
YDGPAEPSVS NQVQMINNFV NQGYNAIIVS AVSPDGLCST LKRAMKKGVK VLTWDSDTQP
ECRSYYINQG TPTQLGSMLV EMVSSQISKP KAKVAFFYSS PTVTDQNQWV KEAKAKIEKE
HPKWEIVTTQ FGYNDAIKSL QTAEGILKAY PDLDAIIAPD ANALPAAAQA VENLKRQGTI
VVGFSTPNVM RPYVKRGTVN QFGLWDVVKQ GQLSVAVANE LLKGNSLKVG DKLNVDGIGE
VEVSANKVQG YEFEAKGNGI VLLPERVVFT KDNIDNYDF