Gene HS_1600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1600 
Symbol 
ID4241127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1815405 
End bp1817165 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content38% 
IMG OID638105186 
Productbeta-hexosamidase A 
Protein accessionYP_719805 
Protein GI113461736 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.592664 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGTTG ATTTAACGAA AAAACCTTAT TACTTAAAAG ATGAAGATAT TCAATGGGTA 
AAAAACACTA TAGCCAATAT GACGTTGGAA GAAAAAATCG GTCAGCTTTT TGTGAATATG
GGTTCAAGCA GATCCGAGGA ATATCTCACT GATGTTTTAA ACCGTTACCA TATTGGTGCG
GTACGTTATA ATCCCGGTCC TGCTGAGGAG GTTTATGACC AAAACTATAT TCTACAAACT
AAAAGTAAAA TCCCTCTCTT AATTGCAGCG AATACAGAAG CTGGTGGTAA TGGTGCTTGC
AGTGATGGAA CAGAAATTGG TTTACAAGTC AAAATCGGTG CAACCGATGA TGCTAAATAT
GCTTATGAAA TGGGGCGTGT AGCTGGTATT GAAGCAGCAG CTATCGGTTG TAACTGGAGT
TTTGCTCCTA TCGTTGATAT TAGTTATAAC TGGCGAAACC CAATTATTTC TAATCGAGTA
TTTGGTTCAA ATCCAGAGAA AGTACTAGAA ATGTCCTTGG CTTATATGAA AGGCATTCAA
GAAAGTGGTA TTGCCCCGGC TGCAAAACAT TTCCCTGGTG ATGGAGTGGA TGAAAGAGAT
CAGCATCTTT CTTTTAGTGT AAACAGTTTT AGTTGCGAAG AGTGGGACAA TACTTTCGGC
AAAGTTTATC AAGGCTTAAT TGATGCTGGT TTACCATCGT TGATGGCAGG TCATATACAC
TTACCTGCTT ACGAGCAGCA TTTCAATCCA AATTTAGCTT ATGAGGACTG CCTACCAGCG
ACACTTTCAA AACCAATTTT AACAGACTTA TTACGAGGTA AATTAGGTTT CAACGGTGTT
GTTGTTACTG ATGCCAGCCA TATGGTAGCA ATGACTTCAG CAATGAAGCG TAGCGAAATG
CTACCGACTG CTATCGCTGC AGGTTGTGAT TTATTCCTTT TCTTTAACGA TCCTGATGAA
GATTTCGGTT ATATGATGGA AGGTTATAAA AACGGCATTA TTACCGAAGA ACGCCTTCAT
GATGCCCTTA CTCGTATTCT AGGGTTAAAA GCAAAATTAG GCTTACACAA TCGCCCAAGA
GAAACATTGC TTGAACCAAA AGAACAAGCA TTAGCAAAAA TAGGACTTCC AGAACATAAA
GCAATTTTCC GTGATGTTGC AGATAAAGCA ATTACCTTAG TTAAACATAA ACAAGATATT
TTCCCAATTA ATGTTGAACG TTTCCCTAGA ATCTTACTTG TAAATGTTAA AGGTACAGAT
GGTGGCTTCG GTAAAATGGT TGCAGGTAGT CAGCGCAGTG CAACAGAAAT CTTAAAAGAA
AAACTGGAAG AAAAAGGATT TACCGTTTCT ATTTATACAT CACCGATGGA TAATATTCTC
AAGATGTCGG ACGAAGAACA GGTCAAAACT ATTCGTAATG AATACTCCCA AAAACGCCCA
ATTACAACAT TAACGGATCA TTATGATCTG ATTATCAATG TTGCGAATGT GCAAATGAGT
ACAGTTCAAC GTATCGTATG GCAAGCGACC AAAGGAACGC CTGATATTCC ATTCTATGTA
CATGAAGTAC CTACAATTTT CGTTTCCGTT CAATGCCCAT TCCATTTAGT TGATGTTCCT
CAGGTAAAAA CTTACATCAA TGCCTATGAC GGCAAAGAAC CAACCATAGA ATTACTGGTA
GAAAAATTAA TGGGTAATTC CGAGTTTAAA GGTATTAGCC CAGTTGATGC TTTCTGTGGT
TATAAAGACA CCCGTATTTA A
 
Protein sequence
MSVDLTKKPY YLKDEDIQWV KNTIANMTLE EKIGQLFVNM GSSRSEEYLT DVLNRYHIGA 
VRYNPGPAEE VYDQNYILQT KSKIPLLIAA NTEAGGNGAC SDGTEIGLQV KIGATDDAKY
AYEMGRVAGI EAAAIGCNWS FAPIVDISYN WRNPIISNRV FGSNPEKVLE MSLAYMKGIQ
ESGIAPAAKH FPGDGVDERD QHLSFSVNSF SCEEWDNTFG KVYQGLIDAG LPSLMAGHIH
LPAYEQHFNP NLAYEDCLPA TLSKPILTDL LRGKLGFNGV VVTDASHMVA MTSAMKRSEM
LPTAIAAGCD LFLFFNDPDE DFGYMMEGYK NGIITEERLH DALTRILGLK AKLGLHNRPR
ETLLEPKEQA LAKIGLPEHK AIFRDVADKA ITLVKHKQDI FPINVERFPR ILLVNVKGTD
GGFGKMVAGS QRSATEILKE KLEEKGFTVS IYTSPMDNIL KMSDEEQVKT IRNEYSQKRP
ITTLTDHYDL IINVANVQMS TVQRIVWQAT KGTPDIPFYV HEVPTIFVSV QCPFHLVDVP
QVKTYINAYD GKEPTIELLV EKLMGNSEFK GISPVDAFCG YKDTRI