Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_1600 |
Symbol | |
ID | 4241127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 1815405 |
End bp | 1817165 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638105186 |
Product | beta-hexosamidase A |
Protein accession | YP_719805 |
Protein GI | 113461736 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.592664 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGTTG ATTTAACGAA AAAACCTTAT TACTTAAAAG ATGAAGATAT TCAATGGGTA AAAAACACTA TAGCCAATAT GACGTTGGAA GAAAAAATCG GTCAGCTTTT TGTGAATATG GGTTCAAGCA GATCCGAGGA ATATCTCACT GATGTTTTAA ACCGTTACCA TATTGGTGCG GTACGTTATA ATCCCGGTCC TGCTGAGGAG GTTTATGACC AAAACTATAT TCTACAAACT AAAAGTAAAA TCCCTCTCTT AATTGCAGCG AATACAGAAG CTGGTGGTAA TGGTGCTTGC AGTGATGGAA CAGAAATTGG TTTACAAGTC AAAATCGGTG CAACCGATGA TGCTAAATAT GCTTATGAAA TGGGGCGTGT AGCTGGTATT GAAGCAGCAG CTATCGGTTG TAACTGGAGT TTTGCTCCTA TCGTTGATAT TAGTTATAAC TGGCGAAACC CAATTATTTC TAATCGAGTA TTTGGTTCAA ATCCAGAGAA AGTACTAGAA ATGTCCTTGG CTTATATGAA AGGCATTCAA GAAAGTGGTA TTGCCCCGGC TGCAAAACAT TTCCCTGGTG ATGGAGTGGA TGAAAGAGAT CAGCATCTTT CTTTTAGTGT AAACAGTTTT AGTTGCGAAG AGTGGGACAA TACTTTCGGC AAAGTTTATC AAGGCTTAAT TGATGCTGGT TTACCATCGT TGATGGCAGG TCATATACAC TTACCTGCTT ACGAGCAGCA TTTCAATCCA AATTTAGCTT ATGAGGACTG CCTACCAGCG ACACTTTCAA AACCAATTTT AACAGACTTA TTACGAGGTA AATTAGGTTT CAACGGTGTT GTTGTTACTG ATGCCAGCCA TATGGTAGCA ATGACTTCAG CAATGAAGCG TAGCGAAATG CTACCGACTG CTATCGCTGC AGGTTGTGAT TTATTCCTTT TCTTTAACGA TCCTGATGAA GATTTCGGTT ATATGATGGA AGGTTATAAA AACGGCATTA TTACCGAAGA ACGCCTTCAT GATGCCCTTA CTCGTATTCT AGGGTTAAAA GCAAAATTAG GCTTACACAA TCGCCCAAGA GAAACATTGC TTGAACCAAA AGAACAAGCA TTAGCAAAAA TAGGACTTCC AGAACATAAA GCAATTTTCC GTGATGTTGC AGATAAAGCA ATTACCTTAG TTAAACATAA ACAAGATATT TTCCCAATTA ATGTTGAACG TTTCCCTAGA ATCTTACTTG TAAATGTTAA AGGTACAGAT GGTGGCTTCG GTAAAATGGT TGCAGGTAGT CAGCGCAGTG CAACAGAAAT CTTAAAAGAA AAACTGGAAG AAAAAGGATT TACCGTTTCT ATTTATACAT CACCGATGGA TAATATTCTC AAGATGTCGG ACGAAGAACA GGTCAAAACT ATTCGTAATG AATACTCCCA AAAACGCCCA ATTACAACAT TAACGGATCA TTATGATCTG ATTATCAATG TTGCGAATGT GCAAATGAGT ACAGTTCAAC GTATCGTATG GCAAGCGACC AAAGGAACGC CTGATATTCC ATTCTATGTA CATGAAGTAC CTACAATTTT CGTTTCCGTT CAATGCCCAT TCCATTTAGT TGATGTTCCT CAGGTAAAAA CTTACATCAA TGCCTATGAC GGCAAAGAAC CAACCATAGA ATTACTGGTA GAAAAATTAA TGGGTAATTC CGAGTTTAAA GGTATTAGCC CAGTTGATGC TTTCTGTGGT TATAAAGACA CCCGTATTTA A
|
Protein sequence | MSVDLTKKPY YLKDEDIQWV KNTIANMTLE EKIGQLFVNM GSSRSEEYLT DVLNRYHIGA VRYNPGPAEE VYDQNYILQT KSKIPLLIAA NTEAGGNGAC SDGTEIGLQV KIGATDDAKY AYEMGRVAGI EAAAIGCNWS FAPIVDISYN WRNPIISNRV FGSNPEKVLE MSLAYMKGIQ ESGIAPAAKH FPGDGVDERD QHLSFSVNSF SCEEWDNTFG KVYQGLIDAG LPSLMAGHIH LPAYEQHFNP NLAYEDCLPA TLSKPILTDL LRGKLGFNGV VVTDASHMVA MTSAMKRSEM LPTAIAAGCD LFLFFNDPDE DFGYMMEGYK NGIITEERLH DALTRILGLK AKLGLHNRPR ETLLEPKEQA LAKIGLPEHK AIFRDVADKA ITLVKHKQDI FPINVERFPR ILLVNVKGTD GGFGKMVAGS QRSATEILKE KLEEKGFTVS IYTSPMDNIL KMSDEEQVKT IRNEYSQKRP ITTLTDHYDL IINVANVQMS TVQRIVWQAT KGTPDIPFYV HEVPTIFVSV QCPFHLVDVP QVKTYINAYD GKEPTIELLV EKLMGNSEFK GISPVDAFCG YKDTRI
|
| |