Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_1132 |
Symbol | nagZ |
ID | 4240633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 1271025 |
End bp | 1272080 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638104695 |
Product | beta-hexosaminidase |
Protein accession | YP_719344 |
Protein GI | 113461275 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00323913 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGATAT TATTAATAGA TTTATCGAGT CAAGAGTTAC GTCAAGAAGA AATCGAATTA CTTGAACATC CTCTAGTCGC CGGGTTAATT TTATTTAGCC GCAATTTTTA TGATATTGAG CAAATCCGAC ATCTCATTAG ATCTGTGCGT CAAAAGGTTA AAAAGCCTTT ATTAATTACT GTTGATCAAG AGGGTGGGCG AGTACAGCGT TTTCGTCAGG GTTTTACGCA ATTACCGGCA ATGCAGTCCT TTGCCTGTTT ACTTTCTGAT CCTCAAGAGC AACAGGAAAT GGCTTGGCGA GCAGGTTGGC AAATGGCGGC GGAAATGACC GCACTTGATA TCGATCTTAG CTTTGCACCT GTTTTGGATC TTGGGCATCA ATGTAAGGCA ATTGGCGACC GCAGTTTTCA TTATGAGGAA AAGAAACTGA TTGAACTTGC TGAGAAATTT ATTCAAGGTA TGAGACAAAT CGGAATGTCG GCAACGGGTA AGCATTTTCC CGGTCATGGA CATGTGTTAG CCGATTCTCA TTTGGAAACA CCTTATGATG ATCGAGCAAA AGAATTAATT TTTGCACAAG ATATTCGACC GTTTCAGTCT TTGATTAAAC AAGGGTTGTT AGATGCAGTG ATGCCGGCAC ATGTTATCTA CACTCAATGT GATAATCAAC CAGCAAGTGG ATCGTCTTAT TGGCTAAAGG AGGTTTTACG TCAACAATTG GGATTCCAAG GAGCGATTTT TTCTGATGAT TTGGGTATGA AAGGTGCCGG TTTTATGGGG GATTTTGTTG CTCGTTGTAC GCAATCTCTT CAGGCGGGTT GTGATTTATT GTTGTTATGT AACGAGCCTG AAGCAGTGGT GCAAGTTTTA GATCGTTTTA AACCACAGGA AAGTCAAAAT AAACGAATAA TACGCCAAAC TAGATTGAAT AAATTATTCA AAAAACAACG GATTGATTGG CAAACCTTAC GCAATCAACG TGATTGGTTG GAAAATCACA AAAAACTTAC CGCACTTCAG CAAGATTGGT TAGCCTATAA GGGCTATGAC AATTAG
|
Protein sequence | MSILLIDLSS QELRQEEIEL LEHPLVAGLI LFSRNFYDIE QIRHLIRSVR QKVKKPLLIT VDQEGGRVQR FRQGFTQLPA MQSFACLLSD PQEQQEMAWR AGWQMAAEMT ALDIDLSFAP VLDLGHQCKA IGDRSFHYEE KKLIELAEKF IQGMRQIGMS ATGKHFPGHG HVLADSHLET PYDDRAKELI FAQDIRPFQS LIKQGLLDAV MPAHVIYTQC DNQPASGSSY WLKEVLRQQL GFQGAIFSDD LGMKGAGFMG DFVARCTQSL QAGCDLLLLC NEPEAVVQVL DRFKPQESQN KRIIRQTRLN KLFKKQRIDW QTLRNQRDWL ENHKKLTALQ QDWLAYKGYD N
|
| |