Gene Information |
Locus tag | Hmuk_1505 |
Symbol | |
ID | 8411026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 1431335 |
End bp | 1433785 |
Gene Length | 2451 bp |
Protein Length | 816 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645019831 |
Product | Glycoside hydrolase, family 20, catalytic core |
Protein accession | YP_003177327 |
Protein GI | 257387554 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
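
The coordinates and lengths listed above can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 1431335
end_bp = 1433785

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as stated) and ends with one stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the listed gene length (2451 bp) and protein length (816 aa), and the gene sequence does end in the stop codon TAG.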

Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.869244 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGAGA CACGACGCAA CATCCTACGG AAGGCATCGG CACTGTCTGC GCTGGCGATC GGCGTCAGCG GGACCGCATC GGCAGCAGAT TGCAGCGACG TTCAGCAGTG GCAATCCGGC ACCGCCTACA ACGGCGGCGA CCGCGTCGTT TACGACGACG TTCTCTGGGA AGCCGAGTGG TGGACCCAGG CCAACGAACC CGAAGAGAGC GACTCGGTCT GGACGAAGAT CGGCGACTGT AGTGGCGACA CCGAGAACGA GGCTCCGGTC GCGTCGTTCA CGGCGAGCAT CAGTACACCC GCGCCGGGCG AGTCCGTCAC CTTCGACGCG TCGGGCTCGT CCGATCCGGA CGGCTCCGTG AGTTCGTACG CCTGGGACTT TGGCGACGGC GACACGGCGA CCGGACAGAC CGCGAGTCAC ACGTATGGCT CTGCAGGCGA TTACACGGTC ACACTCACCG TCACCGACGA CGACGGCGCG AGCGGGACGG CATCGACGAC GGTGTCCGTC TCGGAAAGCG ACAACGAGGC TCCGAACGCG TCGTTCACCG TCTCGCCGTC CTCGCCGACG ACGGGCGAGT CGGTGACCGT CGACGCTGCC GACTCCTCGG ACGCCGACGG GTCGATTTCG TCGTACGCCT GGGACTTCGG CGACGACGCG ACCGCCAGCG GTCAGACCGC GACCCACACG TACGACTCGT CGGGCGAGTA CACGATCACA CTCACCGTCA CCGACGACGA CGGTGCGACC GACACCAACG CGACGACAGT CAGCGTCGGC GGCGACGGCG GCGAGTGTAG CGGCGTCTCG GAGTGGGACT CCGGAACGAC GTACACCGGC GGCGACCAGG TCATCTACGA CGGCACGCTC TGGGAAGCCA AGTGGTGGAG CAAGGGCGAC GAGCCCAGTA GCGATGGAGG CCCCTGGAAG CAGATCACCG CCTGTGGGCC GCCCGAGCCA GTCACGAAGA CGCTGGCCGA TCTCGTGCCG AAGCCGGGCG ACATCACGAC CGCCGACGAC GGCTTCGAGA TCACGTCCTC GACGACGATC GTCGCCGAGG GCAGCGGGAC CGAAGTCGGA CAGTACCTCG CCGACCTGCT GGGGCCGGCG ACCGGCTTCG ATCTGTCCGT CGAGAGTGGC TCGTCCGCGT CCGCGGACAG CATCGCGCTG TTGCTCAACG GCGCTCCCTC GTCGGTCGGC GACGAGGGCT ACGAGATGAG CGTCGACAGC GACGGCGTCA CGATCCGAGC CAACGAGGCC GCGGGGCTGT TCTACGGCGT CCAGTCGCTT CGCCAGGTGC TCCCGGCGGC GGTCGAGGCC GACACCGATC AGTCCGTCGA CTGGGTCGTC CCCGGCGGTT CGGTCACCGA CACACCGCGC TTCGAGTACC GCGGCGCGAT GCTCGACGTG GCACGGCACT TCTTCGACAA GTCGGTCGTC AAGGAGTTCA TCGACCAGGT GGCGGCCTAC AAGATCAATC ACCTGCACCT GCACCTGACC GACGACCAGG GCTGGCGTAT CGAGATCGAC GACTGGCCGA ACCTCACCGA CGAAGGGGCA GACTCGGAGG TCGACGGCGG GCCCGGCGGC TACTTCACGA AGGCCGACTA CCAGGAGATC ATCCAGTACG CGCAGGATCG CCACATGACG GTCGTCCCGG AGATCGACAT GCCCGGCCAC ACCGGGGCGG CCCTGGAGTC GTACGCGGAA CTGAACTGTG ACGACACGAA ACGCGAGGAA GACACCGGCA TCAACGTCGG CGACACCACG CTGTGCATGG ACGACGAGCA CAAGGAGACG 
AGTCTCCAGT TCGCAGCCGA CGTCATCAGC GCGGTCGCCG AGATGACGGA CGGCCCGTAC TTCCACGTCG GAGGCGACGA AGCCGACGTG CTGTCGGATG CCAAGTACGA GGAGTTCATC GACGCGGTCC TCCCGATGAT CGAGGACGCC GGAAAGACCC CCATCGGATG GCACCAGATC GCCAGCACGG AGCCCGTTAC GTCTGCGCTC CTCCACTACT GGGGAACCGA CGCACAGGCC CCGGAGGTCG CGGCCCGGGC CAGCGAGGGC AACGACGTCA TCGCCTCGCC CGCCCACCTC GCGTACCTCG ATCAAGACTA CAACTATCAG GACGGTGTGG GCCAGGACTG GGCGGGACCG GTCTCGGTCG AGGACGCCTA CACCTGGGAT CCGGGCAGCT ACATCGACGG CGTCGACGAG TCCTCGGTCG CCGGCGTCGA GGCACCGCTG TGGACGGAGT TCGTCGAGAC CCAGGACGAC ATCGAGTACA TGGTGTTCCC GCGGCTGGCG GCCATCGCGG AACTGGGCTG GTCGTCGTCG TCCGATATCG GCGACTTCGA CGCGTTCAGC CAGCGTCTGG CCCTGCAGGG CCCACGCTGG GCGCAGGCGA ACGTCAACTA CTACCAGTCT GATCTGGTCG ACTGGCAGTA G
|
Protein sequence | MRETRRNILR KASALSALAI GVSGTASAAD CSDVQQWQSG TAYNGGDRVV YDDVLWEAEW WTQANEPEES DSVWTKIGDC SGDTENEAPV ASFTASISTP APGESVTFDA SGSSDPDGSV SSYAWDFGDG DTATGQTASH TYGSAGDYTV TLTVTDDDGA SGTASTTVSV SESDNEAPNA SFTVSPSSPT TGESVTVDAA DSSDADGSIS SYAWDFGDDA TASGQTATHT YDSSGEYTIT LTVTDDDGAT DTNATTVSVG GDGGECSGVS EWDSGTTYTG GDQVIYDGTL WEAKWWSKGD EPSSDGGPWK QITACGPPEP VTKTLADLVP KPGDITTADD GFEITSSTTI VAEGSGTEVG QYLADLLGPA TGFDLSVESG SSASADSIAL LLNGAPSSVG DEGYEMSVDS DGVTIRANEA AGLFYGVQSL RQVLPAAVEA DTDQSVDWVV PGGSVTDTPR FEYRGAMLDV ARHFFDKSVV KEFIDQVAAY KINHLHLHLT DDQGWRIEID DWPNLTDEGA DSEVDGGPGG YFTKADYQEI IQYAQDRHMT VVPEIDMPGH TGAALESYAE LNCDDTKREE DTGINVGDTT LCMDDEHKET SLQFAADVIS AVAEMTDGPY FHVGGDEADV LSDAKYEEFI DAVLPMIEDA GKTPIGWHQI ASTEPVTSAL LHYWGTDAQA PEVAARASEG NDVIASPAHL AYLDQDYNYQ DGVGQDWAGP VSVEDAYTWD PGSYIDGVDE SSVAGVEAPL WTEFVETQDD IEYMVFPRLA AIAELGWSSS SDIGDFDAFS QRLALQGPRW AQANVNYYQS DLVDWQ
|