Gene Hmuk_1505 details


Gene Information

Locus tag: Hmuk_1505
Symbol:
ID: 8411026
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1431335
End bp: 1433785
Gene Length: 2451 bp
Protein Length: 816 aa
Translation table: 11
GC content: 67%
IMG OID: 645019831
Product: Glycoside hydrolase, family 20, catalytic core
Protein accession: YP_003177327
Protein GI: 257387554
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
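
The length fields above agree with the coordinates if the start and end positions are 1-based and inclusive, and if the gene span includes one stop codon. Both conventions are assumptions here, since the record does not state them. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the span ends with a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1431335, 1433785)
print(length, protein_length(length))  # 2451 816
```
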


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.869244
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAGAGA CACGACGCAA CATCCTACGG AAGGCATCGG CACTGTCTGC GCTGGCGATC 
GGCGTCAGCG GGACCGCATC GGCAGCAGAT TGCAGCGACG TTCAGCAGTG GCAATCCGGC
ACCGCCTACA ACGGCGGCGA CCGCGTCGTT TACGACGACG TTCTCTGGGA AGCCGAGTGG
TGGACCCAGG CCAACGAACC CGAAGAGAGC GACTCGGTCT GGACGAAGAT CGGCGACTGT
AGTGGCGACA CCGAGAACGA GGCTCCGGTC GCGTCGTTCA CGGCGAGCAT CAGTACACCC
GCGCCGGGCG AGTCCGTCAC CTTCGACGCG TCGGGCTCGT CCGATCCGGA CGGCTCCGTG
AGTTCGTACG CCTGGGACTT TGGCGACGGC GACACGGCGA CCGGACAGAC CGCGAGTCAC
ACGTATGGCT CTGCAGGCGA TTACACGGTC ACACTCACCG TCACCGACGA CGACGGCGCG
AGCGGGACGG CATCGACGAC GGTGTCCGTC TCGGAAAGCG ACAACGAGGC TCCGAACGCG
TCGTTCACCG TCTCGCCGTC CTCGCCGACG ACGGGCGAGT CGGTGACCGT CGACGCTGCC
GACTCCTCGG ACGCCGACGG GTCGATTTCG TCGTACGCCT GGGACTTCGG CGACGACGCG
ACCGCCAGCG GTCAGACCGC GACCCACACG TACGACTCGT CGGGCGAGTA CACGATCACA
CTCACCGTCA CCGACGACGA CGGTGCGACC GACACCAACG CGACGACAGT CAGCGTCGGC
GGCGACGGCG GCGAGTGTAG CGGCGTCTCG GAGTGGGACT CCGGAACGAC GTACACCGGC
GGCGACCAGG TCATCTACGA CGGCACGCTC TGGGAAGCCA AGTGGTGGAG CAAGGGCGAC
GAGCCCAGTA GCGATGGAGG CCCCTGGAAG CAGATCACCG CCTGTGGGCC GCCCGAGCCA
GTCACGAAGA CGCTGGCCGA TCTCGTGCCG AAGCCGGGCG ACATCACGAC CGCCGACGAC
GGCTTCGAGA TCACGTCCTC GACGACGATC GTCGCCGAGG GCAGCGGGAC CGAAGTCGGA
CAGTACCTCG CCGACCTGCT GGGGCCGGCG ACCGGCTTCG ATCTGTCCGT CGAGAGTGGC
TCGTCCGCGT CCGCGGACAG CATCGCGCTG TTGCTCAACG GCGCTCCCTC GTCGGTCGGC
GACGAGGGCT ACGAGATGAG CGTCGACAGC GACGGCGTCA CGATCCGAGC CAACGAGGCC
GCGGGGCTGT TCTACGGCGT CCAGTCGCTT CGCCAGGTGC TCCCGGCGGC GGTCGAGGCC
GACACCGATC AGTCCGTCGA CTGGGTCGTC CCCGGCGGTT CGGTCACCGA CACACCGCGC
TTCGAGTACC GCGGCGCGAT GCTCGACGTG GCACGGCACT TCTTCGACAA GTCGGTCGTC
AAGGAGTTCA TCGACCAGGT GGCGGCCTAC AAGATCAATC ACCTGCACCT GCACCTGACC
GACGACCAGG GCTGGCGTAT CGAGATCGAC GACTGGCCGA ACCTCACCGA CGAAGGGGCA
GACTCGGAGG TCGACGGCGG GCCCGGCGGC TACTTCACGA AGGCCGACTA CCAGGAGATC
ATCCAGTACG CGCAGGATCG CCACATGACG GTCGTCCCGG AGATCGACAT GCCCGGCCAC
ACCGGGGCGG CCCTGGAGTC GTACGCGGAA CTGAACTGTG ACGACACGAA ACGCGAGGAA
GACACCGGCA TCAACGTCGG CGACACCACG CTGTGCATGG ACGACGAGCA CAAGGAGACG
AGTCTCCAGT TCGCAGCCGA CGTCATCAGC GCGGTCGCCG AGATGACGGA CGGCCCGTAC
TTCCACGTCG GAGGCGACGA AGCCGACGTG CTGTCGGATG CCAAGTACGA GGAGTTCATC
GACGCGGTCC TCCCGATGAT CGAGGACGCC GGAAAGACCC CCATCGGATG GCACCAGATC
GCCAGCACGG AGCCCGTTAC GTCTGCGCTC CTCCACTACT GGGGAACCGA CGCACAGGCC
CCGGAGGTCG CGGCCCGGGC CAGCGAGGGC AACGACGTCA TCGCCTCGCC CGCCCACCTC
GCGTACCTCG ATCAAGACTA CAACTATCAG GACGGTGTGG GCCAGGACTG GGCGGGACCG
GTCTCGGTCG AGGACGCCTA CACCTGGGAT CCGGGCAGCT ACATCGACGG CGTCGACGAG
TCCTCGGTCG CCGGCGTCGA GGCACCGCTG TGGACGGAGT TCGTCGAGAC CCAGGACGAC
ATCGAGTACA TGGTGTTCCC GCGGCTGGCG GCCATCGCGG AACTGGGCTG GTCGTCGTCG
TCCGATATCG GCGACTTCGA CGCGTTCAGC CAGCGTCTGG CCCTGCAGGG CCCACGCTGG
GCGCAGGCGA ACGTCAACTA CTACCAGTCT GATCTGGTCG ACTGGCAGTA G
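
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The following Python sketch shows one way to do that and to compute a GC percentage. The function names are hypothetical, and it assumes GC content means the share of G and C bases over the full sequence length; the record does not say how its own figure was derived.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First printed line of the gene sequence, used only as a short example.
first_line = "ATGAGAGAGA CACGACGCAA CATCCTACGG AAGGCATCGG CACTGTCTGC GCTGGCGATC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_percent` gives the value for the whole gene.
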
 
Protein sequence
MRETRRNILR KASALSALAI GVSGTASAAD CSDVQQWQSG TAYNGGDRVV YDDVLWEAEW 
WTQANEPEES DSVWTKIGDC SGDTENEAPV ASFTASISTP APGESVTFDA SGSSDPDGSV
SSYAWDFGDG DTATGQTASH TYGSAGDYTV TLTVTDDDGA SGTASTTVSV SESDNEAPNA
SFTVSPSSPT TGESVTVDAA DSSDADGSIS SYAWDFGDDA TASGQTATHT YDSSGEYTIT
LTVTDDDGAT DTNATTVSVG GDGGECSGVS EWDSGTTYTG GDQVIYDGTL WEAKWWSKGD
EPSSDGGPWK QITACGPPEP VTKTLADLVP KPGDITTADD GFEITSSTTI VAEGSGTEVG
QYLADLLGPA TGFDLSVESG SSASADSIAL LLNGAPSSVG DEGYEMSVDS DGVTIRANEA
AGLFYGVQSL RQVLPAAVEA DTDQSVDWVV PGGSVTDTPR FEYRGAMLDV ARHFFDKSVV
KEFIDQVAAY KINHLHLHLT DDQGWRIEID DWPNLTDEGA DSEVDGGPGG YFTKADYQEI
IQYAQDRHMT VVPEIDMPGH TGAALESYAE LNCDDTKREE DTGINVGDTT LCMDDEHKET
SLQFAADVIS AVAEMTDGPY FHVGGDEADV LSDAKYEEFI DAVLPMIEDA GKTPIGWHQI
ASTEPVTSAL LHYWGTDAQA PEVAARASEG NDVIASPAHL AYLDQDYNYQ DGVGQDWAGP
VSVEDAYTWD PGSYIDGVDE SSVAGVEAPL WTEFVETQDD IEYMVFPRLA AIAELGWSSS
SDIGDFDAFS QRLALQGPRW AQANVNYYQS DLVDWQ
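
As a rough consistency check between the two sequences, the gene sequence can be translated codon by codon. The sketch below is a minimal Python illustration with hypothetical names, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons), and it assumes the sequence is given in the coding orientation starting at the first codon.

```python
from itertools import product

# Codon table in NCBI base order T, C, A, G; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence listed above.
print(translate("ATGAGAGAGA CACGACGC"))  # MRETRR
```

The printed residues match the start of the protein sequence listed above.
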