Gene Hmuk_3227 details


Gene Information

Locus tag: Hmuk_3227
Symbol:
ID: 8409302
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013201
Strand:
Start bp: 8010
End bp: 11405
Gene Length: 3396 bp
Protein Length: 1131 aa
Translation table: 11
GC content: 65%
IMG OID: 645018163
Product: Hyaluronate lyase
Protein accession: YP_003175688
Protein GI: 257372914
COG category:
COG ID:
TIGRFAM ID:
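
The coordinate and length fields in this record are mutually consistent, which a little arithmetic confirms. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes one stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(8010, 11405)
print(length, protein_length_aa(length))  # prints: 3396 1131
```

Both values match the Gene Length and Protein Length fields reported in the record.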


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGACG GCTGGTCACG CCGCTCGGTG CTCAAGTCCA GTCTGGGACT GTCGCTGGCC 
GGTGTTTCGC TGTCTGGTAC GACCGAGACA GTGACCGGCG CGAGCGAGTA CGAGACGCTC
AGACAGCGGT GGGCACAGTT GCTCACCGGT GGCGACTTCG ACGAGACACA GTCCGAGTAT
CAGGATCCGC TGGCCGAGCT GGACCAGACG GCACAGGACC ACTGGGAGAC GATGGACACC
AGCGCCGACC GTGATCGGCT CTGGTCCGAC CTCCCGATCC CGGCGTCGTC CAGTGCGAGC
GCGAGCGAGA GCAACATCAC CGACAGCTAC GGCCGCCTCC AGGAGATGGC CATGGCCTAC
GCCACGAACG GGAGTTCACT GGAGAACGAT AGCGCGCTCG TGACAGACGT CGTCGACGGA
CTCGACTTCC TCTACGATCG GGTCTACAAC GAAGACCAGT CTCAGTTTGG CAACTGGTGG
CACTGGGAGA TCGGCTCGCC GATGCGTCTC GTCAGCGTCT GCGCGCTGGT CGGCGACGAA
CTGTCCTCGA CACAGGAGAC AAACTACACG AACGCCGTCG GCGCTCACAC CGGAACGCCC
TACGAGTACA CGGAGTACGA CGTGACGAGT GGCGGTGCCA ACCGCGTCGA CATGTGTATC
ATCACCGCGC TCCGTGGCGC GATCTCGGGG ACGGACTCGA CGATCGCTCT CGCCCGAGAC
TGCATCGAAG AGAGCGATAT CTTCCAGTAC AACACCAGCG GCGGCGGCAA CGGACTCTAC
CGCGACGGAT CTTACGTCTA TCACGAGGAG ATACCCTACA TCGGGTCGTA CGGTGCAATC
TTGCTCGAAG GGCTCGGTGA ACTGTTCACG GTTCTGGACG GGACGACTTG GGAGATCACG
GCCGTCGACC ACGACGTGAT CTACGACGCC GTCGGTGACG CGGTCGCCCC GTTCATGTAC
AGAGGGCTGA TGATGGACGC GGTCAGCGGC CGGTCGATCT CGCGAGCAGA CCAGACCGAC
CACGTACGCG GCCACGGCAT CACCGCGACC GTCCTTCGGC TGGCGAATAC CGCACCGGAA
CCCTACGCCA GCGAGTTCCG GTCGCTCGCC AAGGGCTGGA TCGAGAACGA CACGTGGGAC
AGTTTCCTGA GCGATGCAGA CGTGCCCGAC ATCGCGAACG CGACGGCGGT CCTCGACGAT
TCGACGATCA GTGCCGCCGA CGAACCGGTC CGACACGACG TGTTCCACAA CATGGACCGG
GTCGTACACA ACCGCTCGGA GTGGGCCTAC ACGATCAGCA TGTGCTCGGA GCGGATCGCT
CGCTACGAGG CGATCAACGA GGAGAACCTC CGTGGCTGGT ACACCGGTGC CGGGATGACC
CAGCTGTACA ACGACGACCT CGGCCACTAC ACCGACGGCT ACTGGCCCAC CGTCGACCCC
TACCGGCTCC CCGGAACGAC CGTCGACACC CGCGATCGGT CCGCCCTCGA CGGCACTCAC
CACCCACGAC CGTCGACACA GTGGGTCGGC GGCGCGTCGG TCGACGAGTT CGGGATCGCC
GGAATGGAGT TCGACGCCGA GGGCGCATCG CTCACCGGCA AGAAGTCCTG GCTACTCCTC
GACGACACCG TCGTCGCACT CGGAGCCGAC ATCACCAGCT CCGACGGTCG CCCGATCGAG
ACCACCGTCG AGAACCGGAA TCTCCACACG GACGGAAGCG AGACGCTCAC CGTCGACGAA
ACCGAGAAGT CCACGACACC GGACTGGTCC GAGACGCTCA CCGACGTGTC CTGGGCGCAC
CTCGACGGCG TCGGTGGCTA CCTGTTCCCG AACCAACCGA CCCTCGAAGC CAAACGCGAG
GAACGCACCG GTTCCTGGCA GGAGATCAAC GCGGGCGGAC CGAGCGAGGC GCTGACCCGC
GAGTACCAGA CCCTCTGGCT CGACCACGGT GTCGATCCCA GCGCCGAGAC GTACGCGTAC
GCACTGCTCC CGGGCCACAC CGCGTCCGAG ACGCGACAGC GGAGTCAGGA GCCCGGCTTC
GAGATCGTCG CCAACGACGC CACGGTCCAG GCCGTCACGG TGCCCCGTCT GGGGTTGACG
GCCGCCAACT TCTGGAGTAG CGGTTCGATC ACGGTCCCCG GCACCGAGCG AACGCTCTCG
GTGAGCGGCC CGGCAGCCGT CGTCGTCAGA CACCGAAACG ACGAACTCGT GGTCGGCGTC
GCCGACCCCT CCCGAACACA GGAGACGGTC ACTGTCGAGT ACGATCACTA CACGGACGGG
ATCGTCGGTA CTGACTCGGC GGTCGGTGTC ACGCAGTTCC AACCACGCGT CACGATAGAG
GTCGCCGTCG GCGGAACTCG CGGTACGACC CACTCGGCGA CCTTCGACGC CCCCGTCACA
GAGCTCTCAC CGCGTGCAGA CACGTTCGTT CGCGACGGCT CGTACGCTGG CGACAACTAC
GGGTCGTGGT CCTCGTTGGT CGTCAAAGGC GGTCCGACGG GCTACAGCAG AGAGTCGTAT
CTCGCGTTCG ACCTGGCGTC GGTCGCGGGC GAGGTTCAAG AGGCCATGCT CGACGTGTAC
GGCGCGGTCA CCGACGACAA CGGCGGAGCG TCCGTCGACT GTACAGTCGC CGCCGTCGAC
GACGACAGCT GGACGGAGGA CGGACTCACC TGGGATACGA AACCCGATCT GGGCTCGTCG
CTGGGCTCTC TGACCGTCAC ACGGGAACGC CGCTGGTGGC GCGAAGACGT GACGGAGTTC
GTACAGACAG CCGCCAGCGG TGACGGAACC GCCAGCGTCG CGCTTCGGCA ACCGAACGAC
GAGCGCTACG CCAGCTTCGA TAGCCGAGAA GCAGACGAGA ACCCGCCGTC ACTGCGCGTC
ACGACCAGTC GCCCCGACAC GACGGCACTC ACCCCGACAG CAGACACCTT CGTCCGGAAC
GGGTCGTACT CCGGTGACAA CTACGGGTCG TGGTCCTCGC TGGTCGTCAA GAACGCGTCG
ACCGGCTACA GTCGCCAGGG GTACCTCACC TTCGACCTGA GCGCGCTGTC GGGTTCGGTC
GACGAGGCCG TTCTCTACCT CTACGGCGCG GTCACCGACG ACAGCGGCGG AGATGCCGTC
GACTGTGCGA TCAACGCCGT CGACGACGAC AGCTGGACGG AGGACGGAGT TACATGGGAC
ACGAAGCCCG AGGTGGGCTC GGCACTCGGT TCCGTGATCG TCACCCGGAC GCCACAGTGG
TGGACCGTCG ACGTGACCGA GTTCGTCCGA TCGGAGGCCA GCAGCGATGG CGTCGTCAGT
CTGGCCGTCC AGCAGCCACA GAGCGGCCTG TACACCGACT TCAACAGCCG GGACGCCGAC
GAGAAGGTGC CGACGCTTCG GGTGCAGACC TCGTAG
 
Protein sequence
MSDGWSRRSV LKSSLGLSLA GVSLSGTTET VTGASEYETL RQRWAQLLTG GDFDETQSEY 
QDPLAELDQT AQDHWETMDT SADRDRLWSD LPIPASSSAS ASESNITDSY GRLQEMAMAY
ATNGSSLEND SALVTDVVDG LDFLYDRVYN EDQSQFGNWW HWEIGSPMRL VSVCALVGDE
LSSTQETNYT NAVGAHTGTP YEYTEYDVTS GGANRVDMCI ITALRGAISG TDSTIALARD
CIEESDIFQY NTSGGGNGLY RDGSYVYHEE IPYIGSYGAI LLEGLGELFT VLDGTTWEIT
AVDHDVIYDA VGDAVAPFMY RGLMMDAVSG RSISRADQTD HVRGHGITAT VLRLANTAPE
PYASEFRSLA KGWIENDTWD SFLSDADVPD IANATAVLDD STISAADEPV RHDVFHNMDR
VVHNRSEWAY TISMCSERIA RYEAINEENL RGWYTGAGMT QLYNDDLGHY TDGYWPTVDP
YRLPGTTVDT RDRSALDGTH HPRPSTQWVG GASVDEFGIA GMEFDAEGAS LTGKKSWLLL
DDTVVALGAD ITSSDGRPIE TTVENRNLHT DGSETLTVDE TEKSTTPDWS ETLTDVSWAH
LDGVGGYLFP NQPTLEAKRE ERTGSWQEIN AGGPSEALTR EYQTLWLDHG VDPSAETYAY
ALLPGHTASE TRQRSQEPGF EIVANDATVQ AVTVPRLGLT AANFWSSGSI TVPGTERTLS
VSGPAAVVVR HRNDELVVGV ADPSRTQETV TVEYDHYTDG IVGTDSAVGV TQFQPRVTIE
VAVGGTRGTT HSATFDAPVT ELSPRADTFV RDGSYAGDNY GSWSSLVVKG GPTGYSRESY
LAFDLASVAG EVQEAMLDVY GAVTDDNGGA SVDCTVAAVD DDSWTEDGLT WDTKPDLGSS
LGSLTVTRER RWWREDVTEF VQTAASGDGT ASVALRQPND ERYASFDSRE ADENPPSLRV
TTSRPDTTAL TPTADTFVRN GSYSGDNYGS WSSLVVKNAS TGYSRQGYLT FDLSALSGSV
DEAVLYLYGA VTDDSGGDAV DCAINAVDDD SWTEDGVTWD TKPEVGSALG SVIVTRTPQW
WTVDVTEFVR SEASSDGVVS LAVQQPQSGL YTDFNSRDAD EKVPTLRVQT S
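
The sequences in this record are printed in space-separated blocks of ten, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own method, of how one might clean the gene sequence, compute its GC fraction for comparison with the reported GC content, and check that it has the shape of a complete coding sequence. The helper names are hypothetical, and the start-codon set is an assumption covering only the most common starts under translation table 11.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the sequence starts with a
    start codon and ends with a stop codon."""
    starts = {"ATG", "GTG", "TTG"}  # assumption: common starts under table 11
    stops = {"TAA", "TAG", "TGA"}
    return len(seq) % 3 == 0 and seq[:3] in starts and seq[-3:] in stops


# Demonstration on the first printed line of the gene sequence only;
# paste the full sequence here to compare against the record's fields.
first_line = clean_sequence(
    "ATGTCTGACG GCTGGTCACG CCGCTCGGTG CTCAAGTCCA GTCTGGGACT GTCGCTGGCC"
)
print(len(first_line), gc_fraction(first_line))
```

Applied to the full gene sequence, `len` should agree with the Gene Length field and `gc_fraction` should round to the reported GC content, provided the sequence was copied without loss.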