Gene Hmuk_3226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_3226 
Symbol 
ID8409301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013201 
Strand
Start bp4681 
End bp7782 
Gene Length3102 bp 
Protein Length1033 aa 
Translation table11 
GC content68% 
IMG OID645018162 
Productglycoside hydrolase family 2 TIM barrel 
Protein accessionYP_003175687 
Protein GI257372913 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.553706 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGACT GGACAGATCC GCGAGTAGTC GGTCGTAACA GGCTCGCACC ACACACTGAC 
GTGTTCCCGT TCTCGGACGA GCAATCTGCA CGCCGGGACA GCGTGACTGC CTCGCCGTGG
GTTCGGCGAT TGAACGGCGA GTGGCAGTTT CACCTCGCCG AGACCCCTGC GGACGCGCCA
GCGATCCCGG GCGCGACGGA CGACGTCGAC TGGGACCGAA TCGAGGTCCC GCTGAACTGG
CAACTCGACG GCCACGGACA CCCACACTAC ACGAACGTCG TCTACCCGTT CCCGGTCGAT
CCGCCCCACG TTCCGACCGA GAACCCGACG GGGACGTACC GGCGCTCCGT CCACGTCGAC
GAGGACTGGG ACGGTCGACA GATCCGCCTG CGCTTCGAGG GGGTCGACTC CGCCTTCCAC
CTCTGGGTCA ACGGCGAGCG CGTCGGCTAC AGCGAAGGCG CGCGTCTCCC CGCGGAGTTC
GACGTGAGCG ACTACGTCGA GCCGGGGGAG AACACGGTCA CTGCACGGGT CTACAAGTGG
TCTAACGGGA GCTACCTCGA AGATCAGGAC ATGTGGTGGC TCAGCGGCAT CTTCCGTGAC
GTCACCCTGT ACGCCGTGCC GGAGACTCAC GTCGCGGACG TCGACGTCCG TACCGAACTG
GACGACGACT ACGTCGACGC CCGGTTCTCC GCGGCGGTCG ACATCGAATC GGTAGCCGGT
GGCGACGCGA CGGGCCGTCT CCGCGCCTCT CTGGTCGACG AAGCGGGCGA CACTGCCGCC
ACCTTCGCGG AGTCGTACGC GCTGACCGAC GGTGAGACGA CGCTCTCGCT GTCGACGACC
GTCGAGGACC CGGACAAGTG GACCGCAGAG ACGCCGGACC GGTACACGCT CCTGGTGACG
CTACTGGACG ACGGAACGCC CGTCGAGACC GTTCGCGAGA CGGTCGGCTT TCGCGAGGTC
GAGATCGACG GCGGCCAGTT CCTGGTCAAC GGCGAAGCCG TGACGATCCG CGGCGTGAAC
CGCCACGACT TCGATCCCGA CCGGGGGCGG GCCGTCACCG TCGACCAGAT GCGCGCGGAC
ATCGAGCTGA TGAAACGGCA CAACGTCAAC GCCGTCCGGA CCGCTCACTA CCCCAACGAC
ACGCGGTTCT ACGACCTCTG TGACGAGTAC GGGCTCTACG TCATGGACGA GACCGACATC
GAGTGTCACG GGATGGAACG CATCGACGCC GTCCAGCATC TCAGTGACGA CCCCGCGTGG
GAAGACACCT ACGTCGATCG GATGGTCCGG ATGCTCGAAC GAGACAAGAA CCACCCGAGC
GTCGTGATCT GGTCGCTGGG CAACGAGTCC GCAGTCGGGG CCCACCACGA GACGATGTAC
GAGCTGACAC GCGAGCGCGA TCCGACGCGG CCGGTCCACT ACGAGCAGGA CCACGACCAG
CGGGTCAGCG ACATCGTCGG GCCCATGTAC ACCCCGCCCG AGGACATCGA GGCCCTGGCA
GTCGACGATC CGGACCACCC CGTGATCCTC TGTGAGTACG CCCACGCCAT GGGGAACGGT
CCGGGCGGGT TCGAGGAGTA CTGGGACGTA TTCGACGGCC ACGAGCGACT GCAGGGCGGG
TTCGTCTGGG ACTGGATCGA CCAGGGCATC CGCCAGACGA CCGAGGACGG AGCGGAGTGG
TTCGCCTACG GCGGCGACTT CGGCGACGAG CCCAACACGG GGAACTTCAA CATCAACGGT
CTCGTCTTTC CCGATCGAAC GCCCTCGCCG GGCCTCACCG AGTTCAAGAA CGTCGTCGAG
CCGGTCACGT TCGAGCCCGC AGCCCTCGAA CGGGGCGAAC TCGTCGTCGA GAACCGCTAC
GACTTCCGCG GGCTAGATCA CCTCCGCGCC CGGTGGCGCG TCGAGGCCGA CGGCCGGGTC
GTCCAGAGTG GCACGCTCGA CCTCCCAGCC GTCGCACCCG GCGACAGCGA ACCCGTCACG
GTGCCGTTCG AGGATGCTGG CCGGGGCGAG CGGTTCCTCG TCGTCGAGGT CTCGCTGGCC
AGTGACACGT CGTGGGCACC GGCCGGCCAC ACGGTCGCAA CGAGCCAGCA CGAACTCCCG
ACGAGCGAGG CACCGACGGT GCCGGCCGCG ACGCTTCCCG CACCGCTGTC CTGTGAACGC
AGCGACGACG AGATCGTCGT CTCGAACGCG GAGTTCGAGC TCCGGTTCGA CGCCCGATAC
GGCGTCGTCG ACTCGCTGTC CTACCGCGGT CGGTCGGTCG TCACCGAGGG TCCCGAGGTC
GGACTCTGGC GTGCCCCGAC GGACAACGAC AGAGGGTTGC CGCTGGTCCC GACTTTCTTC
ACGCGCTTCC TCGAACTACA CGAAAACGAG GAACCGATCG CCGACTGGGA CGCCCGGACG
GTCGGCTTCG CACAGATCTG GCGCGAGCAC GGTCTCGACA GCTTGCAGTC CCGCGTCGAC
GCCGTCGACA CCGAGGTCGG GGAGGAGACG GTCACGATCA CCGTCGACGG TCGCCTCGCG
CCGCCGATGT TCGCCCACGG GTTCGCGACG ACGCAGACGT ACACGATCCA CCCCACCGGA
GCCGTCGAGA TCGAGACGGA TCTCGACCCT GAGGGCGACC TCTCGATGCT CCCGTCGCTG
CCACGGATCG GACTCGACCT GACGCTCGAC GGGGACTTCG ACCACGTCAC GTGGTACGGC
CGCGGCCCGG GCGAGTCCTA CGCCGACAGC GAGCAGGCGT CCCCCGTCGG TCGGTACGAG
GCCGACGTTG CCGACCTCCA CACGCCCTAC GTCAGGCCCC AGGCGAACGG CACTCGCAGC
GACACCCGCT GGGTGGCCGT TACCGACCGC AACGGTACCG GGCTCCTCGC GACCGGGGAC
TCGCTGCTCG ACGTGACCGC ACACCACTAC TCGACCGAGC AGCTGGAGGC CGCCGATCAC
GAACACGAAC TGTCCCGCGA GGACGACGTG TTCCTCTCGC TGGACCACGC ACACTCCGGA
CTCGGGTCGG GCAGCTGTGG GCCCGAGACG TTCGAGTCCG ACCGCGTCCA GCCCGAGCGG
ACCTCGTTCA CGGTCACGCT CCACCCGTAC GTCGACGACT GA
 
Protein sequence
MPDWTDPRVV GRNRLAPHTD VFPFSDEQSA RRDSVTASPW VRRLNGEWQF HLAETPADAP 
AIPGATDDVD WDRIEVPLNW QLDGHGHPHY TNVVYPFPVD PPHVPTENPT GTYRRSVHVD
EDWDGRQIRL RFEGVDSAFH LWVNGERVGY SEGARLPAEF DVSDYVEPGE NTVTARVYKW
SNGSYLEDQD MWWLSGIFRD VTLYAVPETH VADVDVRTEL DDDYVDARFS AAVDIESVAG
GDATGRLRAS LVDEAGDTAA TFAESYALTD GETTLSLSTT VEDPDKWTAE TPDRYTLLVT
LLDDGTPVET VRETVGFREV EIDGGQFLVN GEAVTIRGVN RHDFDPDRGR AVTVDQMRAD
IELMKRHNVN AVRTAHYPND TRFYDLCDEY GLYVMDETDI ECHGMERIDA VQHLSDDPAW
EDTYVDRMVR MLERDKNHPS VVIWSLGNES AVGAHHETMY ELTRERDPTR PVHYEQDHDQ
RVSDIVGPMY TPPEDIEALA VDDPDHPVIL CEYAHAMGNG PGGFEEYWDV FDGHERLQGG
FVWDWIDQGI RQTTEDGAEW FAYGGDFGDE PNTGNFNING LVFPDRTPSP GLTEFKNVVE
PVTFEPAALE RGELVVENRY DFRGLDHLRA RWRVEADGRV VQSGTLDLPA VAPGDSEPVT
VPFEDAGRGE RFLVVEVSLA SDTSWAPAGH TVATSQHELP TSEAPTVPAA TLPAPLSCER
SDDEIVVSNA EFELRFDARY GVVDSLSYRG RSVVTEGPEV GLWRAPTDND RGLPLVPTFF
TRFLELHENE EPIADWDART VGFAQIWREH GLDSLQSRVD AVDTEVGEET VTITVDGRLA
PPMFAHGFAT TQTYTIHPTG AVEIETDLDP EGDLSMLPSL PRIGLDLTLD GDFDHVTWYG
RGPGESYADS EQASPVGRYE ADVADLHTPY VRPQANGTRS DTRWVAVTDR NGTGLLATGD
SLLDVTAHHY STEQLEAADH EHELSREDDV FLSLDHAHSG LGSGSCGPET FESDRVQPER
TSFTVTLHPY VDD