Gene Hmuk_3174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_3174 
Symbol 
ID8412727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp3065499 
End bp3067085 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content69% 
IMG OID645021521 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_003178986 
Protein GI257389213 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.108856 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.164805 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCTGC CAGACGTGGC CGAACTCGAT CTCGCCACCA AAGTCGGACA GCTGTTCGTC 
GTCGGCTTCG AGGGACCGGA GCCGACGGAG GACCTCCGCG AACTCCTCAC CGACTATCGC
TGTGGCAACG TCATCTACTT CAGCCGGAAC ATCGACTCGC CCGAGCAGGT CGCCGAGCTC
AGTCGCGAGC TACAGACCAT CGCCACCGAA GCGGGACCCG AGATCCCGCT GTTCGTGACG
GCCGACCAGG AGGGCGGCGT CGTCTCACGG ACCGACTGGG GGACCGAACC CCCGAGTCAG
ATGAGCATCG GTGCGGGCCG GGACGCCGAC CTCGCCCGTT CGGTGGGTGG CGCTGTCGGG
GCGGAACTCG CGTCGATCGG CGTCAACTTC GATCTGACGC CGGTGCTGGA CGTGAACAAC
AACCCGGACA ACCCCGTCAT CGGCGTCCGG TCGTTCGGCG AGGAGCCCGA ACTCGTCGGC
GATCTCGGCG CGGCGATGGC CGACGGCATG CAGTCCGAGG GCGTGCTGGC CTGTGGGAAG
CACTTCCCGG GCCACGGCGA CACCAGCGCC GACTCGCACC ACTCGCTACC CGTCGTCGAC
CACGACCGCG AGCGTCTCGA CGCCGTCGAA CTCGCCCCCT TCCGCCGGGC GATCGACGCC
GGGATCGACG CGATCATGAC GACACACGTC TCGTTCCCGA CGATCACCGG CGACGACGAA
CTCCCGGCGA CCGTCTCCCG AGACGTACAG ACCGGACTCC TCCGCGAGCA GTTAGGGTTC
GATGGCCTGG TCGTCACCGA CGGAATGGAG ATGAACGCCA TCGCCGACGA GATGGGGACG
CCGGAAGGGT GTGTCCAGGC CGTCGAAGCG GGCTGTGATC TCCTCCTGGT CTGTCACACC
CCCGCGGTCC AGAAAGACTC GGTCGAGGCG GTCATCGACG CCGTCGAGTC CGGCCGCATC
GACGAGTCGC GGATCGACGA CGCTGTCGAG CGCGTCCTCG AGTACAAGGA GCGACGCGGC
GTCGGCCAGC AGACGCCCTC GCTCGACCGC TGGGAGGCGA CCAGCGATCG CTCCCGCGAG
GTCGGTCGTG AGGTCGCCGC GGCCGGCATC ACGGTCGCTC GGGATCGAAA CGAGACGATT
CCCTTCGACA CCGGACGACC GCTTCACCTC GTCGGCTTCC CCGGCGGACG CGCTTCGCCG
GCAGAGGACG ACCGCTACGA GCCGACGCTG GTCGCCGACG CACTCGAAGC CGGCGGCTTC
GACGTGGAAC TGCACGAGGT GGAGACCGCC GACGCCCTGC CGTCGTTCGA GGGCGACGAA
CAGGTCGTGC TGGCGACCTA CAACGCCGCT GGCGACGACG AACAGGTGCG AGCGGTCGAG
CGACTGGACG AGGCCGTCGA CGCCTTCGCC GCGCTGGTGG TGCGCAACCC CTACGACCTC
GCTCGCTTCC CCGACGTGTC GACGGCGGTG TCGACCTACG ACTACACGCC CGCGACGCTG
TCGGTGACCG GCGAGATCCT GGCCGGCCGG CGTCGGGCCA GCGGTCGGCT GCCGGTGACG
ATCGCGGGCT TCGAGGCCGA GAACTGA
 
Protein sequence
MSLPDVAELD LATKVGQLFV VGFEGPEPTE DLRELLTDYR CGNVIYFSRN IDSPEQVAEL 
SRELQTIATE AGPEIPLFVT ADQEGGVVSR TDWGTEPPSQ MSIGAGRDAD LARSVGGAVG
AELASIGVNF DLTPVLDVNN NPDNPVIGVR SFGEEPELVG DLGAAMADGM QSEGVLACGK
HFPGHGDTSA DSHHSLPVVD HDRERLDAVE LAPFRRAIDA GIDAIMTTHV SFPTITGDDE
LPATVSRDVQ TGLLREQLGF DGLVVTDGME MNAIADEMGT PEGCVQAVEA GCDLLLVCHT
PAVQKDSVEA VIDAVESGRI DESRIDDAVE RVLEYKERRG VGQQTPSLDR WEATSDRSRE
VGREVAAAGI TVARDRNETI PFDTGRPLHL VGFPGGRASP AEDDRYEPTL VADALEAGGF
DVELHEVETA DALPSFEGDE QVVLATYNAA GDDEQVRAVE RLDEAVDAFA ALVVRNPYDL
ARFPDVSTAV STYDYTPATL SVTGEILAGR RRASGRLPVT IAGFEAEN