Gene Hmuk_3416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_3416 
Symbol 
ID8409494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013201 
Strand
Start bp220610 
End bp221812 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content68% 
IMG OID645018337 
Productglycosyl hydrolase family 88 
Protein accessionYP_003175858 
Protein GI257373084 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGAA GTCCTCAGGC GCTCGCGGCC GCAGTGACCG AACCGGCGCT TCCCGACCGA 
TACTTCGAGC GGCCCGAGCG AAGCCAGGAA CAGTTAGAAC GGGCGTTGAC GGACGCGATC
GAGCGGATCG GCGAGAACCT CGACCGGTAC TACGACCGGT TCCCGACGGC TTCGAGCGAC
GACCTCGTCT ACGGGTCGAC CGACAACACC GACGGGTGGA CGACCGCCTT CTGGACCGGA
CTGTGCTGGC TCGCCTACGA CGTGACCGGC CAGCGGCGGT TCAGAGACGC CGCCGAGGCA
CAACTGGAGA CGTTCGCGCA CCGCCTCGAC GACGGCCTCG TCGAGACGCA CGATCTGGGC
TTTCTGTACA CGCTGTCGGC GGTCGCCGGC TACCGGCTCA CCGACGAGGA GCGGTATCGA
TCGATCGCGC TCCGCGGGGC CGATCTGCTC ACCGACCGCT ACTGGCAGGC TCCCGGGCTC
CTCCAGGCCT GGGGGAGCAT GGACGACGAA GACGACGAGA ACCGCGGGCG GATGATCGTC
GACACGATGA TGAACCTCCC GCTGCTGTTG TGGGCCAGCG AGGTCACGGA AGAGCCGCGG
TACCGAGCTA TCGCGGCCTC CCACGCCCGC ACGAACGCCG CCCACATCGT CCGCCCGGAC
GCCTCGACGT TTCACACGTT CCGGTGTACC GTCGATGACG GGACGCCACT GGGTGGTGAG
ACGGCCCAGG GGTACGACGA CGACTCCTGC TGGTCGCGCG GGCAGACGTG GGCGATCTAC
GGCTACGCGG TCGCCGCCGA CTACCTCGAC ACCGCTGCCT ACGCGGGGCT CTCGGCCAAG
GTCGCGAACT ACTACCTCTC GCACGTCGAG GACGACCACG TCCCGCTGTG GGACTTCGAC
GCCCCGACTG ACCCGGCGAT CCGAGACAGC TCGGCCGCCG CCGTCGCCGC CTGCGGGCTG
GACGAACTCT CCCGACAGCT GCCAAGCGGC GACGAGCGCG TCCCGGCCTA CCGCAACGCC
TCGCTGGCGA CGCTGGCCAG TCTCACCGAG CACTACACCG CGGGCGCGGA CTCGAACGGA
CTCCTGACCG ACGGTGCGTA CCACCCGTCG GACGGCGACT ACGGCGAGTG TTGCATCTGG
GGCGACTACT TCTACGTCGA GGCGCTCGTC CGGGCGACCC GACACTACGA CCGGTTCTGG
TAA
 
Protein sequence
MSRSPQALAA AVTEPALPDR YFERPERSQE QLERALTDAI ERIGENLDRY YDRFPTASSD 
DLVYGSTDNT DGWTTAFWTG LCWLAYDVTG QRRFRDAAEA QLETFAHRLD DGLVETHDLG
FLYTLSAVAG YRLTDEERYR SIALRGADLL TDRYWQAPGL LQAWGSMDDE DDENRGRMIV
DTMMNLPLLL WASEVTEEPR YRAIAASHAR TNAAHIVRPD ASTFHTFRCT VDDGTPLGGE
TAQGYDDDSC WSRGQTWAIY GYAVAADYLD TAAYAGLSAK VANYYLSHVE DDHVPLWDFD
APTDPAIRDS SAAAVAACGL DELSRQLPSG DERVPAYRNA SLATLASLTE HYTAGADSNG
LLTDGAYHPS DGDYGECCIW GDYFYVEALV RATRHYDRFW