Gene Hmuk_0843 details

Gene Information

Locus tag: Hmuk_0843
Symbol:
ID: 8410358
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 814031
End bp: 815710
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 69%
IMG OID: 645019179
Product: glycoside hydrolase family 9
Protein accession: YP_003176681
Protein GI: 257386908
COG category:
COG ID:
TIGRFAM ID:
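
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 814031
end_bp = 815710

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1680 bp) and Protein Length (559 aa).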


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGATTC TGGTCAACCA ACTCGGTTAC GAGACCGATG GCCCGAAACG AGCGGTCTGT 
CGCGCGACAG AGCGCCACGA CCTCGATGGC TTCGTGTTGC ACGACGGGGA CAGCGTCGTC
TTCGAGGGGA CGCCGGAGTT CGTCGGCGGC GTCGCCGACT GGGGAGAGTG GGTCTTCTGG
ACGCTGGAGT TCTCCTCGCT CACCGAGCCC GGCGAGTACA CGCTCCGGGC CGGCGAGGCC
CACTCGCGGC GCTTCGAGAT CGGCGAGGAC CTCCACAAGG AGACGCTCCT CTCCGATCTG
CTGTACTACT GCAAGACCCA GCGTGCCAGC GGCGAGTACG ACCGGGCCGA CCGATCGGTG
CCGTTCGTGG GCGACCGCGA GGGGACGGTC GACGTACGCG GCGGCTGGTA CGACGCCTCG
GGCGACATGA GCAAGTATCT GAGCCACCTC AGCTACGCCA ACTACTTCAA CCCACAGCAG
ATTCCGATCG TCGCCTGGGG ACTGGCCGAC GCCCGTGACA GACTGCACGA CGGCGACCAC
AGGCTGGGCG GGGAACTCGA CGCGCGACTG CGCGAGGAGA TCCACCACGG CGCGGACTTC
CTCGTCCGGA TGCAGGACGA CGCGGGCTAC TTCTACATGA CCGTCTTCGA CCAGTGGTCC
AAGGACGTGG ACCGCCGAGA GATCTGTGCC TACGAGACCG AAGAGGGACA CAAGACGATC
GACTACGAGG CCGGATACCG CCAGGGCGGA GGGGTCGCCA TCGCCGCGCT GGCTCGCGCC
AGCCAGGTCG AGGGACCGGG CGCGTTCGAC CGTGAGACCT ACCGCGAGGC CGCGGTCGAG
GGGTTCGATC ACCTGCAGGC CCACAACACG GAGTACCTCG ACGACGGGAC CGAGAACGTC
ATCGACGACT ACTGCGCGTT GCTCGCCGCC ACGGAACTGG CCGCCGCCAC GGACGACGAG
CGGTTCCGGA CGGCCGCCAG AGAGCGCGCT CAGTCGCTGC TGGACCGCCA GACCGGCGAC
GACCGCTACG ACGGGTGGTG GCGTGCCGAC GACGACGATC GCCCCTTCTA TCACGCGTCC
GACGAGGGAC TGCCGATCGT CGCGCTGTTG CGCTACCGGG CGGTCGACGC CGACGGGCCG
CTGGACGACG CCATCGTGAA CGCGATCGAG CGCTTCTGGG GCTTCCAGAC GACGGTCGGC
GACGAGGTCA CCAACCCCTT CGACTACCCG CGCCAGTACG CCAGACCGGT CGACGAGGAC
GAACCGCGCG CGTCGTTCTT CATGCCCCAC GAGAACGAGA CCGGCTACTG GTGGCAGGGC
GAGAACGCCC GGATCGCCTC GCTTGCGACC GCGGCCGCGC GGTCGCGGGC ACAGCTCGAC
GCGGAACTGG GCGAGCGCCT CGATCGGTTC GCCCAGGCCC AACTCGACTG GATCCTCGGG
TCGAACCCCT TCGGCGTCTG CATGGTCCAC GGCGTCGGCG CGCCGGAGCC GACCTACCAC
CGCCAGTTCC GCAACGTCCC CGGCGGCGTC CAGAACGGTA TCACCGCCGG CTTCGAGAAC
GAGGCCGACA TCGCCTACTG TCCCGAGCCC TGGGGCGACG ACCACGCGCA CCGCTGGCGC
TGGGCCGAGC AGTGGATTCC CCACTCGGCC TGGCTGTTCC TGGCGGTCAG CTCGCTGTAG
 
Protein sequence
MEILVNQLGY ETDGPKRAVC RATERHDLDG FVLHDGDSVV FEGTPEFVGG VADWGEWVFW 
TLEFSSLTEP GEYTLRAGEA HSRRFEIGED LHKETLLSDL LYYCKTQRAS GEYDRADRSV
PFVGDREGTV DVRGGWYDAS GDMSKYLSHL SYANYFNPQQ IPIVAWGLAD ARDRLHDGDH
RLGGELDARL REEIHHGADF LVRMQDDAGY FYMTVFDQWS KDVDRREICA YETEEGHKTI
DYEAGYRQGG GVAIAALARA SQVEGPGAFD RETYREAAVE GFDHLQAHNT EYLDDGTENV
IDDYCALLAA TELAAATDDE RFRTAARERA QSLLDRQTGD DRYDGWWRAD DDDRPFYHAS
DEGLPIVALL RYRAVDADGP LDDAIVNAIE RFWGFQTTVG DEVTNPFDYP RQYARPVDED
EPRASFFMPH ENETGYWWQG ENARIASLAT AAARSRAQLD AELGERLDRF AQAQLDWILG
SNPFGVCMVH GVGAPEPTYH RQFRNVPGGV QNGITAGFEN EADIAYCPEP WGDDHAHRWR
WAEQWIPHSA WLFLAVSSL
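
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the forward reading frame starting at the first base and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the tables differ in permitted start codons). The function name `translate` is hypothetical and not part of the record.

```python
from itertools import product

# Codon table built in the conventional TCAG order; '*' marks stop codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as listed above.
first_line = "ATGGAGATTC TGGTCAACCA ACTCGGTTAC GAGACCGATG GCCCGAAACG AGCGGTCTGT"
print(translate(first_line))  # MEILVNQLGYETDGPKRAVC
```

The output matches the first 20 residues of the protein sequence. Pasting the full gene sequence into `translate` in the same way should reproduce the full protein sequence under the same assumptions.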