Gene Hmuk_0051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0051 
Symbol 
ID8409548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp48422 
End bp50194 
Gene Length1773 bp 
Protein Length590 aa 
Translation table11 
GC content66% 
IMG OID645018389 
Productalpha amylase catalytic region 
Protein accessionYP_003175909 
Protein GI257386136 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.237414 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGAGA CAGTCGAGCG GACGCGATCC GACCGCCAGT GGTGGGAAGA GGCCGTCGTC 
TACCAGATCT ACCCGAAGAG TTTCTTCGAC AGCGACGGCG ACGGGATCGG TGACCTCGCG
GGCATCACCG AGAAGGTCGA CTACCTCGAC GCGCTCGGGG TCGACGCGGT CTGGCTCTGT
CCCGTCTACG ACTCGCCACA GGCCGACAAC GGCTACGACA TCCGCGACTA CCGGTCGATC
TTCGACACCT ACGGCGACAT GGCCGACTGG GAGCGCCTGC TGGCCGAACT CCACGACCGA
AATATTCGGC TGGTCATGGA CCTCGTCGTC AACCATACAT CGGCCGAACA CGAGTGGTTC
CAGCACTCGC GCCGTGGCGA GAGCGAGTAC GCCGACTACT ACCACTGGCG CGACGGCCGC
CCGGCTGAGA CGGCCGACTA CGACACCGAC GACGGTCCCG CCGACGAGGT CGCCCCCAAC
AACTGGGACT CGATCTTCGG CGGCCCGGCG TGGGCCTACG ACGAGGAGCG CGGACAGTGG
TACCTCCACC TCTTCGACGA GAGCCAGCCG GACCTCAACT GGGAGAACGA GGCCGTCCGC
GAGGACGTGT ACGAGATGAT GCGCTGGTGG CTCGACCGCG GGATCGACGG CTTCCGCATG
GACGTAATCA ACCTCGTCTC GAAGACGCCG GGGCTCCCCG ACGGCGACCC GGAGAGTGGG
CTCGTCGGTG CAGAGCACTT CATGAACGGG CCCAAAGTCC ACGATTACCT CTCGGAGCTG
GTCGCCGAAG CCATCCCGGA CGACGAGATC CTCACCGTCG GCGAGACGCC GGGAGCGGGC
GTCGAGGACG CCCAGCGGTT CGTCGGCGAG GACGGCCTCT CGATGGTGTT CCAGTTCGAA
CACGTCAACG TCGGCCACGA GGGCGACAAG TGGTCGCTGG AGGACTACTC GCTGGTCGAA
CTCAAAGAGA GTCTCGCACG CTGGCAGAAC GGTCTGGCCG ACGAGGGGTG GAACAGCCTC
TACCTGAGCA ACCACGACCA GCCACGGACC GTCTCCCGGT TCGGCGACGA CGGGCGGTAC
CGTCGCCGCT CCGCGAAGCT GTTCGGGACG CTGCTGTACA CCCTCCAGGG AACGCCGTTC
GTCTACCAGG GCCAGGAGAT CGGCATGACC AACGCCCCCT TCGAGTCGAA GTCAGAGCTG
CGGGACGTGG AGTCGATCAA CTTCGTCGAG GAAGCCATCG AGAGCGGCGC GGCCGAGAGC
TACGAGGACG TACGCGACGC CGTCGAGACC GTCGGCCGCG ACAACGCCCG GACGCCGATG
CAGTGGTCCG ACGACGAGAA CGCCGGCTTC ACCGACGGCG AGCCCTGGCT CAAGGTCAAC
CCCAACTACG ACGCGATCAA CGTCGAGCGG GCACGCGCCG ACCCCGATTC GGTGTGGCAC
TACTACCGCG AACTCGGCGA TCTTCGGGGG AATCGGGACC TGCTGGTCTA CGGCGACTTC
GAGCTGCTCG CGCCCGGGGA CGAGCAGGTG TTCGCCTACA CCCGGACGCT GGGCGACGAA
CGGGCCCTCG TTGTCCTCAA CGTCTCCGCC GAGCCCGCGA CGTTCGCGGT GCCCGACGAC
GCTGGCGACG ACCTCGAACT CGCCATTGGG AACCTGAACG CCCTCGAAAC GCCCGGCAGC
ACCGTCGAAC TCGATCCCTA CGAGGCGCGC GTCTATCTGA CGCCCGCGAC CGACACTGAC
CGACCGACGA CCACGAACTC CGATTCGACA TGA
 
Protein sequence
MSETVERTRS DRQWWEEAVV YQIYPKSFFD SDGDGIGDLA GITEKVDYLD ALGVDAVWLC 
PVYDSPQADN GYDIRDYRSI FDTYGDMADW ERLLAELHDR NIRLVMDLVV NHTSAEHEWF
QHSRRGESEY ADYYHWRDGR PAETADYDTD DGPADEVAPN NWDSIFGGPA WAYDEERGQW
YLHLFDESQP DLNWENEAVR EDVYEMMRWW LDRGIDGFRM DVINLVSKTP GLPDGDPESG
LVGAEHFMNG PKVHDYLSEL VAEAIPDDEI LTVGETPGAG VEDAQRFVGE DGLSMVFQFE
HVNVGHEGDK WSLEDYSLVE LKESLARWQN GLADEGWNSL YLSNHDQPRT VSRFGDDGRY
RRRSAKLFGT LLYTLQGTPF VYQGQEIGMT NAPFESKSEL RDVESINFVE EAIESGAAES
YEDVRDAVET VGRDNARTPM QWSDDENAGF TDGEPWLKVN PNYDAINVER ARADPDSVWH
YYRELGDLRG NRDLLVYGDF ELLAPGDEQV FAYTRTLGDE RALVVLNVSA EPATFAVPDD
AGDDLELAIG NLNALETPGS TVELDPYEAR VYLTPATDTD RPTTTNSDST