Gene Hmuk_0052 details

Gene Information

Locus tag: Hmuk_0052
Symbol:
ID: 8409549
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 50191
End bp: 51924
Gene Length: 1734 bp
Protein Length: 577 aa
Translation table: 11
GC content: 65%
IMG OID: 645018390
Product: alpha amylase catalytic region
Protein accession: YP_003175910
Protein GI: 257386137
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
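
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(50191, 51924)
print(length)                  # 1734
print(protein_length(length))  # 577
```

Both results agree with the Gene Length and Protein Length fields of this record.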


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.316768
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGCACGA ACACACTCAC GACCCTAGAG AACGTCGACC GCGAATGGTG GAAAGAGGCC 
GTCGTCTACC AGATCTATCC ACGAAGCTTC AGCGACACCG ACGGGGACGG CGTGGGCGAC
TTACAGGGTA TCGTCGACAA GGTCGACTAC ATCGACTCGC TGGGCGTCGA TCTCGTCTGG
CTCTGCCCGG TCTATGGCTC GCCCAACGAG GACAACGGCT ACGACATCAG CGACTACCAC
GCGATCATGG ACGAGTTCGG CGACATGGCC GACTGGGAGC GCCTGCTCGA CGAGCTCCAC
GACCGAGATA TTCGGCTGAT CATGGACCTC GTCCTGAACC ATACCTCGAG CGAACACGAG
TGGTTCCAGC GCTCGCGCCG TGGCGAGGGC GAGTACGCCG ATTACTACTA CTGGCGCGAC
GGCCGCCCAG CCACCGACGC CGACTACGAC ACCGACGACG GCCCCGCCGA CGAGGTCGCC
CCCAACAACT GGGACTCGAT CTTCGGCGGG CCCGCCTGGA GCTACGTCGA GGAGCGCGAG
CAGTGGTACC TCCACCTCTT CGACGAGAAC CAGCCGGACC TCAACTGGCG AAACCCGACC
GTCCGTCAGG CGATGAAAGA CGTGGTCTCG TGGTGGCTCG ACCGCGGGAT CGACGGCTTC
CGCATGGACG CCGTGTCCCA CATGTCCAAG ACCGACGGCC TCCCGGACGG CGATCCCGAG
GGCACGCCCA CCGGGATCGA ACACTTCAGT CACGGCCCGC GGCTGGAGGA GTACCTGACC
GAGTTGTCGG CGGACGTGCT CTCGGACCAC GACATCACGA CCGTCGCGGA GATGGGCCAC
ACCTTGATCG AACAGGCCGC CGACTACCTC GACGACGAGG CCATCGGGCT GGACATGATA
TTCCAGTTCA GCCACATGGG CGTCGACGGC GGCTGGGACC CCGAGACCGT CGGCGAGTGG
GACCTCCCCG AGTTCAAACG GATCATGTCC GAGCGCCAGG ACGCGCTCGC GGACCGTGGC
TGGGACGCCC TCTTCCTGGG CAACCACGAC CAGCCCCGGA TCGTCTCTCG CTTTGGGGAC
GAAGCCTACC ACCGCGAGTC GGCCTCGCTG ATCGGGACCT TCCTGTTGAC GATGCGAGGG
ACCCCCTACG TCTACCAGGG CGAGGAGATC GGCATGACTA ACGCGGACTT CGAGAGCCTC
GACGAGATCG ACGACCCACA GACGGTCGGT CGCGTCGAAC AGTACATCGC CGACGGCGTC
GCGGACTCCT ACGAGGAGCT TCGCGAGGTC GTCAACGCAC GGAGTCGCGA TCACGCCCGG
ACGCCGATGC AGTGGTCCGA CGAGCCAGGT GCCGGCTTCA CCGACGGCGA GCCCTGGCTC
AAGTGCAACG CCGACTACAC CGAGATCAAC GTCGCCGCCC AGCGAGACGA TCCGGCGTCG
GTACTGAACC AGTACCGACG ACTGATCGAC CTCCGCCAGT CGACGGACGT GCTTGTCTAC
GGCGACTACG AACTGCTCGC GCCCGAGGAC GACCAGCTGT ATGCCTACAC GCGCACCCTC
GACGACGAAC GGGCACTCGT GGTGCTGAAC TGGTCGAGCG AGCCGGCGAC GTTCGAGGGC
AGCGAGATCG AGGTCGACGA TCCGACAGTA CTGTACGGGA ACTACGACGA CGCGCCGACC
GAGCCCACCA ATGTGTCGCT TCGGCCGTGG GAAGCAGTCG TCTACCGACT GTAG
 
Protein sequence
MSTNTLTTLE NVDREWWKEA VVYQIYPRSF SDTDGDGVGD LQGIVDKVDY IDSLGVDLVW 
LCPVYGSPNE DNGYDISDYH AIMDEFGDMA DWERLLDELH DRDIRLIMDL VLNHTSSEHE
WFQRSRRGEG EYADYYYWRD GRPATDADYD TDDGPADEVA PNNWDSIFGG PAWSYVEERE
QWYLHLFDEN QPDLNWRNPT VRQAMKDVVS WWLDRGIDGF RMDAVSHMSK TDGLPDGDPE
GTPTGIEHFS HGPRLEEYLT ELSADVLSDH DITTVAEMGH TLIEQAADYL DDEAIGLDMI
FQFSHMGVDG GWDPETVGEW DLPEFKRIMS ERQDALADRG WDALFLGNHD QPRIVSRFGD
EAYHRESASL IGTFLLTMRG TPYVYQGEEI GMTNADFESL DEIDDPQTVG RVEQYIADGV
ADSYEELREV VNARSRDHAR TPMQWSDEPG AGFTDGEPWL KCNADYTEIN VAAQRDDPAS
VLNQYRRLID LRQSTDVLVY GDYELLAPED DQLYAYTRTL DDERALVVLN WSSEPATFEG
SEIEVDDPTV LYGNYDDAPT EPTNVSLRPW EAVVYRL
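
The sequences in this record are printed in space-separated groups of ten characters. The following is a minimal sketch of how such a block might be cleaned up and checked against the protein sequence. It uses only the first line of the gene sequence and assumes the standard amino acid assignments, which translation table 11 shares with the standard code for codons that are not start codons. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order (standard assignments,
# assumed here to apply to this gene, which begins with ATG).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


first_line = "ATGAGCACGA ACACACTCAC GACCCTAGAG AACGTCGACC GCGAATGGTG GAAAGAGGCC"
seq = clean(first_line)
print(len(seq))        # 60
print(translate(seq))  # MSTNTLTTLENVDREWWKEA
print(gc_fraction(seq))
```

The translated residues match the start of the protein sequence listed above. The GC fraction printed here covers only these 60 bases, so it need not equal the whole-gene GC content field.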