Gene Hmuk_0210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0210 
Symbol 
ID8409708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp207969 
End bp210194 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content60% 
IMG OID645018535 
Productalpha amylase catalytic region 
Protein accessionYP_003176054 
Protein GI257386281 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0538024 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACGC TTGGGACCGC TGCAGCGGCC TTCGGAATGA GCCGGAGTGT CGAGGCAGAG 
ATCAACGGAT ATCATACACT GGCTACCGAA GACAACCGTC TGGGTCCGGT CACACCACGT
GATACGGTGT ATCAGATCCT GACAGATCGA TTCTACGACG GCGACACGAG CAACAACGAC
TTCGAGGATT TCGATTCCTC ACTCTACGAC GGCAGTGGAG AGGACCTCAA GCTGTACCAG
GGCGGTGACT GGCAGGGGAT CATCGACAAG ATACCGTATC TCGAGGAGAT GGGTGTGACA
GCGGTCTGGA TTTCCGCACC CTACGACAAC CGAGAGAGCA AGATCGAAGA CTACCAGGAC
GACGGTACGG TCGACGTCTG GACCAGTTAT CACGGCTACC ACGCGCGCAA CTACTTCCGG
ACCCACCGGT TTTTCGGCGG GATGCAGGAC TTCTACGCGA TGCGCGACGC GTTGCACAAC
AACGGCATCA AACTCGTCAT CGACTTCGTC TCGAACCACA CGAGCCGCTG GCGGAATCCG
ACGAAGAACA ACGAGCCCGA GGAAGGAGAA CTCTACGAAC CGGACACGGA TGCAGACGGC
AATTACGTCT TCGACGAGAA CGGGGACGCC GTCGGTGAGA ACTTGCTGGC AGATCCACAC
GACGATGTGA ACGGCTGGTT CCACGGACTG GGTGACCGTG ACGGGGACTC GTCGAAGTTC
GGCTATCGCC ACAAGGAACT GGGCTCGCTG GCCGACTTCG CACACGAAAA CGGGGGGGTC
GTCGACCACC TCGAACGAGC GACGCAGTTC TGGAAGTCGA AAGGGATCGA CGGAATTCGT
CACGACGCAA CGTTGCACAT GAACCCGGCA TTCGCAAAGA ACCTCAAGAC CGCGACCGAC
AGTTCGCAAG GCGGTCCGAT CACGCACTTC GGGGAGTTCT TCATCAGCCG ACCCGACCCG
AAATACGAGG AGTACCGGAC GTTTCCCGAC AGAACCGGCA TCAACAACCT CGACTTCGAG
TTCAACCGGG CGGCGACGAA CGCGTTCGGC GACTTCTCCG AGACGATGTC CGACTTCGGA
GACATGCTGA TCAAGACCCA CGGCGACTAC ACGCACGAAC ACCAGACGAT CACTGCCGTC
GACAATCACG ACCTCACTCG GTTCCGGTAC ATCCAGCCAA ACGACAAGCC GTACCACGCG
GCGATCGCCG CGTTGATGAC CTGCCGCGGA ACGCCGAAGA TCTACTACGG GACCGAGCAG
TACATGAATC CCGGATCGAG CGGTGCGAAC GCAGGTCGGC TGTTCATGCA AACCGACAGC
GATTTCGACA CGTCGACCAC GGCCTATCAG TTGATCAGCG ACCTCGCACA GCTCCGGCGT
GACAACCTCG CGCTCGCGTA CGGTCAGACG AGTATTCTCC ACTCGACAGA CGACGTGATC
GTCTACGAGC GGACCTTCTA CGATCACGTG GTCGTGACGG CGATCAATCG ACAGCCCGAC
CAGTCCGAGA GCGTCCCGTC GGTCGGTACG AGCCTGCCGG ACGGTAGTTA CGGCGACTAC
CTCTCGGGGG ACCTCTACGG ACAGGGCATC TCGGTCGACA GCGGTGCACT GGACTCCTTC
ACGTTGGGCG GTGGAGAGGT CAGCGTCTGG ACAGCGAGCC CCGACCTGGG GAATGCGCCG
AAGATCGGAA CGACGGTCAG CACGATGGGA CAGGCCGGCG ACTCCGTCCA CATCTACGGA
TCAGGTCTCG ACGGCGACGT GTCAGTCACG TTCGGCGGTA CGCCCGCGAA CGTCGTGTCC
AACAGCGCGA CCCGTATGAC GGCGACAGTC CCCGATGGGC CCGCCGGACT CGTCGACGTG
GTCGTCGAAA AGAACGGCCA GACGAGCAAC GCCGCGAAAT ACGACATCCT CACGGACGAG
CCGGTACAGG TGATCTTCCA CGTCGAGGCC GAGACCGAAC CCGGCGAGAC GATCCACGTC
GTCGGTGATC CACCGGAACT GGGTAGCTGG GATGCCCTGT CGGGCAGCGA ATCCTTCATG
AACCCGGACT ATCCGGAGTG GTTCCTCCCA GTGAGCGTGC CAAAGGACAC GACCTTCGAG
TTCAAATTCG TCAAGATCGA CGAGAGCGAC AACGTGACCT GGGAAAGCGG GAGCAACCGG
ACCTTCACGT CACCCACGGA TTCGACCGAG ACCGAAGATA CGCCGCTCTA CTACTGGCAA
AGCTAG
 
Protein sequence
MKTLGTAAAA FGMSRSVEAE INGYHTLATE DNRLGPVTPR DTVYQILTDR FYDGDTSNND 
FEDFDSSLYD GSGEDLKLYQ GGDWQGIIDK IPYLEEMGVT AVWISAPYDN RESKIEDYQD
DGTVDVWTSY HGYHARNYFR THRFFGGMQD FYAMRDALHN NGIKLVIDFV SNHTSRWRNP
TKNNEPEEGE LYEPDTDADG NYVFDENGDA VGENLLADPH DDVNGWFHGL GDRDGDSSKF
GYRHKELGSL ADFAHENGGV VDHLERATQF WKSKGIDGIR HDATLHMNPA FAKNLKTATD
SSQGGPITHF GEFFISRPDP KYEEYRTFPD RTGINNLDFE FNRAATNAFG DFSETMSDFG
DMLIKTHGDY THEHQTITAV DNHDLTRFRY IQPNDKPYHA AIAALMTCRG TPKIYYGTEQ
YMNPGSSGAN AGRLFMQTDS DFDTSTTAYQ LISDLAQLRR DNLALAYGQT SILHSTDDVI
VYERTFYDHV VVTAINRQPD QSESVPSVGT SLPDGSYGDY LSGDLYGQGI SVDSGALDSF
TLGGGEVSVW TASPDLGNAP KIGTTVSTMG QAGDSVHIYG SGLDGDVSVT FGGTPANVVS
NSATRMTATV PDGPAGLVDV VVEKNGQTSN AAKYDILTDE PVQVIFHVEA ETEPGETIHV
VGDPPELGSW DALSGSESFM NPDYPEWFLP VSVPKDTTFE FKFVKIDESD NVTWESGSNR
TFTSPTDSTE TEDTPLYYWQ S