Gene Hmuk_1560 details


Gene Information

Locus tag: Hmuk_1560
Symbol:
ID: 8411081
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1486825
End bp: 1489578
Gene Length: 2754 bp
Protein Length: 917 aa
Translation table: 11
GC content: 66%
IMG OID: 645019886
Product: molybdopterin oxidoreductase
Protein accession: YP_003177382
Protein GI: 257387609
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
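
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the coding sequence ends in a single stop codon; neither assumption is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1486825
END_BP = 1489578
GENE_LENGTH_BP = 2754
PROTEIN_LENGTH_AA = 917


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 2754, matches Gene Length
print(coding_codons(GENE_LENGTH_BP))   # 917, matches Protein Length
```

Under these assumptions the listed gene length and protein length are consistent with the listed coordinates.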


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.444202
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTACGCA ACGACGACGA CAACCTCGAA CTGACGCGTC GCGACGCGAT GAAAGCCGGC 
GGGGCGGCGG CGGTGGCGCT TGGACTCGGC GGAACGGCCT CCCACCTCAC GTCACTCGGG
GAGGCCCAGT CGACCTCACT CGAATACGAG GAGGTGCCGA CGGCCTGCTG GATCGGGAAG
ATGGACTGTT CGGCGACGGC CAAGAAGGTC GGTAACCGGG TCGTCAAGTA CGAGGGCAAC
CCGGAGGACC CGCGGACCGA GGGGTCGCTC TGTCCGAAGG GGCAGGCCCA GATAGAGCAG
GTGTACAACC CCTACCGTGT CAAGGCGCCG CTAAAGCGCA CGAACGAGAA GGGCCAGCAC
GGCGAGTGGC AGGAGATCAG CTGGGACCAG GCGATGGACG AGATCGGCGA GGACCTGAAG
GAGAAACTCC AGGACGACCC ACGGCGGGTC GTCTTCCAGG TCGGCCGGAA GAAGTCCCCC
CAGTGGCAGG AAGACGCCTG GGTGACGGGC ACGAGCAACA AGTACGGCAG CATGGAGAAG
TACGGCCACG GCGCGGCCTG TTCGGACTCC GGCTACCGCG GCCAGGAGTT GATATTCGCG
ACCCACGGGG TCTCGGAGAC CGACTTCGAG AACTGCGAGT TCTTCATCGG CTGGGGCCAC
AACATGACCC AGGGCGGCGG TGCGCACCTC TGTCAGATCA CGTGGCCCAA ACAGATCGCC
GACGCCCGCG ACGACCAGGG AATGGAGACG GTCGCGATCG ACCCCCAGCG TCGTAACTCC
GGGCCCTACA CCGACCAGTG GCTCCCGATC GAGCCGGGGA CGGACATGGC GTTCTGGCTG
GCGTTCAACA GCGTCCTCGT CCGCGAGGGG TACATCGACG AGGAGTACCT CACGACCGCC
ACGAACGCGC CGTGTCTCGT CGCGACCGAG GGCGACGAGG ACGGCCACAT CCTCCGGACC
GACGACGCCC AGGACCCCGG CGAGGAGTAC ACCTGGGCCG ACGGCGAACT GGTCTGGGAC
GAGGACGCTG GCGAGGCGGT CGCTCACGAG GAGGCGTCGT CGCCGGAGAA CGTCGCGCTG
GAGGGCACCT ACGAGGTCGA CGGCGTCGAA GCGCGGCCGG CCTTCGACCT GTACCTCGAT
CAGATCTCCC AGTACGACCC CGAGTGGGCC GCCGAGATCA CTGGTATCGA CGCCGACACC
ATCGACGAAC TTGCGATGAA GTGGGGGGAG AAGGCGAAGA TCGGAGCGAC TGTCGAGGTC
GACGGCATCG AGATCCCCTA CCGGCCGGTG GGGATGCACG GCTACCACGT CGCCCAGCAG
GAGATGGGCG TCTCGACGAC GATAGCCCAC TACCACGCCG CGATGCTGGT CGGGGCGGTC
GACGTCGTCG GCTCCACGCG CGTTCGGAAG GCCAAGTACG ACGGCCCCAA GGACTACCGT
GAGCCGTTCC GCGACCAGGC GTTCCACCCC GAGAAGATCA CGAAAGAGCC CGATGGGCCG
TCGCTGGGCG GGAGCATGTT CCACCCCATC GACTCGACGG CCTTCTCCCA GAGCCACGTC
AGCCAGACGA ACCCCGACAA GTACAACCTC CCCTACGAAC CGGAGGAGAT GGCGTGGATC
GTCCAGATGG CCAACCCGGT CACGTCGGCC CCTGGGGTCG AGACCGTCAT CGAGAGCATG
AGCAGGGTCG ACACGACCAT CGTCTGCGAC CCCTGGATGA GCGAGACGGC TGACGTGGCC
GCCGACTACG TGCTACCGGC GGCGACCGCC GACAAACTCC AGGGTCCGAC GGGCGGCTGG
GACGGCTACG CCGACATCGA ACACATCCGG TTCCCCTCGA TGGATCCCCT CTGGGACACC
AAGCCAGACG CCGAGATATA CATCCAGCTC GCCAAGGCCG TCGACGCCTA CGAGGAGTAC
GTCGCGGACA TCAACGACGA ACTCGGCCTC GACGGGACCG ATTACGCGTA CGCCGGTGCC
GACGAGACGC CCGACGACCC CAGCGAGTTC CTGCGTGACG GCCTCGACCG CTGGGCACAG
ACGAAGGGCA AGAGCCTGGA GTGGTTCCGC GAGGGCAACG TCATCACCAA CGAGTGGGAC
GTCGGCGGCG GCAACCGCTA CGCCTACACC TGGGGCATGG ACAACGGCTA CGGCGAGTTC
AACCCCTACG ACGCCAAACA CGAGTTCTAC AGCGAGACGC TGTTCCGCCT CGGCGAGCGC
GTCGACGAGC TGATGGCCGA CTCGAAGTTC GACGACCCGG TCTCGGAGTT CCCCTACCTC
CAGGACTACA ACGCGTACCC GACCTGGCGG GAACCGACGA TGTACGACTC CCCCGACGAG
TACGACCTGA CGCTCTTTAG CTGGCACCAG ATCGAGCACA AGCAGACCCG AACGGCGAAC
AACAAACTGC TCAACGAGAT CGCGCCGAAG AGCGCGTTCC GGCTGAATCC GGAGGACGCG
GACCGCATCG GTGTCGAGGA CGGTGACGAG GTCGTCCTCG AGACGCACGA CGCCCAGAAC
GACGAGACGT ATCAGGTCGA GGGCGTGGTG ATGATCCAGG ACGGCGTCAA ACCCGGCACG
GTCGGCGTTC CCCACCACCA CGGGAGCTGG AAGGATCCCG AGAGCGAGGC GCTCGACGAG
GGTCCCAGCA TCAACAGAGC GATTCCGAGC GGGCCCGGAT ACCTCGGTCT CGACAACGGC
CAGGCGTTCC AGGTCCGTGC GAAGGTCGAA CCCAAAGGAG GTGACAGCGA ATGA
 
Protein sequence
MLRNDDDNLE LTRRDAMKAG GAAAVALGLG GTASHLTSLG EAQSTSLEYE EVPTACWIGK 
MDCSATAKKV GNRVVKYEGN PEDPRTEGSL CPKGQAQIEQ VYNPYRVKAP LKRTNEKGQH
GEWQEISWDQ AMDEIGEDLK EKLQDDPRRV VFQVGRKKSP QWQEDAWVTG TSNKYGSMEK
YGHGAACSDS GYRGQELIFA THGVSETDFE NCEFFIGWGH NMTQGGGAHL CQITWPKQIA
DARDDQGMET VAIDPQRRNS GPYTDQWLPI EPGTDMAFWL AFNSVLVREG YIDEEYLTTA
TNAPCLVATE GDEDGHILRT DDAQDPGEEY TWADGELVWD EDAGEAVAHE EASSPENVAL
EGTYEVDGVE ARPAFDLYLD QISQYDPEWA AEITGIDADT IDELAMKWGE KAKIGATVEV
DGIEIPYRPV GMHGYHVAQQ EMGVSTTIAH YHAAMLVGAV DVVGSTRVRK AKYDGPKDYR
EPFRDQAFHP EKITKEPDGP SLGGSMFHPI DSTAFSQSHV SQTNPDKYNL PYEPEEMAWI
VQMANPVTSA PGVETVIESM SRVDTTIVCD PWMSETADVA ADYVLPAATA DKLQGPTGGW
DGYADIEHIR FPSMDPLWDT KPDAEIYIQL AKAVDAYEEY VADINDELGL DGTDYAYAGA
DETPDDPSEF LRDGLDRWAQ TKGKSLEWFR EGNVITNEWD VGGGNRYAYT WGMDNGYGEF
NPYDAKHEFY SETLFRLGER VDELMADSKF DDPVSEFPYL QDYNAYPTWR EPTMYDSPDE
YDLTLFSWHQ IEHKQTRTAN NKLLNEIAPK SAFRLNPEDA DRIGVEDGDE VVLETHDAQN
DETYQVEGVV MIQDGVKPGT VGVPHHHGSW KDPESEALDE GPSINRAIPS GPGYLGLDNG
QAFQVRAKVE PKGGDSE
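
The sequence blocks above are printed in groups of 10 characters, 60 per line. As a minimal sketch, the following Python strips that spacing, translates DNA with the amino-acid assignments of translation table 11 (which are the same as those of the standard genetic code; the tables differ in permitted start codons), and computes a GC fraction. The helper names are hypothetical, and only the first and last lines of the gene sequence are used as input, so the GC value computed here is for that one line and is not the gene-wide figure.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments shared by
# translation tables 1 and 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base in frame 1; trailing partial codons are ignored."""
    seq = clean(dna)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from this record.
FIRST_LINE = "ATGTTACGCA ACGACGACGA CAACCTCGAA CTGACGCGTC GCGACGCGAT GAAAGCCGGC"
LAST_LINE = "CAGGCGTTCC AGGTCCGTGC GAAGGTCGAA CCCAAAGGAG GTGACAGCGA ATGA"

print(translate(FIRST_LINE))  # MLRNDDDNLELTRRDAMKAG
print(translate(LAST_LINE))   # QAFQVRAKVEPKGGDSE*
```

The two translations match the start and end of the protein sequence listed above, with the final TGA codon read as a stop. Passing the full gene sequence to gc_fraction would give the value to compare against the GC content field.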