Gene Hmuk_2470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2470 
Symbol 
ID8412014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2367018 
End bp2369936 
Gene Length2919 bp 
Protein Length972 aa 
Translation table11 
GC content67% 
IMG OID645020811 
ProductDMSO reductase family type II enzyme, molybdopterin subunit 
Protein accessionYP_003178285 
Protein GI257388512 
COG category[C] Energy production and conversion 
COG ID[COG5013] Nitrate reductase alpha subunit 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01580] respiratory nitrate reductase, alpha subunit
[TIGR03479] DMSO reductase family type II enzyme, molybdopterin subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0826406 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAC ACGACATCGA CGACGACCGG TGGATGGACA GTTCCGGAAT TACCCGACGC 
GACTTCGTCC GCGGCCTCGG GGCCGCCTCG ATCGTCGGCG CGACCGGCCT GTCGTTCGCC
GACGAAGAGA TGGACGGGCT GCAGGCGGTC GACGATCCGA TCGGATCGTA CCCCTACCGC
GAGTGGGAGG ACCTCTACCG CGAGGAGTGG GACTGGGACT CCGTGGCGCG GTCGACCCAC
AGCGTCAACT GCACCGGCTC CTGTTCGTGG AACGTCTACG TCAAGGACGG GCAGGTCTGG
CGCGAGGAAC AGGCCGGCGA CTACCCCGTG ATCGACGAGG ACCTCCCAGA TCCCAACCCG
CGGGGCTGCC AGAAAGGGGC CTGCTACACG GACTACGTCA ACGCCGACCA GCGCGTGCTC
CACCCGCTGC GCCGCACCGG CGAACGCGGC GAGGGCCAGT GGGAGCGCAT CTCCTGGGAC
GAGGCACTGA CGGAGATCGC CGACCACGTC ATCGACGAGG TGCAGGCGGG CCGGTACGAC
GCAATCTCGG GCTTTACGCC GATCCCGGCC ATGTCGCCCG TGTCGTTCGC CAGCGGCTCC
CGGCTCGTGA ACCTGCTGGG CGGCGTGTCC CACAGCTTCT ACGACTGGTA CTCCGACCTG
CCGCCGGGTC AGCCGATCAC CTGGGGGACC CAGACGGACA ACGCCGAGTC CGCCGACTGG
CACAACGCCG ACTACATCAT CGCCTGGGGG TCGAACATCA ACGTCACCCG TATCCCCGAC
GCCAAGTACT TCCTCGACGC CGGCTACGAG GGAGCGAAAC GCGTCGGGAT CTTCACCGAC
TACTCCCAGA CCGCGATCCA CACCGACGAG TGGCTCGCGC CGGAGGGTGG CACCGACACC
GCGCTCGCGC TCGGGATGGC CCAGACGATC GTCGACGAGG AGCTGTACGA CGAGTCCCAC
CTCAAAGAGC AGACCGACAT GCCGCTGCTC GTGCGAGAAG ACACCGGGAA GTTCCTCCGG
GCGGGCGAGG TCGGTCTCGC GGCGGACGCC GACGACCCGG ACAAGGTGTT CGTGATGGTC
GACGAGGACG GGGAGCTCCG GCCCGCGCCC GGTTCGCTCG GCGAGCGCGA CGGCCAGCAC
GACCCCGAGT CGAGCATCGA ACTCGACTTC GACCCCCAGC TGGGCGTCGA GCGGAGCGTC
GAGACCGACG ACGGCGAGGT GGCGGTCCGG TCGGTCTGGG AGAACCTCCG TGAGGAACTG
TCACAGTACA CGCCCGCGTT CGTCCACGAG GAGACCGGGG TCGGCGAGAA CACCTACCAG
CGCGTCGCCC GAGAGTTCGC CGACGCCGAC GCGGCCAAGA TCATCCACGG CAAGGGCGTC
AACGACTGGT ACCACAACGA CCTGGGGAAC CGGGCGATCC AGTTGCTCGT GACCCTGACC
GGGAACCTCG GCGAACCCGG CACCGGGCTG GATCACTACG TCGGCCAGGA GAAGATCTGG
ACGTTCCACG GCTGGAAGAC GCTCTCGTTC CCGACCGGCA GCGTGCGGGG CGTGCCGACG
ACGCTGTGGA CCTACTTCCA CGCCGGCATC CTCGACAACA CCGATCCCGA CACCGCCGAG
AAGATCCGCG AGTCCATCGA CGAGGGCTGG ATGCCGGTGT ACCCCGAGGA GCGCGAGGAC
GGGTCCTGGC CGGACCCCTC GACGATGTTC GTCTGGCGTG GCAACTACTT CAACCAGGCC
AAGGGCAACG TCGCCGTCGA GGAACAGCTC TGGCCGAAAC TGGATCTGGT CGTGGACATC
AACTTCCGGA TGGACTCGAC GGCGATGTAC TCCGACATCG TCCTCCCGGC GGCGAGCCAC
TACGAGAAGC ACGACCTGAA CATGACCGAC ATGCACACGT ACGTCCACCC GTTCACGCCC
GCCGTCGAGC CGCTGGGCGA GGCCAAGTCT GACTGGGAGA TCTTCCGCCT GCTGGCCGAG
AAGATCCAGG AGCGGGCCCA GGAACGGGGG GTCGAACCGG TCGAGGACCG ATCGTTCGAC
CGCGAGATCG ACCTGACGAC GATCTACGAC GACTACGTGC GCGACTGGGA GACCGGCGAG
GAGGGAGCCC TCGAAGACGG CCGCGAGGCC TCCGAGTTCG TCCTCGAACA CAGCGAGGAG
TCCAACCCCG CAGACAGCGA CGAGCAGATC ACGTTCGCCG ACACCGTCGA GCAGCCCCAG
CGTCTCGAAG CGGCGGGGGA TCACTGGACC TCCGACATCG AGGACGGAGC ACCGTACGTC
CCCTGGCAGG ACTACGTCCA GGACAAACAG CCCTGGCCCA CCTTCACCGG CCGCCAGCAG
TACTACGTCG ACCACGACTG GTTCCTCGAA CTCGGCGAGG AGTTGCCCAC GCACAAGGAG
GGGCCACAGG ACACCGGCGG CGACTACCCC CTCTCCTACA ACACGCCCCA CAGCCGCTGG
TCGATCCACT CGACGTGGCG CGAGAACACG AAGATGCTCC GGCTCCAGCG GGGCGAACCG
ACGGTCTTCC TCAATCCCGA GGACGCCGAG GAGCGCGGGA TCGAGGACGG CGACACCGTC
GAGGTGTACA ACGACATGGG CAGCGTCGAG GTACAGGCGA AGATCTACCC GTCGGGTGAC
CCCGGAACGG TCCGGCACTT CTTCTCGTGG GAGAAGTTCC AGTATCCGGG CCGAGACAAC
TTCAACACGC TCGTCCCGAT GTACATGAAG CCCACGCAGC TGGTCCAGTA CCCCGAGGAC
ACCGGCGAAC ACCTCTACTT CTTCCCGAAC TACTGGGGTC CGACCGGCGT CAACAGCGAC
GTGAACGTGG ACGTCCGACT GACCGACGGG CAGTCGGGCT CGGGAGACGA GCAGCGCGAG
TCTGTCGGTG TGCGCCCCGC AGGAGGTGAC GACCAATGA
 
Protein sequence
MSEHDIDDDR WMDSSGITRR DFVRGLGAAS IVGATGLSFA DEEMDGLQAV DDPIGSYPYR 
EWEDLYREEW DWDSVARSTH SVNCTGSCSW NVYVKDGQVW REEQAGDYPV IDEDLPDPNP
RGCQKGACYT DYVNADQRVL HPLRRTGERG EGQWERISWD EALTEIADHV IDEVQAGRYD
AISGFTPIPA MSPVSFASGS RLVNLLGGVS HSFYDWYSDL PPGQPITWGT QTDNAESADW
HNADYIIAWG SNINVTRIPD AKYFLDAGYE GAKRVGIFTD YSQTAIHTDE WLAPEGGTDT
ALALGMAQTI VDEELYDESH LKEQTDMPLL VREDTGKFLR AGEVGLAADA DDPDKVFVMV
DEDGELRPAP GSLGERDGQH DPESSIELDF DPQLGVERSV ETDDGEVAVR SVWENLREEL
SQYTPAFVHE ETGVGENTYQ RVAREFADAD AAKIIHGKGV NDWYHNDLGN RAIQLLVTLT
GNLGEPGTGL DHYVGQEKIW TFHGWKTLSF PTGSVRGVPT TLWTYFHAGI LDNTDPDTAE
KIRESIDEGW MPVYPEERED GSWPDPSTMF VWRGNYFNQA KGNVAVEEQL WPKLDLVVDI
NFRMDSTAMY SDIVLPAASH YEKHDLNMTD MHTYVHPFTP AVEPLGEAKS DWEIFRLLAE
KIQERAQERG VEPVEDRSFD REIDLTTIYD DYVRDWETGE EGALEDGREA SEFVLEHSEE
SNPADSDEQI TFADTVEQPQ RLEAAGDHWT SDIEDGAPYV PWQDYVQDKQ PWPTFTGRQQ
YYVDHDWFLE LGEELPTHKE GPQDTGGDYP LSYNTPHSRW SIHSTWRENT KMLRLQRGEP
TVFLNPEDAE ERGIEDGDTV EVYNDMGSVE VQAKIYPSGD PGTVRHFFSW EKFQYPGRDN
FNTLVPMYMK PTQLVQYPED TGEHLYFFPN YWGPTGVNSD VNVDVRLTDG QSGSGDEQRE
SVGVRPAGGD DQ