Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_2470 |
Symbol | |
ID | 8412014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 2367018 |
End bp | 2369936 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645020811 |
Product | DMSO reductase family type II enzyme, molybdopterin subunit |
Protein accession | YP_003178285 |
Protein GI | 257388512 |
COG category | [C] Energy production and conversion |
COG ID | [COG5013] Nitrate reductase alpha subunit |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01580] respiratory nitrate reductase, alpha subunit [TIGR03479] DMSO reductase family type II enzyme, molybdopterin subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0826406 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAAC ACGACATCGA CGACGACCGG TGGATGGACA GTTCCGGAAT TACCCGACGC GACTTCGTCC GCGGCCTCGG GGCCGCCTCG ATCGTCGGCG CGACCGGCCT GTCGTTCGCC GACGAAGAGA TGGACGGGCT GCAGGCGGTC GACGATCCGA TCGGATCGTA CCCCTACCGC GAGTGGGAGG ACCTCTACCG CGAGGAGTGG GACTGGGACT CCGTGGCGCG GTCGACCCAC AGCGTCAACT GCACCGGCTC CTGTTCGTGG AACGTCTACG TCAAGGACGG GCAGGTCTGG CGCGAGGAAC AGGCCGGCGA CTACCCCGTG ATCGACGAGG ACCTCCCAGA TCCCAACCCG CGGGGCTGCC AGAAAGGGGC CTGCTACACG GACTACGTCA ACGCCGACCA GCGCGTGCTC CACCCGCTGC GCCGCACCGG CGAACGCGGC GAGGGCCAGT GGGAGCGCAT CTCCTGGGAC GAGGCACTGA CGGAGATCGC CGACCACGTC ATCGACGAGG TGCAGGCGGG CCGGTACGAC GCAATCTCGG GCTTTACGCC GATCCCGGCC ATGTCGCCCG TGTCGTTCGC CAGCGGCTCC CGGCTCGTGA ACCTGCTGGG CGGCGTGTCC CACAGCTTCT ACGACTGGTA CTCCGACCTG CCGCCGGGTC AGCCGATCAC CTGGGGGACC CAGACGGACA ACGCCGAGTC CGCCGACTGG CACAACGCCG ACTACATCAT CGCCTGGGGG TCGAACATCA ACGTCACCCG TATCCCCGAC GCCAAGTACT TCCTCGACGC CGGCTACGAG GGAGCGAAAC GCGTCGGGAT CTTCACCGAC TACTCCCAGA CCGCGATCCA CACCGACGAG TGGCTCGCGC CGGAGGGTGG CACCGACACC GCGCTCGCGC TCGGGATGGC CCAGACGATC GTCGACGAGG AGCTGTACGA CGAGTCCCAC CTCAAAGAGC AGACCGACAT GCCGCTGCTC GTGCGAGAAG ACACCGGGAA GTTCCTCCGG GCGGGCGAGG TCGGTCTCGC GGCGGACGCC GACGACCCGG ACAAGGTGTT CGTGATGGTC GACGAGGACG GGGAGCTCCG GCCCGCGCCC GGTTCGCTCG GCGAGCGCGA CGGCCAGCAC GACCCCGAGT CGAGCATCGA ACTCGACTTC GACCCCCAGC TGGGCGTCGA GCGGAGCGTC GAGACCGACG ACGGCGAGGT GGCGGTCCGG TCGGTCTGGG AGAACCTCCG TGAGGAACTG TCACAGTACA CGCCCGCGTT CGTCCACGAG GAGACCGGGG TCGGCGAGAA CACCTACCAG CGCGTCGCCC GAGAGTTCGC CGACGCCGAC GCGGCCAAGA TCATCCACGG CAAGGGCGTC AACGACTGGT ACCACAACGA CCTGGGGAAC CGGGCGATCC AGTTGCTCGT GACCCTGACC GGGAACCTCG GCGAACCCGG CACCGGGCTG GATCACTACG TCGGCCAGGA GAAGATCTGG ACGTTCCACG GCTGGAAGAC GCTCTCGTTC CCGACCGGCA GCGTGCGGGG CGTGCCGACG ACGCTGTGGA CCTACTTCCA CGCCGGCATC CTCGACAACA CCGATCCCGA CACCGCCGAG AAGATCCGCG AGTCCATCGA CGAGGGCTGG ATGCCGGTGT ACCCCGAGGA GCGCGAGGAC GGGTCCTGGC CGGACCCCTC GACGATGTTC GTCTGGCGTG GCAACTACTT CAACCAGGCC AAGGGCAACG TCGCCGTCGA GGAACAGCTC TGGCCGAAAC TGGATCTGGT CGTGGACATC AACTTCCGGA TGGACTCGAC GGCGATGTAC TCCGACATCG TCCTCCCGGC GGCGAGCCAC TACGAGAAGC ACGACCTGAA CATGACCGAC ATGCACACGT ACGTCCACCC GTTCACGCCC GCCGTCGAGC CGCTGGGCGA GGCCAAGTCT GACTGGGAGA TCTTCCGCCT GCTGGCCGAG AAGATCCAGG AGCGGGCCCA GGAACGGGGG GTCGAACCGG TCGAGGACCG ATCGTTCGAC CGCGAGATCG ACCTGACGAC GATCTACGAC GACTACGTGC GCGACTGGGA GACCGGCGAG GAGGGAGCCC TCGAAGACGG CCGCGAGGCC TCCGAGTTCG TCCTCGAACA CAGCGAGGAG TCCAACCCCG CAGACAGCGA CGAGCAGATC ACGTTCGCCG ACACCGTCGA GCAGCCCCAG CGTCTCGAAG CGGCGGGGGA TCACTGGACC TCCGACATCG AGGACGGAGC ACCGTACGTC CCCTGGCAGG ACTACGTCCA GGACAAACAG CCCTGGCCCA CCTTCACCGG CCGCCAGCAG TACTACGTCG ACCACGACTG GTTCCTCGAA CTCGGCGAGG AGTTGCCCAC GCACAAGGAG GGGCCACAGG ACACCGGCGG CGACTACCCC CTCTCCTACA ACACGCCCCA CAGCCGCTGG TCGATCCACT CGACGTGGCG CGAGAACACG AAGATGCTCC GGCTCCAGCG GGGCGAACCG ACGGTCTTCC TCAATCCCGA GGACGCCGAG GAGCGCGGGA TCGAGGACGG CGACACCGTC GAGGTGTACA ACGACATGGG CAGCGTCGAG GTACAGGCGA AGATCTACCC GTCGGGTGAC CCCGGAACGG TCCGGCACTT CTTCTCGTGG GAGAAGTTCC AGTATCCGGG CCGAGACAAC TTCAACACGC TCGTCCCGAT GTACATGAAG CCCACGCAGC TGGTCCAGTA CCCCGAGGAC ACCGGCGAAC ACCTCTACTT CTTCCCGAAC TACTGGGGTC CGACCGGCGT CAACAGCGAC GTGAACGTGG ACGTCCGACT GACCGACGGG CAGTCGGGCT CGGGAGACGA GCAGCGCGAG TCTGTCGGTG TGCGCCCCGC AGGAGGTGAC GACCAATGA
|
Protein sequence | MSEHDIDDDR WMDSSGITRR DFVRGLGAAS IVGATGLSFA DEEMDGLQAV DDPIGSYPYR EWEDLYREEW DWDSVARSTH SVNCTGSCSW NVYVKDGQVW REEQAGDYPV IDEDLPDPNP RGCQKGACYT DYVNADQRVL HPLRRTGERG EGQWERISWD EALTEIADHV IDEVQAGRYD AISGFTPIPA MSPVSFASGS RLVNLLGGVS HSFYDWYSDL PPGQPITWGT QTDNAESADW HNADYIIAWG SNINVTRIPD AKYFLDAGYE GAKRVGIFTD YSQTAIHTDE WLAPEGGTDT ALALGMAQTI VDEELYDESH LKEQTDMPLL VREDTGKFLR AGEVGLAADA DDPDKVFVMV DEDGELRPAP GSLGERDGQH DPESSIELDF DPQLGVERSV ETDDGEVAVR SVWENLREEL SQYTPAFVHE ETGVGENTYQ RVAREFADAD AAKIIHGKGV NDWYHNDLGN RAIQLLVTLT GNLGEPGTGL DHYVGQEKIW TFHGWKTLSF PTGSVRGVPT TLWTYFHAGI LDNTDPDTAE KIRESIDEGW MPVYPEERED GSWPDPSTMF VWRGNYFNQA KGNVAVEEQL WPKLDLVVDI NFRMDSTAMY SDIVLPAASH YEKHDLNMTD MHTYVHPFTP AVEPLGEAKS DWEIFRLLAE KIQERAQERG VEPVEDRSFD REIDLTTIYD DYVRDWETGE EGALEDGREA SEFVLEHSEE SNPADSDEQI TFADTVEQPQ RLEAAGDHWT SDIEDGAPYV PWQDYVQDKQ PWPTFTGRQQ YYVDHDWFLE LGEELPTHKE GPQDTGGDYP LSYNTPHSRW SIHSTWRENT KMLRLQRGEP TVFLNPEDAE ERGIEDGDTV EVYNDMGSVE VQAKIYPSGD PGTVRHFFSW EKFQYPGRDN FNTLVPMYMK PTQLVQYPED TGEHLYFFPN YWGPTGVNSD VNVDVRLTDG QSGSGDEQRE SVGVRPAGGD DQ
|
| |