Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0024 |
Symbol | |
ID | 8382284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 23959 |
End bp | 26814 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644971082 |
Product | DMSO reductase family type II enzyme, molybdopterin subunit |
Protein accession | YP_003128946 |
Protein GI | 257051113 |
COG category | [C] Energy production and conversion |
COG ID | [COG5013] Nitrate reductase alpha subunit |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01580] respiratory nitrate reductase, alpha subunit [TIGR03479] DMSO reductase family type II enzyme, molybdopterin subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGATC CAACCCAATC CGACGACAGT ACAGACGGCG TCTCACGTCG CGACTTCCTG CTGGGTGCCG GCGCGGCCGG CGTGGTCGGC GCGACCGGCC TGACCGTCGC CGATCGCGCC CTCGACGGGC TCGAAACCGT CGACGATCCG ATCGGCAATT ATCCCTACCG CGACTGGGAG GACTTCTACC GTGAGGAGTG GGACTGGGAT TCGGTCGCCC GGTCGACGCA CTCGGTCAAC TGCACCGGCT CCTGTTCGTG GGACGTCTAC GTCAAGAACG GCCAGGTCTG GCGCGAACAA CAGGCCAACG ACTACCCGAC CTTCGACGAG AGTCTCCCCG ATCCCAACCC CCGGGGCTGC CAGAAGGGGG CCTGTTACAA CGACTACGTC GACGCCGAAC AGCGTGTCCA GTACCCGATG CGTCGCACCG GCGAGCGCGG GGCCGGCGAG TGGGAGCGCA TCTCCTGGGA CGAGGCGCTG ACCGAGATCG CCGAGCACGT CATCGAGGAG ATCCAGAACG GCCGCTACGA CGCGATTTCC GGGTTCACGC CGATCCCCGC GATGAGTCCC GTGAGCTTCG CCTCGGGGAC GCGGCTGGTC AACCTGCTCG GCGGCGTCTC TCACTCCTTC TACGACTGGT ATTCGGACCT CCCGCCGGGC CAGCCGATCA CCTGGGGCCA TCAGACCGAC AACGCCGAGA GCGCCGACTG GCACAACGCC GAGTACATCA TCGCGTGGGG GTCAAACATC AACGTCACGC GCATCCCCGA CGCGAAGTAC TTCCTCGACG CGGGCTACGA GGGCGCAAAG CGGGTCGGCG TCTTCTCCGA TTACTCCCAG ACCGCCATCC ACACCGACGA GTGGATCGCG CCTGATCCCG GTACCGACAC CGCGCTGGCG CTTGGGATGG CCCGCACCAT CGTCGAGGAG GAGCTCTACG ACGAGGCACA CCTCAAGGAA CAGTCCGACA TGCCACTGCT CGTTCGGAAC GATACGGGCA AGTTCCTCCG GGCGAGCGAG GTCCCCGGCC TCTCGGTTGC AGCCGACGAA CCCGAGAAGG TCATGGTCAT GCAGGACGGG GAGGGCAACC TGCGTGCTGC GCCGGGGTCG CTCGGTGAAC GCGAGGCCAA GTACGACGAC TCCTTGTCGA TCGAACTCGA TTTCGACCCG CAACTGGCCG TCGAGGACAC CGTCGGGACG ACCGACGGCG AGGTGGCCGT CACGTCGGTC TGGAACAACC TCCGGGAGGA ACTGGCCAAC TACACGCCCG AGTACGTCGC CGACGAGACC GGCGTCGGGA AGGAGACTCA CCAGAAGATC GCCCGGGAGT TCGCCGACGT CGACCGCGGG AAGATCATCC ACGGCAAGGG CGTCAACGAC TGGTATCACA ACGACCTGGG CAACCGTGCG ATCCAGTTGC TCGTCACGTT GACGGGCCAC ATCGGGCGGA ACGGCACCGG CGTCGACCAC TACGTCGGCC AGGAGAAGAT CTGGACGTTC AACGGCTGGA AGACCCTCTC GTTCCCGACC GGTTCCGTCC GGGGCGTCCC GACGACGTTG TGGACCTACT ACCACGCGGG CATCCTGGAG AACACCGACG CGGAGACCCG CCGGAAGATC GAGGAGGCCG TCGAGAAGGA CTGGATGCCC GTCTACCCCG AGGAGCGCGG GGACGGCACC CGACCCGATC CCTCGACGAT GTTCGTCTGG CGGGGCAACT TCTTCAACCA GGCGAAGGGC AACGTCGCCG TCGAGGAGGT CCTCTGGGAC AAACTCGACC TCGTCGTGGA CATCAACTTC CGGCTGGACT CGACAGCGCT GTACGCCGAC ATCGTCCTGC CGGCCGCGAG CCACTACGAG AAACACGACC TCAACATGAC GGACATGCAC ACCTACGTGC ATCCGTTTAC GCCCGCGGTC GAACCGCTGG GTGAGTCCAA GTCCGACTGG CAGATCTTCC GGGAACTCGC GGCGAAGATC CAGGAAATCG CCCGCGATCG GGACATCGAC CCGATCGACG ACCGGAAGTT CGACCGCCAG ATCGACCTCC AGTCGGTCCA CGACGACTAC GTCCGGGACT GGGTGAGCGA CGAAGACGGT GCCCTGGAAG AGGACCGGGC GGCCTGCGAG GCCATCCTCG AACACTCCAC GGAGACCAAT CCGGACGACG GCGGCGAGAT CACCTTCGCC GACACCGTCG ACCAGCCCCA GCGCTTCGAG GCCGCGGGCG ACCACTGGAC CTCCGACATC GAGGACGGCA CGGCCTACGC GCCCTGGAAG GACTTCGTCC AGGACAAGGA GCCCTGGCCA ACCCTGACGG GTCGCCAGCA GTACTACATC GATCACGACT GGTTCCTCGA TGTCGACGAG CAACTCCCGA CGCACAAGCG CCCGGTCGAG ACCAACGATC AGAGCGAGTA CCCCCTGCGG TACAACACGC CCCACGGTCG GTGGTCGATC CACTCGACGT GGCGCGACAG CGAAAAAATG TTGCAGTTGA ACCGGGGTGA GCCGGTGGTC TTCATTCACC CCGAGGACGC AAAGCACCGC GGGATCGAGG ACGGCGACAC GGTCGAGATC TACAACGACC TGGCGACGAT CGAGGCCAAC GCCAAGCTCT ACCCGGCCAG CGAACCCGGG ACCGTCCGGC ATTACTTCGC CTGGGAGCGC TACCAGTACC CCAGCCGGAA CAACTTCAAC TCGCTGATCC CGATGTACAT GAAACCCACC CAGCTCGTCC AGTACCCCGA AGACTCGGGC GAGCACCTCC ATTTCTTCCC GAACTTCTGG GGCCCGACCG GGGTCAACAG CGACGTCCGC TGTGATATTA GGCCGAAAGA AGGGGGTGAC GACTGA
|
Protein sequence | MSDPTQSDDS TDGVSRRDFL LGAGAAGVVG ATGLTVADRA LDGLETVDDP IGNYPYRDWE DFYREEWDWD SVARSTHSVN CTGSCSWDVY VKNGQVWREQ QANDYPTFDE SLPDPNPRGC QKGACYNDYV DAEQRVQYPM RRTGERGAGE WERISWDEAL TEIAEHVIEE IQNGRYDAIS GFTPIPAMSP VSFASGTRLV NLLGGVSHSF YDWYSDLPPG QPITWGHQTD NAESADWHNA EYIIAWGSNI NVTRIPDAKY FLDAGYEGAK RVGVFSDYSQ TAIHTDEWIA PDPGTDTALA LGMARTIVEE ELYDEAHLKE QSDMPLLVRN DTGKFLRASE VPGLSVAADE PEKVMVMQDG EGNLRAAPGS LGEREAKYDD SLSIELDFDP QLAVEDTVGT TDGEVAVTSV WNNLREELAN YTPEYVADET GVGKETHQKI AREFADVDRG KIIHGKGVND WYHNDLGNRA IQLLVTLTGH IGRNGTGVDH YVGQEKIWTF NGWKTLSFPT GSVRGVPTTL WTYYHAGILE NTDAETRRKI EEAVEKDWMP VYPEERGDGT RPDPSTMFVW RGNFFNQAKG NVAVEEVLWD KLDLVVDINF RLDSTALYAD IVLPAASHYE KHDLNMTDMH TYVHPFTPAV EPLGESKSDW QIFRELAAKI QEIARDRDID PIDDRKFDRQ IDLQSVHDDY VRDWVSDEDG ALEEDRAACE AILEHSTETN PDDGGEITFA DTVDQPQRFE AAGDHWTSDI EDGTAYAPWK DFVQDKEPWP TLTGRQQYYI DHDWFLDVDE QLPTHKRPVE TNDQSEYPLR YNTPHGRWSI HSTWRDSEKM LQLNRGEPVV FIHPEDAKHR GIEDGDTVEI YNDLATIEAN AKLYPASEPG TVRHYFAWER YQYPSRNNFN SLIPMYMKPT QLVQYPEDSG EHLHFFPNFW GPTGVNSDVR CDIRPKEGGD D
|
| |