Gene Huta_0024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0024 
Symbol 
ID8382284 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp23959 
End bp26814 
Gene Length2856 bp 
Protein Length951 aa 
Translation table11 
GC content65% 
IMG OID644971082 
ProductDMSO reductase family type II enzyme, molybdopterin subunit 
Protein accessionYP_003128946 
Protein GI257051113 
COG category[C] Energy production and conversion 
COG ID[COG5013] Nitrate reductase alpha subunit 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01580] respiratory nitrate reductase, alpha subunit
[TIGR03479] DMSO reductase family type II enzyme, molybdopterin subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGATC CAACCCAATC CGACGACAGT ACAGACGGCG TCTCACGTCG CGACTTCCTG 
CTGGGTGCCG GCGCGGCCGG CGTGGTCGGC GCGACCGGCC TGACCGTCGC CGATCGCGCC
CTCGACGGGC TCGAAACCGT CGACGATCCG ATCGGCAATT ATCCCTACCG CGACTGGGAG
GACTTCTACC GTGAGGAGTG GGACTGGGAT TCGGTCGCCC GGTCGACGCA CTCGGTCAAC
TGCACCGGCT CCTGTTCGTG GGACGTCTAC GTCAAGAACG GCCAGGTCTG GCGCGAACAA
CAGGCCAACG ACTACCCGAC CTTCGACGAG AGTCTCCCCG ATCCCAACCC CCGGGGCTGC
CAGAAGGGGG CCTGTTACAA CGACTACGTC GACGCCGAAC AGCGTGTCCA GTACCCGATG
CGTCGCACCG GCGAGCGCGG GGCCGGCGAG TGGGAGCGCA TCTCCTGGGA CGAGGCGCTG
ACCGAGATCG CCGAGCACGT CATCGAGGAG ATCCAGAACG GCCGCTACGA CGCGATTTCC
GGGTTCACGC CGATCCCCGC GATGAGTCCC GTGAGCTTCG CCTCGGGGAC GCGGCTGGTC
AACCTGCTCG GCGGCGTCTC TCACTCCTTC TACGACTGGT ATTCGGACCT CCCGCCGGGC
CAGCCGATCA CCTGGGGCCA TCAGACCGAC AACGCCGAGA GCGCCGACTG GCACAACGCC
GAGTACATCA TCGCGTGGGG GTCAAACATC AACGTCACGC GCATCCCCGA CGCGAAGTAC
TTCCTCGACG CGGGCTACGA GGGCGCAAAG CGGGTCGGCG TCTTCTCCGA TTACTCCCAG
ACCGCCATCC ACACCGACGA GTGGATCGCG CCTGATCCCG GTACCGACAC CGCGCTGGCG
CTTGGGATGG CCCGCACCAT CGTCGAGGAG GAGCTCTACG ACGAGGCACA CCTCAAGGAA
CAGTCCGACA TGCCACTGCT CGTTCGGAAC GATACGGGCA AGTTCCTCCG GGCGAGCGAG
GTCCCCGGCC TCTCGGTTGC AGCCGACGAA CCCGAGAAGG TCATGGTCAT GCAGGACGGG
GAGGGCAACC TGCGTGCTGC GCCGGGGTCG CTCGGTGAAC GCGAGGCCAA GTACGACGAC
TCCTTGTCGA TCGAACTCGA TTTCGACCCG CAACTGGCCG TCGAGGACAC CGTCGGGACG
ACCGACGGCG AGGTGGCCGT CACGTCGGTC TGGAACAACC TCCGGGAGGA ACTGGCCAAC
TACACGCCCG AGTACGTCGC CGACGAGACC GGCGTCGGGA AGGAGACTCA CCAGAAGATC
GCCCGGGAGT TCGCCGACGT CGACCGCGGG AAGATCATCC ACGGCAAGGG CGTCAACGAC
TGGTATCACA ACGACCTGGG CAACCGTGCG ATCCAGTTGC TCGTCACGTT GACGGGCCAC
ATCGGGCGGA ACGGCACCGG CGTCGACCAC TACGTCGGCC AGGAGAAGAT CTGGACGTTC
AACGGCTGGA AGACCCTCTC GTTCCCGACC GGTTCCGTCC GGGGCGTCCC GACGACGTTG
TGGACCTACT ACCACGCGGG CATCCTGGAG AACACCGACG CGGAGACCCG CCGGAAGATC
GAGGAGGCCG TCGAGAAGGA CTGGATGCCC GTCTACCCCG AGGAGCGCGG GGACGGCACC
CGACCCGATC CCTCGACGAT GTTCGTCTGG CGGGGCAACT TCTTCAACCA GGCGAAGGGC
AACGTCGCCG TCGAGGAGGT CCTCTGGGAC AAACTCGACC TCGTCGTGGA CATCAACTTC
CGGCTGGACT CGACAGCGCT GTACGCCGAC ATCGTCCTGC CGGCCGCGAG CCACTACGAG
AAACACGACC TCAACATGAC GGACATGCAC ACCTACGTGC ATCCGTTTAC GCCCGCGGTC
GAACCGCTGG GTGAGTCCAA GTCCGACTGG CAGATCTTCC GGGAACTCGC GGCGAAGATC
CAGGAAATCG CCCGCGATCG GGACATCGAC CCGATCGACG ACCGGAAGTT CGACCGCCAG
ATCGACCTCC AGTCGGTCCA CGACGACTAC GTCCGGGACT GGGTGAGCGA CGAAGACGGT
GCCCTGGAAG AGGACCGGGC GGCCTGCGAG GCCATCCTCG AACACTCCAC GGAGACCAAT
CCGGACGACG GCGGCGAGAT CACCTTCGCC GACACCGTCG ACCAGCCCCA GCGCTTCGAG
GCCGCGGGCG ACCACTGGAC CTCCGACATC GAGGACGGCA CGGCCTACGC GCCCTGGAAG
GACTTCGTCC AGGACAAGGA GCCCTGGCCA ACCCTGACGG GTCGCCAGCA GTACTACATC
GATCACGACT GGTTCCTCGA TGTCGACGAG CAACTCCCGA CGCACAAGCG CCCGGTCGAG
ACCAACGATC AGAGCGAGTA CCCCCTGCGG TACAACACGC CCCACGGTCG GTGGTCGATC
CACTCGACGT GGCGCGACAG CGAAAAAATG TTGCAGTTGA ACCGGGGTGA GCCGGTGGTC
TTCATTCACC CCGAGGACGC AAAGCACCGC GGGATCGAGG ACGGCGACAC GGTCGAGATC
TACAACGACC TGGCGACGAT CGAGGCCAAC GCCAAGCTCT ACCCGGCCAG CGAACCCGGG
ACCGTCCGGC ATTACTTCGC CTGGGAGCGC TACCAGTACC CCAGCCGGAA CAACTTCAAC
TCGCTGATCC CGATGTACAT GAAACCCACC CAGCTCGTCC AGTACCCCGA AGACTCGGGC
GAGCACCTCC ATTTCTTCCC GAACTTCTGG GGCCCGACCG GGGTCAACAG CGACGTCCGC
TGTGATATTA GGCCGAAAGA AGGGGGTGAC GACTGA
 
Protein sequence
MSDPTQSDDS TDGVSRRDFL LGAGAAGVVG ATGLTVADRA LDGLETVDDP IGNYPYRDWE 
DFYREEWDWD SVARSTHSVN CTGSCSWDVY VKNGQVWREQ QANDYPTFDE SLPDPNPRGC
QKGACYNDYV DAEQRVQYPM RRTGERGAGE WERISWDEAL TEIAEHVIEE IQNGRYDAIS
GFTPIPAMSP VSFASGTRLV NLLGGVSHSF YDWYSDLPPG QPITWGHQTD NAESADWHNA
EYIIAWGSNI NVTRIPDAKY FLDAGYEGAK RVGVFSDYSQ TAIHTDEWIA PDPGTDTALA
LGMARTIVEE ELYDEAHLKE QSDMPLLVRN DTGKFLRASE VPGLSVAADE PEKVMVMQDG
EGNLRAAPGS LGEREAKYDD SLSIELDFDP QLAVEDTVGT TDGEVAVTSV WNNLREELAN
YTPEYVADET GVGKETHQKI AREFADVDRG KIIHGKGVND WYHNDLGNRA IQLLVTLTGH
IGRNGTGVDH YVGQEKIWTF NGWKTLSFPT GSVRGVPTTL WTYYHAGILE NTDAETRRKI
EEAVEKDWMP VYPEERGDGT RPDPSTMFVW RGNFFNQAKG NVAVEEVLWD KLDLVVDINF
RLDSTALYAD IVLPAASHYE KHDLNMTDMH TYVHPFTPAV EPLGESKSDW QIFRELAAKI
QEIARDRDID PIDDRKFDRQ IDLQSVHDDY VRDWVSDEDG ALEEDRAACE AILEHSTETN
PDDGGEITFA DTVDQPQRFE AAGDHWTSDI EDGTAYAPWK DFVQDKEPWP TLTGRQQYYI
DHDWFLDVDE QLPTHKRPVE TNDQSEYPLR YNTPHGRWSI HSTWRDSEKM LQLNRGEPVV
FIHPEDAKHR GIEDGDTVEI YNDLATIEAN AKLYPASEPG TVRHYFAWER YQYPSRNNFN
SLIPMYMKPT QLVQYPEDSG EHLHFFPNFW GPTGVNSDVR CDIRPKEGGD D