Gene Hlac_1335 details


Gene Information

Locus tag: Hlac_1335
Symbol:
ID: 7399430
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1347855
End bp: 1350845
Gene Length: 2991 bp
Protein Length: 996 aa
Translation table: 11
GC content: 66%
IMG OID: 643708399
Product: DMSO reductase family type II enzyme, molybdopterin subunit
Protein accession: YP_002565997
Protein GI: 222479760
COG category: [C] Energy production and conversion
COG ID: [COG5013] Nitrate reductase alpha subunit
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
            [TIGR01580] respiratory nitrate reductase, alpha subunit
            [TIGR03479] DMSO reductase family type II enzyme, molybdopterin subunit
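
The coordinate and length fields can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are inclusive, 1-based coordinates and that the gene length includes one terminal stop codon; both are assumptions rather than documented properties of this record, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1347855
end_bp = 1350845
gene_length_bp = 2991
protein_length_aa = 996

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # True
print(gene_length_bp % 3 == 0)             # True
print(coding_codons == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 2991 bp is 997 codons, of which 996 encode amino acids.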


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGACA CGACAAACAC GTCCGACGAA TCGACACGCG ACGAGTCCGG TACCGCGCGC 
CGCGACTTCC TGAAGGGGGC CGGGGTGGCG GCCGCCGTCG GCGCCACCGG CTTGGGCTCC
GCGCAGGACC TCACCGAGAT GACCGCGCTG GAGGTCGTCG ACGACCCGAT CGGCAACTAC
CCGTACCGCG ACTGGGAGGA CCTCTACCGC GAGGAGTGGG ACTGGGACGG GAAGGCCCGC
TCGACGCACA GCGTCAACTG CACGGGCAGC TGCTCGTGGC AGGTGTACAC GCGGAATGGG
CAGGTGTGGC GCGAGGAGCA AGCCGGCGAC TACCCCCGAT TCGACGAGTC GCTCCCGGAT
CCGAACCCGC GGGGCTGTCA GAAGGGAGCG TGTTTCAGCG ACTACGTGAA CGCCGACCAG
CGCGTCACGC ACCCGCTCCG TCGGACCGGC GAGCGCGGCG AGGGGAAGTG GGAGCGGATC
TCCTGGGACG AGGCGCTCAC CGAGATCGCC GAGGAGGTCA TCGACGCCGT CGAGGACGAG
GAGTACGACG CGATCAGCGG CTTCACCCCG ATCCCGGCGA TGAGCCCGGT CTCGTTCGCC
TCTGGGAGCC GCCTCGTCAA CCTGCTCGGC GGCGTCAGCC ACTCCTTCTA CGACTGGTAC
TCCGACCTCC CGCCGGGACA GCCGATCACG TGGGGGACCC AGACCGAGAA CGCCGAGAGC
GCGGACTGGT ACAACGCCGA CTACATCATC GCGTGGGGCT CGAACATCAA CGTCACGCGG
ATCCCCGATG CGAAGTACTT CCTCGAAGCC GGCTACGACG GCGCCAAGCG CGTCGGCATC
TTCACCGACT ACAGCCAGAC GGCGATCCAC TGCGACGAGT GGATCGGCCC GGAGCCCGGC
AGCGACACGG CGCTCGCGCT CGGGATGGCG CGCACCATCG TCGACGAGGG GCTCTACGAC
GAGGAGCATT TAAAAGAGCA GACGGACATG CCGCTGCTCG TCCGCGAGGA CACCGGGAAG
TTCCTCCGCG CGAGCGAGGT GTCCGGGCTG AGCGTCGACG CCGACCGGCC GGAGAAGGTG
TTCGTCATGC AGGACGCCGA CGGCACCCTC CGGACGGCGC CCGGATCGCT GGGCGAGCGC
GACGGACAGC ACGACCACTC CGTGAGCATC GAGCTCGACT TCGACCCCGA ACTCGCCGTC
AAGCGCACGG TCGGCACGAC CGACGGCGAC ATCGAGACGC GCTCGGTGTG GCTGAATCTC
CGCGACGAGC TGTCGGAGTA CACCCCCGAG CGCGTCAACG AGATCACGGG CGTCGGCCGG
CAGACTCACC AGAAGATCGC CCGCGAGTTC GCCGAAGTCG ACCGCGCGAA GATCATCCAC
GGGAAGGGGG TCAACGACTG GTACCACAAC GATCTGGGGA ACCGTTCGAT CCAACTGCTC
GTCACGCTCA CGGGGAACCT CGGCCGACAG GGAACGGGTC TCGACCACTA CGTCGGCCAG
GAGAAGATCT GGACGTTCAA CGGGTGGAAG AACCTCTCGT TCCCGACCGG AAGCGTCCGG
GGCGTCCCGA CGACGCTCTG GACGTACTAC CACGCCGGGA TCATGGAGAA CACCGACCCG
GACACGGCGG CGAAGATTCG GGAGTCGATC GACAAGGGGT GGATGCCGCT GTACCCGAGC
GAACGCGACG ACAGCGGCCG CCCGGACCCC CGAGTCATGT TCGTCTGGCG CGGGAACTAC
TTCAATCAGG CCAAGGGGAA CATCGCGGTC GAAGAGGAGC TGTGGCCGAA GCTCGATTTG
GTCGTCGACA TCAACTTCCG GATGGACTCG ACGGCGCTCA ACAGCGATAT CGTCCTGCCG
ACGGCGAGCC ACTACGAGAA ACACGACCTC TCGGAGACGG ACATGCACAC CTACGTGCAC
CCGTTCACGC CGGCGGTCGA GCCGCTGGGC GAGTCGAAGA CCGACTGGCA GATCTTCCGC
GATCTCGCCG CGAAGATACA GGAACTGGCC GCGGACCGCG GTACGGAGCC GGTCGACGAT
CGGAAGTTCG ACCGCCAGAT CGACCTCCAG TCGGTCCACG ACGACTACGT CCGCGACTGG
CTCGACGACG AGCCCGGTGC GCTGTCCGAG GACAAGGCGG CCGCGGAGTT CATCCTCGAG
AACTCCGAGG AGACGAACCC CGAGGGGACC GACGAGCAGA TCACGTTCGA CGAGATCGAC
GAACAGCCCA AACGGCTCCT CGAAACCGGC GACCACTGGT CGTCGGACAT CGAGGAGGGC
GAGGCGTACA CGCCGTGGAA AGACTACGTG CAGGACAAGA ACCCGTGGCC GACGTTCACG
GGGCGACAGC AGTACTACAT CGACCACGAC TGGTTCTTGG AGCTCGACGA GCAGCTTCCG
ACCCACAAGG AGGCGCCGGT GTTACAGGAG AAGTCGGAGT ACCCGCTGGG GTACAACACG
CCCCACAGCC GCTGGTCGAT CCACTCGACG TGGCGCGACA GCAGCAAGAT GCTCCGGCTG
CAGCGCGGCG AGCCGACCGT CTACCTCAAC CCCGACGACG CCGAGGAGCG GGGGATCGAG
GACGGCGACA CCGTGCGGGT GTACAACGAC CTCGACTCGG TCGAGGTGCA GGCGAAGATC
TACCCGAGCG GGGAGCCCGG CACCGTCCGG CACTTCTTCA GCTGGGAGCG GTTCCAGTAC
CCCGACCGCA ACAACTTCAA TTCGCTCGTC CCGATGTACA TGAAGCCGAC GCAGCTCGTC
CAGTACCCCG AGGACACGGG CGAGCACCTC CACTTCTTCC CGAACTACTG GGGTCCGACC
GGCGTGAACA GCGACGTTCG GATCGAGGTC GAGAAGGTCG AAGAGGGGGC AGCGTCGCTC
GACGCCGGTT TGCTCGGAGA GTCGGCGACG GAGTTGGAGT CGCCCGATTC GGGCTCGAAA
TCGCGCGGCG CGTTGGGTCG CGTCTCGGAC CTGCTCGGAG GTGACGACTA A
 
Protein sequence
MTDTTNTSDE STRDESGTAR RDFLKGAGVA AAVGATGLGS AQDLTEMTAL EVVDDPIGNY 
PYRDWEDLYR EEWDWDGKAR STHSVNCTGS CSWQVYTRNG QVWREEQAGD YPRFDESLPD
PNPRGCQKGA CFSDYVNADQ RVTHPLRRTG ERGEGKWERI SWDEALTEIA EEVIDAVEDE
EYDAISGFTP IPAMSPVSFA SGSRLVNLLG GVSHSFYDWY SDLPPGQPIT WGTQTENAES
ADWYNADYII AWGSNINVTR IPDAKYFLEA GYDGAKRVGI FTDYSQTAIH CDEWIGPEPG
SDTALALGMA RTIVDEGLYD EEHLKEQTDM PLLVREDTGK FLRASEVSGL SVDADRPEKV
FVMQDADGTL RTAPGSLGER DGQHDHSVSI ELDFDPELAV KRTVGTTDGD IETRSVWLNL
RDELSEYTPE RVNEITGVGR QTHQKIAREF AEVDRAKIIH GKGVNDWYHN DLGNRSIQLL
VTLTGNLGRQ GTGLDHYVGQ EKIWTFNGWK NLSFPTGSVR GVPTTLWTYY HAGIMENTDP
DTAAKIRESI DKGWMPLYPS ERDDSGRPDP RVMFVWRGNY FNQAKGNIAV EEELWPKLDL
VVDINFRMDS TALNSDIVLP TASHYEKHDL SETDMHTYVH PFTPAVEPLG ESKTDWQIFR
DLAAKIQELA ADRGTEPVDD RKFDRQIDLQ SVHDDYVRDW LDDEPGALSE DKAAAEFILE
NSEETNPEGT DEQITFDEID EQPKRLLETG DHWSSDIEEG EAYTPWKDYV QDKNPWPTFT
GRQQYYIDHD WFLELDEQLP THKEAPVLQE KSEYPLGYNT PHSRWSIHST WRDSSKMLRL
QRGEPTVYLN PDDAEERGIE DGDTVRVYND LDSVEVQAKI YPSGEPGTVR HFFSWERFQY
PDRNNFNSLV PMYMKPTQLV QYPEDTGEHL HFFPNYWGPT GVNSDVRIEV EKVEEGAASL
DAGLLGESAT ELESPDSGSK SRGALGRVSD LLGGDD
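
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming that translation table 11 assigns codons to amino acids exactly as the standard code does (the two differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are illustrative and not part of any database tool; only the first and last fragments of the gene sequence are used here.

```python
from itertools import product

# Standard codon assignments in T, C, A, G order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and last 21 bases of the gene sequence listed above.
head = "ATGACTGACA CGACAAACAC GTCCGACGAA"
tail = "CTGCTCGGAG GTGACGACTA A"

print(translate(head))  # MTDTTNTSDE
print(translate(tail))  # LLGGDD
```

Both fragments match the corresponding ends of the listed protein sequence, and the final TAA codon is consumed as the stop.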