Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1335 |
Symbol | |
ID | 7399430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1347855 |
End bp | 1350845 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643708399 |
Product | DMSO reductase family type II enzyme, molybdopterin subunit |
Protein accession | YP_002565997 |
Protein GI | 222479760 |
COG category | [C] Energy production and conversion |
COG ID | [COG5013] Nitrate reductase alpha subunit |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01580] respiratory nitrate reductase, alpha subunit [TIGR03479] DMSO reductase family type II enzyme, molybdopterin subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGACA CGACAAACAC GTCCGACGAA TCGACACGCG ACGAGTCCGG TACCGCGCGC CGCGACTTCC TGAAGGGGGC CGGGGTGGCG GCCGCCGTCG GCGCCACCGG CTTGGGCTCC GCGCAGGACC TCACCGAGAT GACCGCGCTG GAGGTCGTCG ACGACCCGAT CGGCAACTAC CCGTACCGCG ACTGGGAGGA CCTCTACCGC GAGGAGTGGG ACTGGGACGG GAAGGCCCGC TCGACGCACA GCGTCAACTG CACGGGCAGC TGCTCGTGGC AGGTGTACAC GCGGAATGGG CAGGTGTGGC GCGAGGAGCA AGCCGGCGAC TACCCCCGAT TCGACGAGTC GCTCCCGGAT CCGAACCCGC GGGGCTGTCA GAAGGGAGCG TGTTTCAGCG ACTACGTGAA CGCCGACCAG CGCGTCACGC ACCCGCTCCG TCGGACCGGC GAGCGCGGCG AGGGGAAGTG GGAGCGGATC TCCTGGGACG AGGCGCTCAC CGAGATCGCC GAGGAGGTCA TCGACGCCGT CGAGGACGAG GAGTACGACG CGATCAGCGG CTTCACCCCG ATCCCGGCGA TGAGCCCGGT CTCGTTCGCC TCTGGGAGCC GCCTCGTCAA CCTGCTCGGC GGCGTCAGCC ACTCCTTCTA CGACTGGTAC TCCGACCTCC CGCCGGGACA GCCGATCACG TGGGGGACCC AGACCGAGAA CGCCGAGAGC GCGGACTGGT ACAACGCCGA CTACATCATC GCGTGGGGCT CGAACATCAA CGTCACGCGG ATCCCCGATG CGAAGTACTT CCTCGAAGCC GGCTACGACG GCGCCAAGCG CGTCGGCATC TTCACCGACT ACAGCCAGAC GGCGATCCAC TGCGACGAGT GGATCGGCCC GGAGCCCGGC AGCGACACGG CGCTCGCGCT CGGGATGGCG CGCACCATCG TCGACGAGGG GCTCTACGAC GAGGAGCATT TAAAAGAGCA GACGGACATG CCGCTGCTCG TCCGCGAGGA CACCGGGAAG TTCCTCCGCG CGAGCGAGGT GTCCGGGCTG AGCGTCGACG CCGACCGGCC GGAGAAGGTG TTCGTCATGC AGGACGCCGA CGGCACCCTC CGGACGGCGC CCGGATCGCT GGGCGAGCGC GACGGACAGC ACGACCACTC CGTGAGCATC GAGCTCGACT TCGACCCCGA ACTCGCCGTC AAGCGCACGG TCGGCACGAC CGACGGCGAC ATCGAGACGC GCTCGGTGTG GCTGAATCTC CGCGACGAGC TGTCGGAGTA CACCCCCGAG CGCGTCAACG AGATCACGGG CGTCGGCCGG CAGACTCACC AGAAGATCGC CCGCGAGTTC GCCGAAGTCG ACCGCGCGAA GATCATCCAC GGGAAGGGGG TCAACGACTG GTACCACAAC GATCTGGGGA ACCGTTCGAT CCAACTGCTC GTCACGCTCA CGGGGAACCT CGGCCGACAG GGAACGGGTC TCGACCACTA CGTCGGCCAG GAGAAGATCT GGACGTTCAA CGGGTGGAAG AACCTCTCGT TCCCGACCGG AAGCGTCCGG GGCGTCCCGA CGACGCTCTG GACGTACTAC CACGCCGGGA TCATGGAGAA CACCGACCCG GACACGGCGG CGAAGATTCG GGAGTCGATC GACAAGGGGT GGATGCCGCT GTACCCGAGC GAACGCGACG ACAGCGGCCG CCCGGACCCC CGAGTCATGT TCGTCTGGCG CGGGAACTAC TTCAATCAGG CCAAGGGGAA CATCGCGGTC GAAGAGGAGC TGTGGCCGAA GCTCGATTTG GTCGTCGACA TCAACTTCCG GATGGACTCG ACGGCGCTCA ACAGCGATAT CGTCCTGCCG ACGGCGAGCC ACTACGAGAA ACACGACCTC TCGGAGACGG ACATGCACAC CTACGTGCAC CCGTTCACGC CGGCGGTCGA GCCGCTGGGC GAGTCGAAGA CCGACTGGCA GATCTTCCGC GATCTCGCCG CGAAGATACA GGAACTGGCC GCGGACCGCG GTACGGAGCC GGTCGACGAT CGGAAGTTCG ACCGCCAGAT CGACCTCCAG TCGGTCCACG ACGACTACGT CCGCGACTGG CTCGACGACG AGCCCGGTGC GCTGTCCGAG GACAAGGCGG CCGCGGAGTT CATCCTCGAG AACTCCGAGG AGACGAACCC CGAGGGGACC GACGAGCAGA TCACGTTCGA CGAGATCGAC GAACAGCCCA AACGGCTCCT CGAAACCGGC GACCACTGGT CGTCGGACAT CGAGGAGGGC GAGGCGTACA CGCCGTGGAA AGACTACGTG CAGGACAAGA ACCCGTGGCC GACGTTCACG GGGCGACAGC AGTACTACAT CGACCACGAC TGGTTCTTGG AGCTCGACGA GCAGCTTCCG ACCCACAAGG AGGCGCCGGT GTTACAGGAG AAGTCGGAGT ACCCGCTGGG GTACAACACG CCCCACAGCC GCTGGTCGAT CCACTCGACG TGGCGCGACA GCAGCAAGAT GCTCCGGCTG CAGCGCGGCG AGCCGACCGT CTACCTCAAC CCCGACGACG CCGAGGAGCG GGGGATCGAG GACGGCGACA CCGTGCGGGT GTACAACGAC CTCGACTCGG TCGAGGTGCA GGCGAAGATC TACCCGAGCG GGGAGCCCGG CACCGTCCGG CACTTCTTCA GCTGGGAGCG GTTCCAGTAC CCCGACCGCA ACAACTTCAA TTCGCTCGTC CCGATGTACA TGAAGCCGAC GCAGCTCGTC CAGTACCCCG AGGACACGGG CGAGCACCTC CACTTCTTCC CGAACTACTG GGGTCCGACC GGCGTGAACA GCGACGTTCG GATCGAGGTC GAGAAGGTCG AAGAGGGGGC AGCGTCGCTC GACGCCGGTT TGCTCGGAGA GTCGGCGACG GAGTTGGAGT CGCCCGATTC GGGCTCGAAA TCGCGCGGCG CGTTGGGTCG CGTCTCGGAC CTGCTCGGAG GTGACGACTA A
|
Protein sequence | MTDTTNTSDE STRDESGTAR RDFLKGAGVA AAVGATGLGS AQDLTEMTAL EVVDDPIGNY PYRDWEDLYR EEWDWDGKAR STHSVNCTGS CSWQVYTRNG QVWREEQAGD YPRFDESLPD PNPRGCQKGA CFSDYVNADQ RVTHPLRRTG ERGEGKWERI SWDEALTEIA EEVIDAVEDE EYDAISGFTP IPAMSPVSFA SGSRLVNLLG GVSHSFYDWY SDLPPGQPIT WGTQTENAES ADWYNADYII AWGSNINVTR IPDAKYFLEA GYDGAKRVGI FTDYSQTAIH CDEWIGPEPG SDTALALGMA RTIVDEGLYD EEHLKEQTDM PLLVREDTGK FLRASEVSGL SVDADRPEKV FVMQDADGTL RTAPGSLGER DGQHDHSVSI ELDFDPELAV KRTVGTTDGD IETRSVWLNL RDELSEYTPE RVNEITGVGR QTHQKIAREF AEVDRAKIIH GKGVNDWYHN DLGNRSIQLL VTLTGNLGRQ GTGLDHYVGQ EKIWTFNGWK NLSFPTGSVR GVPTTLWTYY HAGIMENTDP DTAAKIRESI DKGWMPLYPS ERDDSGRPDP RVMFVWRGNY FNQAKGNIAV EEELWPKLDL VVDINFRMDS TALNSDIVLP TASHYEKHDL SETDMHTYVH PFTPAVEPLG ESKTDWQIFR DLAAKIQELA ADRGTEPVDD RKFDRQIDLQ SVHDDYVRDW LDDEPGALSE DKAAAEFILE NSEETNPEGT DEQITFDEID EQPKRLLETG DHWSSDIEEG EAYTPWKDYV QDKNPWPTFT GRQQYYIDHD WFLELDEQLP THKEAPVLQE KSEYPLGYNT PHSRWSIHST WRDSSKMLRL QRGEPTVYLN PDDAEERGIE DGDTVRVYND LDSVEVQAKI YPSGEPGTVR HFFSWERFQY PDRNNFNSLV PMYMKPTQLV QYPEDTGEHL HFFPNYWGPT GVNSDVRIEV EKVEEGAASL DAGLLGESAT ELESPDSGSK SRGALGRVSD LLGGDD
|
| |