Gene Information |
Locus tag | Hlac_2005 |
Symbol | |
ID | 7402024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1999106 |
End bp | 2000125 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643709076 |
Product | aldo/keto reductase |
Protein accession | YP_002566653 |
Protein GI | 222480416 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
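
The coordinate and length fields above are mutually consistent, and a short check makes the relationship explicit. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1999106, 2000125)
print(length)                  # 1020
print(protein_length(length))  # 339
```

Passing the gene sequence from the Sequence section to `gc_percent` (the spaces between 10-base blocks are ignored) gives a value that can be compared with the reported GC content.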
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.188789 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.288175 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCACC GAACGCTCGG GAACGTCGGC GAGGTCAGCG AGGTCGGCTA CGGCGCGTGG CAGATCGGGA GCGACTGGGG CGACGTGACC GAGTCCGAGG CGGTCGCAGC GGTCGAGGCC GCCCGCGACG CCGGAATCGA TTTCTTCGAC ACCGCAGATG TGTACGGCGA CGGGCGCTCG GAGCGCATCC TCGGCGACGT GCTCGAAGAC GACATCGACG CCGAGGAGGT GACGGTCGCC ACGAAGGCCG GTCGCCGGCT CAACCCCCAC GTCCCCGACG AGTATAACGA GACGAACCTC CGGCGGTTCG TGAACCGATC GCGCGAGAAC CTCGGGATGG AGACGCTGGA TCTGGTCCAG CTTCACTGCC CGCCGACGGA TGTGTACTAC CAGCCGGAGA CGTTCGACGC GCTCGCGGCC CTCGCCGACG AGGGAAAGAT CGCGAACTAC GGCGTCAGCG TTGAACGCGT CGAGGAGGGA CTGAAAGCGA TCGAGTACCC TGGCGTCGAG ACGGTCCAGA TCATCTTCAA CCCCTTCCGC CAGCGTCCCG CGAAGCTGTT CCTCGACGAG GCCGCCGCCC GCGACGTGGG CGTGATCTGT CGGGTGCCGC TCGCCTCCGG CCTCCTGACC GGCGCGCTCT CGCGGGACAC CGACTTCGCC GAGGACGACC ACCGCAACTA CAACCGCGAG GGCGACGCGT TCGACGTGGG CGAGACGTTC GCCGGCGTCC CCTACGAGGT CGGCCTCGAC GCCGCCGACG CGCTCGCCGA GCGGGTCGAC GCCGTCGCCG ACGACCCGAC CGACGCGACC GGCAACGACC TCTCGCTCCC GCAGCTCTCG CTGCGCTGGC TCCTCGACCA CGACGCCGTC TCGGCGGTCA TTCCGGGGTC GACAACGCCG GAGCACATTC GGTCGAACGC GGCCGCGAGC GACCTCGTGC CGCTCGGCGA GGCCGACCGC GAGGCCGTCG CTGACGTGTA CGACGAGTTC GTCCGCGAGC ACGTCCACCA GCGCTGGTAG
|
Protein sequence | MNHRTLGNVG EVSEVGYGAW QIGSDWGDVT ESEAVAAVEA ARDAGIDFFD TADVYGDGRS ERILGDVLED DIDAEEVTVA TKAGRRLNPH VPDEYNETNL RRFVNRSREN LGMETLDLVQ LHCPPTDVYY QPETFDALAA LADEGKIANY GVSVERVEEG LKAIEYPGVE TVQIIFNPFR QRPAKLFLDE AAARDVGVIC RVPLASGLLT GALSRDTDFA EDDHRNYNRE GDAFDVGETF AGVPYEVGLD AADALAERVD AVADDPTDAT GNDLSLPQLS LRWLLDHDAV SAVIPGSTTP EHIRSNAAAS DLVPLGEADR EAVADVYDEF VREHVHQRW