Gene Information |
Locus tag | Hlac_0519 |
Symbol | |
ID | 7400400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 539759 |
End bp | 541756 |
Gene Length | 1998 bp |
Protein Length | 665 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643707584 |
Product | aldo/keto reductase |
Protein accession | YP_002565191 |
Protein GI | 222478954 |
COG category | [R] General function prediction only |
COG ID | [COG0656] Aldo/keto reductases, related to diketogulonate reductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.798148 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.732895 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACTGTC TGTTCGTGGG AGCGGGGTCG ATCGCGCCGG AGTACGCCGC CGGCCTGTCT GGGAGTTCAC TGTCGCTGGC CGGCGTCGTC GACCTCGACG CGGACCGCGC TGCAGCGCTC GCCGCGGATC ACGACTGCCC GTCGTTCACG GATCTGGAGA CGGCGCTTTC GGCGGTCGAC GCGCCGCTCG TGGTGAACCT GACGAGCCAC GCCGCCCACG CGCCGGTGAC GCGGACCGCG CTGGAGGCAG ACCGTCACGT CTACTCGCAG AAGCCGCTCG CGCTCGACGC CGACGAGGCG AAGGCGCTGG TGGCGCTCGC CCGTGAACGC GACCTCGGAC TGGGCTGTGC GCCCGGGACG CCGCGAGCGC CCTCCCAACG CCGCGCGGGT CGACTCCTCG CCGACGGTCG GCTCGGCCCG GTGGGACTGG GGTACGCCCA CGCCCACGTC GGCCGCGTGA CCGACTGGCA CGACCGCCCC GACTCCTTTC TCGAAATCGG ACCCCTGTAC GACGGCGCGG TGTACCCGCT CGCGCTGCTG GCCTCGTGGT TCGGGCCCGT CGAGCGCGTT CGCGTCGCCG ACGCTCTCGA CGTCTGGCCC GAGCGGGAGG ACCGACGACC GTCGACGCCG AGCCACGTCG AGGCGACGCT AGCGTTCGCG GCGGGTCCGA CGGTCAGGCT GACCGCGAGC TTCTACGCGC CCCACCGGAG CCGGGAGTTC TACGGGCTGG AGCTCCACGG CGACGACGGC TCGCTGTACC TCAAGGGAAC CGGCGCGATG GAGACCGGCC GCGATCACGT CCGGTTCGGC CGCGTCGGAC GGGAGTACGT GAGCGCGCCG CCGACCTCTC CCGAGGAGCC GTACGAGTAC GTCGGCGCCG TCGAGCGCCT CGCCGCGACG ATCGAGGCGG GCTCGCCGTC CCGGGCCGGC GGCCGCCGGG GCGCCCACGT CGTCGCCGTC TGTAACGCGA TCGAGGCGGC CGCCGGGGGC GAGGGGCCAG TCGTCGTCGA CGACTGCGGA GCGACCGCCG ACCCGCCGCC GGCGCCGGTC GTCAGACCGG CGACGACCAC CGAGAGCGAC GCGGGACGCG AGAGCGAGAA GGGCCCGGGC GCTGCCGCGA TCCGGCTCCC CGCGGTCGGA TTCGGCTGCT CGCGCTACCG CGACGGAGAG TACGTCGACC GAGTCGACTC GATCGCGACC GCGCTCGACG CCGGGTACCG CCTCCTCGAC TCCGCCGAGC TGTACGGGAA CGAACACCGG ATCGGGGAGC TGCTCGCCGC GCCGGGCGCG CCGGACCGCG AGCGCGTGTT CCTCCTCGGA AAGGCGTGGC GTACCAATCA CCGCCGCGAA CACCTGCTCG CCGCCTGCGC TGGCAGCCGC GAGGAGCTGG GGATCGACGC GTTCGACTGC TACGCGCTCC ACTGGCCGTC GGCGCTCGAA CACCGGGGCG AGCTGAATCG TCTCGCCGAG AAGCCGGTTG AGCGACAGGA GACGCTCACC TTTCCAGAGG GAGAGGACGG CGAGCCCGCG ACCGCCGACG TGGCGCTCGC GACGGCGTGG CGGAACCTGG AGGCCGTCCA CGAGCGGGGG TGGGCCCGGA CGCTCGGGAT CTGTAACGTC TCGCGGGCGC AACTGGAGAC AGTGCTGGAG ACGGGCGAGA TCGACCCGGC GCTTGTACAG GTGGAGCGGC ACCCGTATCG ACCCCGGAAC GGGCTCGTCG AGCTCTGTCA CGGGCGGGGA ATCCGCGTCG TCGCGCACTC GCCGCTGTCG GCGCCCGGCC TCCTCGACGA GCCCGTCCTG 
AACGCGATCG GAACGGAGCG AGGGCTCTCA CCCGCCGAAG TCGTCATCGC GTGGAACGCT TCTCAAGGAG TCGTCCCGAT CCCGTCCAGC ACCGCCGAGT CGCACGTCGT CTCGAACCTG GCCGCCGGGA GCGAGCGACT GACTGCTGAC GAGGTCGCGC GCATCGATGC CCTGCGGGAT CCGAACTTCG AGCGGTAG
|
Protein sequence | MNCLFVGAGS IAPEYAAGLS GSSLSLAGVV DLDADRAAAL AADHDCPSFT DLETALSAVD APLVVNLTSH AAHAPVTRTA LEADRHVYSQ KPLALDADEA KALVALARER DLGLGCAPGT PRAPSQRRAG RLLADGRLGP VGLGYAHAHV GRVTDWHDRP DSFLEIGPLY DGAVYPLALL ASWFGPVERV RVADALDVWP EREDRRPSTP SHVEATLAFA AGPTVRLTAS FYAPHRSREF YGLELHGDDG SLYLKGTGAM ETGRDHVRFG RVGREYVSAP PTSPEEPYEY VGAVERLAAT IEAGSPSRAG GRRGAHVVAV CNAIEAAAGG EGPVVVDDCG ATADPPPAPV VRPATTTESD AGRESEKGPG AAAIRLPAVG FGCSRYRDGE YVDRVDSIAT ALDAGYRLLD SAELYGNEHR IGELLAAPGA PDRERVFLLG KAWRTNHRRE HLLAACAGSR EELGIDAFDC YALHWPSALE HRGELNRLAE KPVERQETLT FPEGEDGEPA TADVALATAW RNLEAVHERG WARTLGICNV SRAQLETVLE TGEIDPALVQ VERHPYRPRN GLVELCHGRG IRVVAHSPLS APGLLDEPVL NAIGTERGLS PAEVVIAWNA SQGVVPIPSS TAESHVVSNL AAGSERLTAD EVARIDALRD PNFER
|
| |
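The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the gene length includes the terminal stop codon (the gene sequence above ends in TAG). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced here for illustration.

```python
# Values copied from the Gene Information table of this record.
START_BP = 539759
END_BP = 541756
GENE_LENGTH_BP = 1998
PROTEIN_LENGTH_AA = 665


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; spaces and line breaks are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)
```

Under these assumptions, `gene_length(START_BP, END_BP)` reproduces the listed gene length of 1998 bp, and `protein_length(GENE_LENGTH_BP)` reproduces the listed protein length of 665 aa. Passing the full Gene sequence field to `gc_percent` should give a value comparable to the listed GC content, although the database may round or compute that figure differently.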