Gene Hlac_0519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0519 
Symbol 
ID7400400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp539759 
End bp541756 
Gene Length1998 bp 
Protein Length665 aa 
Translation table11 
GC content73% 
IMG OID643707584 
Productaldo/keto reductase 
Protein accessionYP_002565191 
Protein GI222478954 
COG category[R] General function prediction only 
COG ID[COG0656] Aldo/keto reductases, related to diketogulonate reductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.798148 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.732895 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACTGTC TGTTCGTGGG AGCGGGGTCG ATCGCGCCGG AGTACGCCGC CGGCCTGTCT 
GGGAGTTCAC TGTCGCTGGC CGGCGTCGTC GACCTCGACG CGGACCGCGC TGCAGCGCTC
GCCGCGGATC ACGACTGCCC GTCGTTCACG GATCTGGAGA CGGCGCTTTC GGCGGTCGAC
GCGCCGCTCG TGGTGAACCT GACGAGCCAC GCCGCCCACG CGCCGGTGAC GCGGACCGCG
CTGGAGGCAG ACCGTCACGT CTACTCGCAG AAGCCGCTCG CGCTCGACGC CGACGAGGCG
AAGGCGCTGG TGGCGCTCGC CCGTGAACGC GACCTCGGAC TGGGCTGTGC GCCCGGGACG
CCGCGAGCGC CCTCCCAACG CCGCGCGGGT CGACTCCTCG CCGACGGTCG GCTCGGCCCG
GTGGGACTGG GGTACGCCCA CGCCCACGTC GGCCGCGTGA CCGACTGGCA CGACCGCCCC
GACTCCTTTC TCGAAATCGG ACCCCTGTAC GACGGCGCGG TGTACCCGCT CGCGCTGCTG
GCCTCGTGGT TCGGGCCCGT CGAGCGCGTT CGCGTCGCCG ACGCTCTCGA CGTCTGGCCC
GAGCGGGAGG ACCGACGACC GTCGACGCCG AGCCACGTCG AGGCGACGCT AGCGTTCGCG
GCGGGTCCGA CGGTCAGGCT GACCGCGAGC TTCTACGCGC CCCACCGGAG CCGGGAGTTC
TACGGGCTGG AGCTCCACGG CGACGACGGC TCGCTGTACC TCAAGGGAAC CGGCGCGATG
GAGACCGGCC GCGATCACGT CCGGTTCGGC CGCGTCGGAC GGGAGTACGT GAGCGCGCCG
CCGACCTCTC CCGAGGAGCC GTACGAGTAC GTCGGCGCCG TCGAGCGCCT CGCCGCGACG
ATCGAGGCGG GCTCGCCGTC CCGGGCCGGC GGCCGCCGGG GCGCCCACGT CGTCGCCGTC
TGTAACGCGA TCGAGGCGGC CGCCGGGGGC GAGGGGCCAG TCGTCGTCGA CGACTGCGGA
GCGACCGCCG ACCCGCCGCC GGCGCCGGTC GTCAGACCGG CGACGACCAC CGAGAGCGAC
GCGGGACGCG AGAGCGAGAA GGGCCCGGGC GCTGCCGCGA TCCGGCTCCC CGCGGTCGGA
TTCGGCTGCT CGCGCTACCG CGACGGAGAG TACGTCGACC GAGTCGACTC GATCGCGACC
GCGCTCGACG CCGGGTACCG CCTCCTCGAC TCCGCCGAGC TGTACGGGAA CGAACACCGG
ATCGGGGAGC TGCTCGCCGC GCCGGGCGCG CCGGACCGCG AGCGCGTGTT CCTCCTCGGA
AAGGCGTGGC GTACCAATCA CCGCCGCGAA CACCTGCTCG CCGCCTGCGC TGGCAGCCGC
GAGGAGCTGG GGATCGACGC GTTCGACTGC TACGCGCTCC ACTGGCCGTC GGCGCTCGAA
CACCGGGGCG AGCTGAATCG TCTCGCCGAG AAGCCGGTTG AGCGACAGGA GACGCTCACC
TTTCCAGAGG GAGAGGACGG CGAGCCCGCG ACCGCCGACG TGGCGCTCGC GACGGCGTGG
CGGAACCTGG AGGCCGTCCA CGAGCGGGGG TGGGCCCGGA CGCTCGGGAT CTGTAACGTC
TCGCGGGCGC AACTGGAGAC AGTGCTGGAG ACGGGCGAGA TCGACCCGGC GCTTGTACAG
GTGGAGCGGC ACCCGTATCG ACCCCGGAAC GGGCTCGTCG AGCTCTGTCA CGGGCGGGGA
ATCCGCGTCG TCGCGCACTC GCCGCTGTCG GCGCCCGGCC TCCTCGACGA GCCCGTCCTG
AACGCGATCG GAACGGAGCG AGGGCTCTCA CCCGCCGAAG TCGTCATCGC GTGGAACGCT
TCTCAAGGAG TCGTCCCGAT CCCGTCCAGC ACCGCCGAGT CGCACGTCGT CTCGAACCTG
GCCGCCGGGA GCGAGCGACT GACTGCTGAC GAGGTCGCGC GCATCGATGC CCTGCGGGAT
CCGAACTTCG AGCGGTAG
 
Protein sequence
MNCLFVGAGS IAPEYAAGLS GSSLSLAGVV DLDADRAAAL AADHDCPSFT DLETALSAVD 
APLVVNLTSH AAHAPVTRTA LEADRHVYSQ KPLALDADEA KALVALARER DLGLGCAPGT
PRAPSQRRAG RLLADGRLGP VGLGYAHAHV GRVTDWHDRP DSFLEIGPLY DGAVYPLALL
ASWFGPVERV RVADALDVWP EREDRRPSTP SHVEATLAFA AGPTVRLTAS FYAPHRSREF
YGLELHGDDG SLYLKGTGAM ETGRDHVRFG RVGREYVSAP PTSPEEPYEY VGAVERLAAT
IEAGSPSRAG GRRGAHVVAV CNAIEAAAGG EGPVVVDDCG ATADPPPAPV VRPATTTESD
AGRESEKGPG AAAIRLPAVG FGCSRYRDGE YVDRVDSIAT ALDAGYRLLD SAELYGNEHR
IGELLAAPGA PDRERVFLLG KAWRTNHRRE HLLAACAGSR EELGIDAFDC YALHWPSALE
HRGELNRLAE KPVERQETLT FPEGEDGEPA TADVALATAW RNLEAVHERG WARTLGICNV
SRAQLETVLE TGEIDPALVQ VERHPYRPRN GLVELCHGRG IRVVAHSPLS APGLLDEPVL
NAIGTERGLS PAEVVIAWNA SQGVVPIPSS TAESHVVSNL AAGSERLTAD EVARIDALRD
PNFER