Gene Hlac_0997 details

Gene Information

Locus tag: Hlac_0997
Symbol:
ID: 7401892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 990229
End bp: 991440
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 72%
IMG OID: 643708063
Product: iron-containing alcohol dehydrogenase
Protein accession: YP_002565664
Protein GI: 222479427
COG category: [C] Energy production and conversion
COG ID: [COG1454] Alcohol dehydrogenase, class IV
TIGRFAM ID:
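
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the unspliced CDS coordinates include the terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(990229, 991440)
print(length_bp)                  # 1212
print(protein_length(length_bp))  # 403
```

Both values agree with the Gene Length and Protein Length fields listed in the record.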


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTACCGA TCGCAGACTC CTTCGAACAC GACTATCAGG GCTGTGAGAT CCGGTACGGC 
CGCGGGCGCG TCGCCGAACT CGGCGATGCC CTCGACGAAC GGGAACTCGG TGACGCCCTC
GTCGTCTGCG GCTCGAACGT CGGCGCCAAC GAGGACCTGA TGGACCCGAT ACGCGAAGGG
CTCGGTGACC GGCTCGCAGG GGTCTTCGAC GGGACGACCC CGGACAAGCG CGTCGAGACG
GCGTTCGATC TGCTCGATCG ACGGGCCGAG GTCGGCGCAG ACGCTCTCGT CGCGGTCGGC
GGCGGGAGCA GCCTCGACAT CGCGCGGCAG GCGACGCTCC TCGATGTCGA CGGGCGGGAC
CTCGCTGACC TTCGCGCGGA CGCTGAGGTC GGGGCGGACG CGCTCGGCGA CCTCGCACCC
AGGACCGACC CCGCGCTCCC CGTCGTCGTG ATTCCGACGA CGTTCGCGGG CGCAGACGTC
TCGACAGGCG GCTCACTGGA GGTGCTCGAC GCGGACGCCT CGCCCACCGG CCAGCCGATG
ACGGTCAGCG GCGGGGGCGC GATGCCCGCG ATCGACCTCG CGGACCCGGC GCTGTTCGAG
ACGACCCCGC AGTCGGTGCT GGCGGGCTCG GCCATGAACG GATTCAACAA GGGGATCGAG
ACGCCGTACG CTGCCGACGC TTCGCCCGTG AGCGACGCGA CCGCGGTCCA CGGGACGCGG
CTCCTGCGGG ACGCGCTCCC GCACGTCGCC GGCGACCGGC CCGACGATCC GGCGGCGACC
GACCGCGCCG TGGTCGGCGC GCTGCTCGTC CAACTCGGAC GGAAGATCTC GGTGATTCAC
GCGTTCGGCC ACGGCTTCGC GCGTCGGTAC GACGTACAGC AGGGGACCGT CCACGCGGTG
GTCGCGCCGC ACGTCCTCGC GTACCTCTTC GACGAGGTGG ACGCGAGCCG GCGGGCGCTC
GCGAACGGGC TCGGCGTCGC GACCGCGGGC CGCGACGACG CCGCGATCGC CGAGGACGTG
GTTAGCGAGG TCGCCGCGGT CCGCGACTCC CTCCCGGTCC CCTCGCGGCT CCGCGAGCTG
GACCCGGTCG ACGAAGACGA TTTCCCCGCG ATCGCCGAGT ACATCGCCGA CGACTGGTCG
ATGGAACAGG CCCCCGCCGA CCTCGACGCG ACGCCCGAAG CGATCGAGGG TGTGCTACGC
GAGGCGTGGT GA
 
Protein sequence
MLPIADSFEH DYQGCEIRYG RGRVAELGDA LDERELGDAL VVCGSNVGAN EDLMDPIREG 
LGDRLAGVFD GTTPDKRVET AFDLLDRRAE VGADALVAVG GGSSLDIARQ ATLLDVDGRD
LADLRADAEV GADALGDLAP RTDPALPVVV IPTTFAGADV STGGSLEVLD ADASPTGQPM
TVSGGGAMPA IDLADPALFE TTPQSVLAGS AMNGFNKGIE TPYAADASPV SDATAVHGTR
LLRDALPHVA GDRPDDPAAT DRAVVGALLV QLGRKISVIH AFGHGFARRY DVQQGTVHAV
VAPHVLAYLF DEVDASRRAL ANGLGVATAG RDDAAIAEDV VSEVAAVRDS LPVPSRLREL
DPVDEDDFPA IAEYIADDWS MEQAPADLDA TPEAIEGVLR EAW
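
The relationship between the gene sequence and the protein sequence can be illustrated with a short script. This is a sketch under stated assumptions, not the database's own method: it assumes the sequence is given 5' to 3' on the coding strand, strips the spaces used to group bases into blocks of ten, and uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for block formatting and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGCTACCGA TCGCAGACTC CTTCGAACAC GACTATCAGG GCTGTGAGAT CCGGTACGGC"
print(translate(first_line))  # MLPIADSFEHDYQGCEIRYG
```

The printed residues match the first two blocks of the protein sequence above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field in the record.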