Gene Hlac_3048 details

Gene Information

Locus tag: Hlac_3048
Symbol:
ID: 7399022
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012028
Strand:
Start bp: 307438
End bp: 308505
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 62%
IMG OID: 643706855
Product: Alcohol dehydrogenase zinc-binding domain protein
Protein accession: YP_002564477
Protein GI: 222475956
COG category: [R] General function prediction only
COG ID: [COG1064] Zn-dependent alcohol dehydrogenases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGCAG TACTCCTTGA GAGCTTCCAA GAACCGTTGA CGGTCCAAGA CGTCGACCGA 
CCGGAACCCG ACCCCGACGG CGCCGTCGCC GAGGTCATCG GCTGTGGCGT CTGTCGCTCG
GACTGGCATT GCTGGCAGGG CGATTGGGAC TGGTTCGGCT ACCGCCCCGA TCCGCCACAC
GTGCTCGGTC ACGAACCAAC CGGTCGCATC GTCGCTACCG GCGAGGACGT TGAGAGCATC
GAAGAGGGAC AAGAGGTCGC CATCCCGTTC AACTTCGCCT GTGGCAGCTG CGATATGTGT
CGCAACGGCC GCGAGAACAT CTGTGAGAAC CACATCGGCC TCGGGTTTAT GAATCAGGCT
CCCGGCGCGT TCGCCGAAGA AGTCCACATC CCCAACGCCG ATATCAACGC AGTCCCGCTG
CCCGACAGCA TCGATGCCGA AGCCGCCGCC GGCTGTGGCT GTCGGTTCAT GACCTCATTC
CACGCAATGG CCCACAGAGC TCCGGTTAGC GCTGGCGACG ACGTCGTCAT CCACGGCTGT
GGCGGGATCG GCCTCTCGGC GGTCCACATC GCCAACGCAC TCGGTGGGAA CGTCATCGGC
GTCGACCTGA TGGACGAGAA ACTGGACAAG GCCGAGGAAC TCGGTGCTGT CGACACTGTC
AACGCTAGGG AGGTCGACGA TGCCGCGGCC GAAGTCCACG ATATCACCAA CGGTGGCGCG
GACGTTTCGG CCGATGCGTT GGGTATTGCA ACCACCTGCC GGAACGCGGT GAACAGTCTC
CGCAAAGGTG GCACCCACGT CCAGATCGGG CTGACCACCT CCGAAGAGGA GGGGATGGTG
TCGCTGCCGA CCGACGAAAT CGTCGCCAAG GAAATCGAGT TTAAGGGATC ACTCGGCCTC
CAGCCCTCCC GATACAGCGA GATGCTGGAT ATGATCGAGT CCGGCAAACT CGATCCGACA
ACGCTTGTCG AGAAGAAGAT CGACATCCAC AGTGTGCCGG ACGAACTGGC CGCCATGAGC
GACTACGACA CGCTCGGCAT TCCCGTCTGC AACGAGTTCA GTAGCTAA
 
Protein sequence
MQAVLLESFQ EPLTVQDVDR PEPDPDGAVA EVIGCGVCRS DWHCWQGDWD WFGYRPDPPH 
VLGHEPTGRI VATGEDVESI EEGQEVAIPF NFACGSCDMC RNGRENICEN HIGLGFMNQA
PGAFAEEVHI PNADINAVPL PDSIDAEAAA GCGCRFMTSF HAMAHRAPVS AGDDVVIHGC
GGIGLSAVHI ANALGGNVIG VDLMDEKLDK AEELGAVDTV NAREVDDAAA EVHDITNGGA
DVSADALGIA TTCRNAVNSL RKGGTHVQIG LTTSEEEGMV SLPTDEIVAK EIEFKGSLGL
QPSRYSEMLD MIESGKLDPT TLVEKKIDIH SVPDELAAMS DYDTLGIPVC NEFSS
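
Consistency check (illustrative)

The numeric fields in this record can be cross-checked against each other. The sketch below is not part of the database entry; it is a minimal example that assumes the start and end coordinates are 1-based and inclusive, and that the reported protein length excludes the stop codon. The helper names (clean, span_length, expected_protein_length, gc_percent) are hypothetical and chosen only for this illustration.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block a sequence into groups of ten."""
    return "".join(seq.split()).upper()


def span_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


if __name__ == "__main__":
    # Values copied from the Gene Information fields of this record.
    gene_len = span_length(307438, 308505)
    print(gene_len)                           # 1068
    print(expected_protein_length(gene_len))  # 355
```

To compare against the reported GC content, paste the full gene sequence into a string and pass it to gc_percent; clean() discards the grouping whitespace first, so the blocked layout shown above can be used unchanged.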