Gene Hlac_0522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0522 
Symbol 
ID7400403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp543071 
End bp544087 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content71% 
IMG OID643707587 
ProductAlcohol dehydrogenase zinc-binding domain protein 
Protein accessionYP_002565194 
Protein GI222478957 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0876145 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.61344 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACA CCGCGCTCTA CTTTACGGCC CCGGAGACCG TCGAGGTGCG GGAGACGGCG 
GTCGGCCCGC CGGCCGCAGA CGAACTCCTC GTCGACACCC GCGCGTCGGC GATAAGCGCC
GGGACGGAAC TGCTCGTGTA TCGCGACCAG ACGCCGGCCG ACCTCCCGGC CGACGAGACC
CTTGATGCGC TCGACGGGGA TCTGTCGTAC CCGCTCCGGT ACGGCTACGC CGCGAGCGGC
GTTGTCCGCG AGGTCGGTAG CGACGTCGAT CCGAACTGGG TCGGCCGGTC AGTGTTCTCG
TTCGTCCCGC ACCAAACGAG CTTCTGCGCG ACCCCCGACT CGGTGGTCGC ACTCCCGCCG
GAGACGACGC CGGCCGCCGG GTCGTTGCTC CCGTCGGTCG AGACCGCGAC GAACATCGTC
CTCGACGCCG CCCCTCGGCT CGGAGAGCGA GTCGTGGTGT TCGGTGCCGG GGTGATCGGG
CTCTGCGTCA CCCGACTGCT GGCCGCGTTT CCGCTGGAGT CGCTCGTCGT GGTCGACCCG
ATCGAGCGCC GCCGGGCGCT CGCCGCGGAG TTCGGCGCTG ACCGAACGAC GACGCCGACC
GAGCTCGGTG ACGCCGATCC CGCCGGCGCG GACCTCGCCG TCGACGGCGC CGATCTCGCG
ATCGAGCTGT CCGGCCAGCC GAGCGCGCTG GACGATGCGA TCGGGGTCGT CGGCTACGAC
GCGCGGATCG TCGTCGGCTC GTGGTACGGG ACCAAACGCG AGCCGATCGA TCTGGGCGGG
CGATTCCACC GGAACCGCAT CGACATCGTC TCCAGTCAGG TGTCGACGAT CAGCCCGGAA
CTGCGCGGCC GCTGGGACCG CGACCGGCGC ATGGACGCGG CGCTCGATCG GCTCGACTGG
ATCCCCGCCG ACGAGCTGAT CACCCACCGG ATCCCCTTCG AGCGCGCACC GGAGGCGTAC
GAGCTGCTCG ACTCGGCGCC CGACGACGCG GTACAGGTCA TCTTGGAGTA CGAGTGA
 
Protein sequence
MTDTALYFTA PETVEVRETA VGPPAADELL VDTRASAISA GTELLVYRDQ TPADLPADET 
LDALDGDLSY PLRYGYAASG VVREVGSDVD PNWVGRSVFS FVPHQTSFCA TPDSVVALPP
ETTPAAGSLL PSVETATNIV LDAAPRLGER VVVFGAGVIG LCVTRLLAAF PLESLVVVDP
IERRRALAAE FGADRTTTPT ELGDADPAGA DLAVDGADLA IELSGQPSAL DDAIGVVGYD
ARIVVGSWYG TKREPIDLGG RFHRNRIDIV SSQVSTISPE LRGRWDRDRR MDAALDRLDW
IPADELITHR IPFERAPEAY ELLDSAPDDA VQVILEYE