Gene Hlac_1152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1152 
Symbol 
ID7400961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1159896 
End bp1161416 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content66% 
IMG OID643708217 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002565816 
Protein GI222479579 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.999135 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.730796 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGACC CGTATCAACA CTACATCGAC GGCGAATGGG TCTCGGGCAC TGGCTCGGAG 
ACCTTCGAGA GCGAGAACCC GGCGACCGGC GAGACGCTCG GCACGTTCGA GCGCGGGACG
CCCGAAGACG TCGAGTCGGC GGTCGACGCC GCCGACGCGG CGTTCGAGGA GTGGCGCGAG
CTCTCCCGGA TCCAGCGCGC GGAGTACCTC TGGGACGTGT ACCACGAGCT GCGCGAGCGC
ACTGACGAGC TCGGCGAGGT CGTCACGAAG GAGTGCGGCA AGGAGATCTC GGAGGGGAAA
GCCGACGTGG TCGAGGCCGC ACATATGGTC GAGTGGGCCG CGGGCGACGC CCGCCACCCG
AAAGGCGACA TCGTTCCCTC GGAGATCCCC GCGAAGGACG CGTACATGCG CCGGAACCCC
CGCGGCGTCA CCGGGTGTAT CACGCCGTGG AACTTCCCGG TCGCGATCCC CTACTGGCAC
ATGGCCATCG CGCTGGTGGA GGGGAACCCC GTCGTGTTCA AACCCGCCGA GCAGACCCCG
TGGTGCGCGC AGATCATCGC GGAGATGTTC GACGACGCCG GCATCCCCGA CGGCGTGTTC
AACATGGTTC AGGGGTTCGG CGACGCCGGC AACGCCATCG TCGAGCACGA CGACGTCAAG
ACCGTCCTTT TCACCGGTTC CGCCGAGGTC GGCCATCACA TCCAAGACAA GCTCGGCGGC
GTCGCCGGCA AGCGCGTCGC CTGTGAGATG GGCGGCAAGA ATGCGATCGT CGTCACCGAA
GAGGCGGATC TCGATATCGC AGTTCACTCG GCCGTGATGT CTTCCTTCAA GACGACCGGC
CAGCGCTGCG TCTCCTCCGA GCGCCTGATC GTCCACACGG ACGTGTACGA CGAGTTCAAA
GAGCGATTCG TCGAGGTGGC CGAGGACGTC GCCGTCGGCG ACCCTCTACA AGAGGACACG
TTCATGGGGC CGCTCATCGA GACCGAGCAC TTCGAGAAGG TCTCCGAGTA CAACCAACTC
GCCCGCGACG AGGACGTGAA CGTGCTCGTC GACCGGACCG AGCTCGACGC CGACGAGGTC
CCCGACGGTC ACGAGGACGG CCACTGGATC GGCCCGTTCG TCTACGAGGC CGACCCCGAC
GAGGACCTCC GCTGCACGCA CGAGGAGGTG TTCGGGCCGC ACGTCGCCCT TCTCGAATAC
GACGGCGACA TCGAGCGCGC GGTCGAGATC CAGAACGACA CCGAGTACGG GCTCGCCGGC
GCGGTCGTCT CCGAGGATTA CCGCCAGATC AACTACTACC GCGACCACGC CGAGGTGGGG
CTGGCGTACG GCAACCTCCC CTGTATCGGC GCAGAGGTTC AGCTCCCCTT CGGCGGCGTC
AAGAAGTCCG GGAACGGCTA CCCCTCGGCG CGAGAGGCCA TCGAGGCGGT CACTGACCGC
ACCGCGTGGA CCCTGAACAA CTCGAAGGAG ATCGAGATGG CACAGGGGCT CTCCGCGGAC
ATCAAGACGA AGGACGACTG A
 
Protein sequence
MSDPYQHYID GEWVSGTGSE TFESENPATG ETLGTFERGT PEDVESAVDA ADAAFEEWRE 
LSRIQRAEYL WDVYHELRER TDELGEVVTK ECGKEISEGK ADVVEAAHMV EWAAGDARHP
KGDIVPSEIP AKDAYMRRNP RGVTGCITPW NFPVAIPYWH MAIALVEGNP VVFKPAEQTP
WCAQIIAEMF DDAGIPDGVF NMVQGFGDAG NAIVEHDDVK TVLFTGSAEV GHHIQDKLGG
VAGKRVACEM GGKNAIVVTE EADLDIAVHS AVMSSFKTTG QRCVSSERLI VHTDVYDEFK
ERFVEVAEDV AVGDPLQEDT FMGPLIETEH FEKVSEYNQL ARDEDVNVLV DRTELDADEV
PDGHEDGHWI GPFVYEADPD EDLRCTHEEV FGPHVALLEY DGDIERAVEI QNDTEYGLAG
AVVSEDYRQI NYYRDHAEVG LAYGNLPCIG AEVQLPFGGV KKSGNGYPSA REAIEAVTDR
TAWTLNNSKE IEMAQGLSAD IKTKDD