Gene Hlac_3047 details

Gene Information

Locus tag: Hlac_3047
Symbol:
ID: 7399021
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012028
Strand:
Start bp: 305758
End bp: 307284
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 59%
IMG OID: 643706854
Product: Aldehyde Dehydrogenase
Protein accession: YP_002564476
Protein GI: 222475955
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
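
The length fields can be cross-checked against the coordinates. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both are assumptions about this database's conventions, not something the record states. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    return gene_len_bp // 3 - 1


length = gene_length(305758, 307284)
print(length, protein_length(length))  # prints "1527 508"
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields of the record.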


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAGTTG ATGACCAATC ACAGACACCA TCGGAGCGTA AATCCGCGAT TAAAAAGCGT 
CACGAGCAGG CCGCAAGCGA GGTGCTACCC GACCATCGAG AACTCTACAT CGGCGGTGAG
TGGGTCCAGA GTGCCTCCGG CGAGACGTTC ACAACAGTCG ACCCGACAAC CGGTGAGACA
CTCGCCGAGG TAGAGGCAGG CAACGGCGAG GACATCGACC GCGCGGTCGA CGCGGCCTGG
GAGGCCTACG ATGAGGTGTA CAGCTCCTAT TCGAGTGCCG AACGACAGGC GATGCTCGAA
GCGATTGCCG ACCGTATCGA GAACAACGCA GACGAGTTCG CCCGACTGGA GTCCCTCGAC
AATGGGAAAC CAATTACCGA GGCCCGCATC GATATCGAAC TCGTCGTCGA CCACTTCCGC
TATTTCGCGG GCATCGCTCG GGCCCACGAG GGACGGACGG TCGACACTGA CGACAGTCGA
CACGTCCAAA CCATCGAAGA ACCCTACGGC GTCGTCGGCC AAATTATTCC GTGGAATTTC
CCGCTTTTAA TGGCTGCCTG GAAACTCGGC CCTGCGCTGT CGGCTGGCAA CACAGTCGTC
CTCAAACCGG CCGAGGAAAC ACCCCTTTCC GTTCTCAAAC TGATGGAGGA GGCCGACGAC
GTGATCCCAG ACGGTGTCGT CAACATCGTC ACCGGGTTCG GTCCCGATGC TGGCGAACCG
CTTTCGAACC ACAGCGGCAT CCGGAAACTC GCCTTTACCG GGTCGACCGA AATCGGCAGC
AAGGTGATGA AAAGCGCCGC CGACAACATC ACCGACATCA CGCTCGAACT GGGTGGCAAA
AGCCCGCTCG TCGTGTTCCC CGATGCGGAC TTAGAGCAGG CAGTCCAGAC CACGATCACC
GCCATCTTCT TCAATACCGG CGAGTGCTGC TGTGCGGGTT CACGACTCTT TGTCCACGAA
GACATCAAAG ATGAGTTCCT CGACGAACTC GCGGCGGCCG CCGAAGATCT GACCGTCGAC
GATCCACTGC TGGATGCGAC TGATCTCGGC CCGAAGGTGA CCGCTGAACA GGTCGAACGA
ACCATGAGCT ACATCGAAGA GGCCGAACAG TCCGGGGCGG CCTTTGTCAC CGGCGGCAGC
CAGCCCGACG ACGAAGCCCT GTCGGACGGC TGTTTCGTTG CGCCAACACT GATCGATAAC
ATCGATCACG ACAGTAAGGC CGTCCAAGAG GAGATTTTCG GCCCCGTCCA AGAGGTGTTC
TCGTGGAGCG ACTACGACGA GATGATCGAG TTGGCGAACG ATGTCGACTA CGGGCTCGCA
GCTGGCGTGA TCACCGAGAA CCTCACGAAG GCCCACCAGT GTGCCAAAGA CATCGAGGCC
GGCAACATCT GGATCAACAC GTACAACGAC TTCCCAGCTG GCCAGCCGTT CGGCGGCTAC
AAGCAATCAG GAATCGGCCG TGAAATCGGT CAAGACGCCG TCGACCACTA CACTCAGACC
AAGACGATCA ACATCAGTCT CAGCTAA
 
Protein sequence
MSVDDQSQTP SERKSAIKKR HEQAASEVLP DHRELYIGGE WVQSASGETF TTVDPTTGET 
LAEVEAGNGE DIDRAVDAAW EAYDEVYSSY SSAERQAMLE AIADRIENNA DEFARLESLD
NGKPITEARI DIELVVDHFR YFAGIARAHE GRTVDTDDSR HVQTIEEPYG VVGQIIPWNF
PLLMAAWKLG PALSAGNTVV LKPAEETPLS VLKLMEEADD VIPDGVVNIV TGFGPDAGEP
LSNHSGIRKL AFTGSTEIGS KVMKSAADNI TDITLELGGK SPLVVFPDAD LEQAVQTTIT
AIFFNTGECC CAGSRLFVHE DIKDEFLDEL AAAAEDLTVD DPLLDATDLG PKVTAEQVER
TMSYIEEAEQ SGAAFVTGGS QPDDEALSDG CFVAPTLIDN IDHDSKAVQE EIFGPVQEVF
SWSDYDEMIE LANDVDYGLA AGVITENLTK AHQCAKDIEA GNIWINTYND FPAGQPFGGY
KQSGIGREIG QDAVDHYTQT KTINISLS
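
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own tooling: it builds the codon table from the standard amino-acid string shared by NCBI translation tables 1 and 11, strips the spaces used to group the sequence into blocks of ten, and stops at the first stop codon. It does not handle the alternative start codons that table 11 permits, which is sufficient here only because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (sense codons of tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGTCAGTTG ATGACCAATC ACAGACACCA"))  # prints "MSVDDQSQTP"
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) gives a quick consistency check between the two listings.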