Gene Hlac_3373 details

Gene Information

Locus tag: Hlac_3373
Symbol:
ID: 7402226
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012030
Strand:
Start bp: 126560
End bp: 128158
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 64%
IMG OID: 643709922
Product: Aldehyde Dehydrogenase
Protein accession: YP_002567488
Protein GI: 222481252
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
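
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 126560
END_BP = 128158


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_bp: int) -> int:
    """Residues encoded, assuming the final codon is a stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1599
print(protein_length(length_bp))  # 532
```

Both results agree with the Gene Length and Protein Length fields in the record.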


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACTCA GTCAATTCGA GAACGAACTT ACGATTCACG AGCACACGCA GGCGGGGACG 
CTGGACGAGT TTCACCGTGC CTACGAGGCC GAAGTCGACG ACATCCGCTC CGATTTCGGG
GCGACTCACC CTCTCCGGAT CGACGGCGAC GCGGTCGAGA CCGGAGAGAC ATTCACCGTC
ACGAATCCGG GAGACACGGA CCAGGTCCTC GGGGAGTTCG CAGCGGGAGA TGAGACGCAC
GTCGACGAGG CCGTCGCGGC CGCGAGCGAC GCCTTCGACG AGTGGAAGGA GACGTCCTGG
GAGGAGCGCG TCGCGATATT TCGCGACGCC GCGGACGTCA TCCAGGACCG CAAACTCGAG
ATCACAGCGT TGATGGCCTA CGAAAATGCA AAAACACGGA ACGAGGCGAT TGCGGAGGTC
GACGAGGCGA TCGACTTCCT CCGGTACTAC AGCAGTGAAC TGGAACGGAA CGAGGGATAC
ACCGCCGACA CACATGAGCC AACACCTGGC CAGCGCTGCG TCAGCGACCT CCAGCCGTAC
GGCGTCTTCG GCGTTGTGGC CCCGTTCAAT TTTCCGTTCG CGATCACCGT CGGAATGACA
ACCGGCGCGC TGATCACCGG AAACACCGCA GTCGTGAAGC CGGCGAGCAC CACGCCGCTG
ACGGCGCACG CGTTCTACGA CGCCCTCGCG GAGGCGGGCA TTCCGGACGG CGTCGTCAAC
CTGGTCACGG GTGGCGGGCG GGCGGTCGGT CAACCGATGA TCGAACACGA GGACGTCGCC
GGATTCGTGT TTACGGGCTC TCGCGAGGTC GGACTCGAGA TCCAGCGGAC CTTCGACGAG
CTGGGCAAAC GCGGGCCAGT CGTCGCGGAG CTCGGCGGGA AGAACCCGGT CATCGTCTCC
GACAGCGCCG ATGTCTCGAA GGCCGTCTCT GGCGTGAAGT TTGGTGCGTT TTCGTTCAGC
GGTCAGAAGT GCTCTGCGAC CTCCCGCGTA TACGTCCACG AGGACATCGC CGACGAGTTC
ACGGAGCAAC TCGTCGAGGA GACGAACGAC CTCTCCATCG GCAAGCCCGA GAACCGGGAG
ACGGTCGTCT CTCCCCTGAT CGACGACAGC GCGATCGAGC GCTACGACGA TATCTGTGAA
ACGGCGGCCG CGGACGGCAC GGTCCTGACC GGCGGGAGCC GCATCGACCG AGAAAACCTC
CCGACCGGCC GGTACGTCGA GCCGACCGTG GTCACGGACA TTCCGCACGA TCACGCGCTC
GCGACGGACG AGCACTTCCT CCCGTTCGTT ACTATCCACC CCGTCTCGAG CCTCGAGGAA
GGGATTACGA AGGCCAACGA CAGCGATTAC GGACTCTGTG CTGGCCTCTT CTCCGAGGAC
GAGGACGAGA TCGACACGTG GTTCGACCGG ATCGAGTCCG GGATGTGCTA CGTGAACCGC
GAGCAGAGCG CGACGACCGG TGCGCTCGTC GAGGCCCAAC CGTTCGGCGG CTGGAAGTAC
TCCGGGACGA CCGGGAAATT CGCGGGCGGT CCGTGGTACC TCCAGCAGTT CATGCGTCAG
CAGAGTCGGA CTGTGGTCGG CGACGTCGGA CAGCCCTGA
 
Protein sequence
MTLSQFENEL TIHEHTQAGT LDEFHRAYEA EVDDIRSDFG ATHPLRIDGD AVETGETFTV 
TNPGDTDQVL GEFAAGDETH VDEAVAAASD AFDEWKETSW EERVAIFRDA ADVIQDRKLE
ITALMAYENA KTRNEAIAEV DEAIDFLRYY SSELERNEGY TADTHEPTPG QRCVSDLQPY
GVFGVVAPFN FPFAITVGMT TGALITGNTA VVKPASTTPL TAHAFYDALA EAGIPDGVVN
LVTGGGRAVG QPMIEHEDVA GFVFTGSREV GLEIQRTFDE LGKRGPVVAE LGGKNPVIVS
DSADVSKAVS GVKFGAFSFS GQKCSATSRV YVHEDIADEF TEQLVEETND LSIGKPENRE
TVVSPLIDDS AIERYDDICE TAAADGTVLT GGSRIDRENL PTGRYVEPTV VTDIPHDHAL
ATDEHFLPFV TIHPVSSLEE GITKANDSDY GLCAGLFSED EDEIDTWFDR IESGMCYVNR
EQSATTGALV EAQPFGGWKY SGTTGKFAGG PWYLQQFMRQ QSRTVVGDVG QP
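
The sequences above are laid out in blocks of 10 characters, so the spaces and line breaks need to be removed before the text is used in other tools. The sketch below shows one way to do that and to compute a GC fraction from the cleaned nucleotide string. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = (
    "ATGACACTCA GTCAATTCGA GAACGAACTT ACGATTCACG AGCACACGCA GGCGGGGACG"
)

seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Applying the same two functions to the complete gene sequence would give a value that can be compared with the GC content field of the record.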