Gene Hlac_2240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2240 
Symbol 
ID7399950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2226634 
End bp2228082 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content68% 
IMG OID643709314 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002566887 
Protein GI222480650 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.455066 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.336013 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTGACC TTAATCAGAA CTACGTCGGC GGAGAGTGGG TCTCCTCGGA GACGGGAGAA 
ACGTTCGAGG TTCACAATCC CGCGGCTCCC GACGAGACGG TGGCGAGCTA TCAACAGTCC
AGCGCGGCGG ACGCCGCCGA GGCGGTCGAG GCGGCGGCCG ACGCACAGGA TGAGTGGGCG
ACCACGCCCG GTCCAGAGCG CGGTCGGATC CTCCGGAAGG CGGGGACGAT CCTCGCCGAT
CGGAAGGACG AGCTCACCGC GATGCTCGTC GAGGAGGAGG GGAAGGCCCG TCCCGAGGCG
GCCGGCGAGG TACAGCGCGC GATCGACATC TTCCACTACT TCGCGGGCAA GGCCTCGGAC
CTCGGTGGAA CGATGAAAGG GTCGAGCAGC CGCGATACGA CCCTCTACAC GCGCGAGGAG
CCGGTCGGTG TCGCGGCCCT GATCACGCCG TGGAACTACC CCATCGCCAT CCCGGTGTGG
AAGCTCGCCC CCGCGCTCGC GGCGGGCAAC TCCGTGGTCA TCAAGCCCGC GAGCGCTGCG
CCGGGCGTCG TGTTCGCTGT CACGGAGGCC TTAGACGAGG CGGGGCTCCC CGACGGCGTG
CTCAACGTCG TGACCGGGCC GGGGAGTTCG GTCGGCAACG AGTTCATCAC GAACGAGGGC
ACCGACGCCG TCTCCTTCAC CGGCAGCGGA CAGGTCGGCG AGATGGTCTA CGATCAGGCC
ACGGACGCCG GCAAGCGGGT GCAGACCGAG ATGGGCGGGA AGAACCCCAC GCTGGTCACC
GATTCGGCCG ACCCCGCGGA AGCCGCCGAG ATCGTCGCGA CCGGCGGCTT CGGGACGACC
GGCCAGTCGT GTACGGCGTG TTCCCGCGCC GTCGTTCACG AGGACGTGTA CGACGAGTTC
GTCGACGAGC TCGTCGACCG GGCCGAGGCC ATCGACATCG GCCCCGGGCT CGACCACGAG
ATGGGGCCGC AGGTCAGCGC CGACGAACTG GAGTCGACGC TCGAATACAT CGACGTAGCC
CGCGAGGAGG GCGCGACGGT CGCGGCGGGC GGCGAGAAGC CCACCGGCGA CGCGGTCGAG
AGCGGGCACT TCGTCGAGCC GACCGTCTTT ACCGACGTGG ACAACGACGA CACGATCGCG
TGTGAGGAAG TGTTCGGCCC GGTGGTCGCC GTCATCGAGG TCAGCGACTT CGACGAGGGA
CTCAACGTCG CCAACGACGT GGAGTACGGA CTCTCGGCCA GTGTGGTCAC CGACGACCAC
ACCGAGGCCA ACCGGTTTAT CGACGAGGCA GAGGCCGGCG TCGTGAAGGT CAACGAGAAG
ACGACCGGGC TCGAACTCCA CGTGCCGTTC GGCGGGTTTA AGCGCTCCTC GTCGGAGACT
TGGCGCGAGC AGGGCGACGC AGGGATGGAG TTCTACACGA TCGAGAAGAC CGTCTACGAC
AACTACTGA
 
Protein sequence
MADLNQNYVG GEWVSSETGE TFEVHNPAAP DETVASYQQS SAADAAEAVE AAADAQDEWA 
TTPGPERGRI LRKAGTILAD RKDELTAMLV EEEGKARPEA AGEVQRAIDI FHYFAGKASD
LGGTMKGSSS RDTTLYTREE PVGVAALITP WNYPIAIPVW KLAPALAAGN SVVIKPASAA
PGVVFAVTEA LDEAGLPDGV LNVVTGPGSS VGNEFITNEG TDAVSFTGSG QVGEMVYDQA
TDAGKRVQTE MGGKNPTLVT DSADPAEAAE IVATGGFGTT GQSCTACSRA VVHEDVYDEF
VDELVDRAEA IDIGPGLDHE MGPQVSADEL ESTLEYIDVA REEGATVAAG GEKPTGDAVE
SGHFVEPTVF TDVDNDDTIA CEEVFGPVVA VIEVSDFDEG LNVANDVEYG LSASVVTDDH
TEANRFIDEA EAGVVKVNEK TTGLELHVPF GGFKRSSSET WREQGDAGME FYTIEKTVYD
NY