Gene Hmuk_2343 details

Gene Information

Locus tag: Hmuk_2343
Symbol:
ID: 8411884
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2256410
End bp: 2257939
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 67%
IMG OID: 645020686
Product: Aldehyde Dehydrogenase
Protein accession: YP_003178162
Protein GI: 257388389
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
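
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 2256410
end_bp = 2257939

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not encode a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the calculation gives 1530 bp, a remainder of 0, and 509 aa, matching the Gene Length and Protein Length fields.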


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0135718
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCGAG ACAGCGAGGT CCCCGGCCGC CACTACATCG ACGGTGAGTG GACAGACGGC 
GACAGTACGA CGATCTTCGA GAGTCGCGAG CCCGCGACCG GCGAGGTGCT GGCCGAGTTC
CAGCGCGGAA CGAGCGACGA CGTAGCCAGC GCCGTCGCCG CCGCCGAGGC CGCCGCCGAC
GAGTGGCAGT CCCTCTCCTA CATCGATCGC GCGGAACACC TCTGGGAGAT CTTCCACGAG
CTTCGAGACC GCACCGACGA ACTCGGCGAA GTCGTCAGCA GGGAGTGCGG GAAAGAACTC
TCCGAGGGCC GAGCCGACGT GGTCGAGGCC TACCACATGG TCGAGTGGGC CGCGGGCAAC
GCCCGCCACC CCCACGGCGA CGTCGTTCCC AGCGAGGTCG CGAGCAAGGA CGCCTACATG
CGGCGCAAGC CCCGCGGCGT CGTCGGCTGC ATCACGCCCT GGAACTTCCC CGTCGCCATC
CCGTTCTGGC ACATGGCGGT CGCGCTCGTC GAGGGCAACA CCGTCGTCTG GAAACCCGCC
GAACAGACAC CGTGGTGTGC CCAGATCGTC GCGGAGATGT TCGACGACGC GGGGATCCCC
GACGGCGTGT TCAACCTCGT TCAGGGGTAC GGCGACGCCG GCAACGCCGT CGTCGAGGAC
GACCGCGTCG ACACCGTCCT CTTTACGGGC TCTGCGGCGG TCGGCCACGA GGTAGCCGGC
AAGATCGGCG GCCGGCCGGG CCGGAGCGCG TGTTGCGAGA TGGGCGGCAA GAACGCGGTC
GTCGTCACCG ACACCGCCGA CCTGGATATC GCGGTCCACT CGGCGGTGAT GTCCTCGTTC
AAGACGGCCG GCCAGCGCTG TGTCTCCGCG GAGCGCCTGA TCGTCCACGA AGACGTGTAC
GACGAGTTCA AGCGCCGCTT CGTCGACCTC GCGGAAGACG TGACGGTCGG AGATCCCCTC
GACGAGTCGA CGTTCATGGG CCCGGTCGTC GACCAAGAGC AAGTCGAGAA GTTCGCGGAG
TACAACGAAC TGGCCCGACA GGAGGGTGCG ACAGTGCTCG TCGACCGGGA GGAGCTGTCG
CCCGACGCGA TCCCCGACGG CCGCGAGGAG GGCAACTGGG TCGGACCCTT CGTCTACGAG
ATGGAGTACG ATCCCGAACT GCGCTGTCTC ACCGAAGAGG TGTTCGGCCC CCACGTCGCC
CTGCTGGAGT ACTCCGGCGG GGTCGAACGC GCAGTCGAGA TGGTCAACGC GACGCCGTAC
GGCCTCGCCG GTGCGATCGT CGCCGAGGAC TACCGCCAGA TCAACTACTT CCGAGACAAC
GCCGACCTCG GACTGGCCTA CGGGAACCTC CCGTGTATCG GCGCGGAGGT TCACCTCCCC
TTCGGCGGCG TCAAGAAGTC CGGCAGCGGC TCGCCACACG CCCGCGAGGT GATCGAGGCG
GTCACCGAAC GAACGGCCTG GACGATCAAC AACTCGAAGG AGATCGAGAT GGCACAGGGG
TTGTCGGCTG ATATACGGAC CGAGGAGTGA
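
The gene sequence is displayed in blocks of ten bases separated by spaces, so the blocks need to be joined before any analysis. A minimal sketch, assuming the text has been pasted into a Python string: `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the final line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Final line of the gene sequence shown above.
last_line = "TTGTCGGCTG ATATACGGAC CGAGGAGTGA"
tail = clean_sequence(last_line)

print(len(tail))          # number of bases on this line
print(tail[-3:])          # last codon of the gene
print(gc_fraction(tail))  # GC fraction of this line only
```

Running the same helpers on the complete sequence gives a length and GC fraction that can be compared with the Gene Length and GC content fields of the record.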
 
Protein sequence
MTRDSEVPGR HYIDGEWTDG DSTTIFESRE PATGEVLAEF QRGTSDDVAS AVAAAEAAAD 
EWQSLSYIDR AEHLWEIFHE LRDRTDELGE VVSRECGKEL SEGRADVVEA YHMVEWAAGN
ARHPHGDVVP SEVASKDAYM RRKPRGVVGC ITPWNFPVAI PFWHMAVALV EGNTVVWKPA
EQTPWCAQIV AEMFDDAGIP DGVFNLVQGY GDAGNAVVED DRVDTVLFTG SAAVGHEVAG
KIGGRPGRSA CCEMGGKNAV VVTDTADLDI AVHSAVMSSF KTAGQRCVSA ERLIVHEDVY
DEFKRRFVDL AEDVTVGDPL DESTFMGPVV DQEQVEKFAE YNELARQEGA TVLVDREELS
PDAIPDGREE GNWVGPFVYE MEYDPELRCL TEEVFGPHVA LLEYSGGVER AVEMVNATPY
GLAGAIVAED YRQINYFRDN ADLGLAYGNL PCIGAEVHLP FGGVKKSGSG SPHAREVIEA
VTERTAWTIN NSKEIEMAQG LSADIRTEE
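
The protein sequence corresponds to reading the gene sequence codon by codon. The sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in the permitted start codons, which are not modelled here). `translate` is a hypothetical helper, and only the first and last lines of the gene sequence are translated.

```python
from itertools import product

# Codons in TCAG order paired with the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string; whitespace is ignored, '*' marks a stop codon.

    Raises KeyError for codons with bases other than A, C, G, T.
    """
    seq = "".join(seq.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence from the record.
first_line = "ATGACTCGAG ACAGCGAGGT CCCCGGCCGC CACTACATCG ACGGTGAGTG GACAGACGGC"
last_line = "TTGTCGGCTG ATATACGGAC CGAGGAGTGA"

print(translate(first_line))  # MTRDSEVPGRHYIDGEWTDG
print(translate(last_line))   # LSADIRTEE*
```

The two outputs match the first twenty and last nine residues of the protein sequence, with the trailing `*` marking the TGA stop codon.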