Gene Csal_1140 details


Gene Information

Locus tag: Csal_1140
Symbol:
ID: 4027705
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 1300849
End bp: 1302426
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 66%
IMG OID: 637966317
Product: aldehyde dehydrogenase
Protein accession: YP_573195
Protein GI: 92113267
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
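
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that does not contribute a residue. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 1300849, 1302426

length = gene_length(start_bp, end_bp)
print(length)                  # 1578, matching "Gene Length"
print(protein_length(length))  # 525, matching "Protein Length"
```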


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATATTGG AAGGCAAGCA GATTATCGGT AACACCATCG AAGCCGGCCC CGGCGCCAGC 
TTCCAAGCCG TCGACCCTTC GAACGGAGAG ACGCTGCCCC CCGAATTCCT CAGCGCCGAC
AGCAAGCAGG TCGAACGTGC CTGTCAGCTC GCCTGGGACG CCTTCGATGC ATATCGGGAA
ACCTCGCTGG AAACGCGGGC CAAGTTCCTG GAAACCATCG CCGAGGAAAT CGAACACCTG
GGTGGTGGCC TGATCGAACG TGCGATGAGC GAAAGCGGTC TGCCGGTCGC GCGCCTCGAA
GGCGAACGGG GGCGTACCTG CAATCAGCTG CGCCTGTTCG CCAACGTGGT TCGTGCCGGT
GAATGGCTCG ACGTACGCGT CGATCCGGCC CTGCCCGAAC GCACGCCGCT GCCGCGTCCC
GATCTGCGTC AGCGCCATAT CGCGCTGGGC CCGGTCGCGG TCTTCGGTGC CAGCAACTTC
CCGCTGGCGT TCTCCGTGGC CGGTGGCGAT ACCGCTTCGG CACTCGCCGC GGGCTGCCCG
GTGATCGTCA AGGCACACGC CGCCCACCCC GGTACGTCCG AACTGGTGGC GCGCGCCATC
CAGAGCGCCG CCGAGAAGTG CGGCATGCCC GAAGGCGTGT TCTCCCTGCT CTTCGATGCC
GGTTACGACG TCGGCACCGC GCTGGTCAAG CATCCCCTCA TCAAGGCCGT CGGCTTCACC
GGCTCTCGCA AGGGTGGCCT GGCGCTGCTG CAGGCGGCAC AGTCTCGCCC CGAACCGATC
CCGGTCTACG CCGAGATGAG CAGCATCAAC CCGGTCTTCC TGATGCCCAA GGCACTGGAA
GCGCGCGGCA CCGATCTCGC GCAGTCGTTC GTGGGCTCCC TGTCCATGGG CGCGGGCCAG
TTCTGCACCA ACCCGGGGCT GGTGCTAGGC CTCAAGAGCG ACGCGCTGGA CAACTTCATC
GAGGAAGCCG GCAAGGCCCT CAAGGAAGTG CCCGCCAACA CCATGCTGAC ACCGGGCATC
CATGCGGCCT ATGAGCAGAG TGTCGCCAAG CTGGCCGGCA ACCCCAAGGT CAACGAAGTG
AGCCGCGGCC TGACCGGCGA CGGCGAGAAC CAGTGCCAGG CGGGGCTCTT CACCACGCAG
GGCGCCGACG TGCTGGCCGA CGAGTCGCTT CAGGAGGAAG TCTTCGGTGC CTCGTCTCTG
GTCGTGGTCT GCAACGATCT CGATGAAATG AAGCGTGTGG CCGAGGCCCT CGAAGGCCAG
TTGACCGCGA CCCTGCAGAT GGATGAAGGC GATACGCAGG ACGCGGCGAA GCTGCTGCCG
GTGCTCGAGC GCAAGGCCGG TCGCATCATG GCCAACGGTT GGCCCACCGG CGTCGAAGTC
TGCCATGCCA TGGTGCACGG CGGCCCCTTC CCGTCCACGT CCGACTCGCG CACCACCTCC
GTGGGCAGCG CCGCCATCTA TCGCTTCCTG CGCCCGGTGT GCTACCAGAA CCTGTCGGAT
GCCTTGTTGC CCGAAGCGCT CAAGGAAGCC AACAGCCTGG GACTCAAGCG CCTGGTGGAT
GGCAAGCGCG AAAGCTGA
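
The gene sequence is laid out in blocks of ten bases separated by spaces, so it has to be joined into one string before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the "GC content" field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a space/newline-blocked sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = (
    "ATGATATTGG AAGGCAAGCA GATTATCGGT "
    "AACACCATCG AAGCCGGCCC CGGCGCCAGC"
)

seq = clean_sequence(first_line)
print(len(seq))                    # 60 bases on one full line
print(f"{gc_fraction(seq):.1%}")   # GC fraction of this line only
```

Passing the full sequence through the same two functions gives a value that can be compared with the GC content listed in the record.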
 
Protein sequence
MILEGKQIIG NTIEAGPGAS FQAVDPSNGE TLPPEFLSAD SKQVERACQL AWDAFDAYRE 
TSLETRAKFL ETIAEEIEHL GGGLIERAMS ESGLPVARLE GERGRTCNQL RLFANVVRAG
EWLDVRVDPA LPERTPLPRP DLRQRHIALG PVAVFGASNF PLAFSVAGGD TASALAAGCP
VIVKAHAAHP GTSELVARAI QSAAEKCGMP EGVFSLLFDA GYDVGTALVK HPLIKAVGFT
GSRKGGLALL QAAQSRPEPI PVYAEMSSIN PVFLMPKALE ARGTDLAQSF VGSLSMGAGQ
FCTNPGLVLG LKSDALDNFI EEAGKALKEV PANTMLTPGI HAAYEQSVAK LAGNPKVNEV
SRGLTGDGEN QCQAGLFTTQ GADVLADESL QEEVFGASSL VVVCNDLDEM KRVAEALEGQ
LTATLQMDEG DTQDAAKLLP VLERKAGRIM ANGWPTGVEV CHAMVHGGPF PSTSDSRTTS
VGSAAIYRFL RPVCYQNLSD ALLPEALKEA NSLGLKRLVD GKRES
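
The protein sequence can be checked against the gene sequence by translating the latter. The following is a minimal sketch, not the database's own pipeline: it builds the standard codon table from the conventional TCAG ordering. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted, which this sketch does not model. The function name `translate` is illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGATATTGG AAGGCAAGCA GATTATCGGT"))  # MILEGKQIIG
print(translate("GAAAGCTGA"))                         # ES
```

The two outputs match the first ten and last two residues of the protein sequence listed in the record.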