Gene Information |
Locus tag | Csal_1140 |
Symbol | |
ID | 4027705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 1300849 |
End bp | 1302426 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637966317 |
Product | aldehyde dehydrogenase |
Protein accession | YP_573195 |
Protein GI | 92113267 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
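
The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1300849
END_BP = 1302426


def gene_length_bp(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp):
    """Residues in an unspliced CDS: total codons minus one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 1578 525
```

Under these assumptions the result agrees with the listed Gene Length (1578 bp) and Protein Length (525 aa).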
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATATTGG AAGGCAAGCA GATTATCGGT AACACCATCG AAGCCGGCCC CGGCGCCAGC TTCCAAGCCG TCGACCCTTC GAACGGAGAG ACGCTGCCCC CCGAATTCCT CAGCGCCGAC AGCAAGCAGG TCGAACGTGC CTGTCAGCTC GCCTGGGACG CCTTCGATGC ATATCGGGAA ACCTCGCTGG AAACGCGGGC CAAGTTCCTG GAAACCATCG CCGAGGAAAT CGAACACCTG GGTGGTGGCC TGATCGAACG TGCGATGAGC GAAAGCGGTC TGCCGGTCGC GCGCCTCGAA GGCGAACGGG GGCGTACCTG CAATCAGCTG CGCCTGTTCG CCAACGTGGT TCGTGCCGGT GAATGGCTCG ACGTACGCGT CGATCCGGCC CTGCCCGAAC GCACGCCGCT GCCGCGTCCC GATCTGCGTC AGCGCCATAT CGCGCTGGGC CCGGTCGCGG TCTTCGGTGC CAGCAACTTC CCGCTGGCGT TCTCCGTGGC CGGTGGCGAT ACCGCTTCGG CACTCGCCGC GGGCTGCCCG GTGATCGTCA AGGCACACGC CGCCCACCCC GGTACGTCCG AACTGGTGGC GCGCGCCATC CAGAGCGCCG CCGAGAAGTG CGGCATGCCC GAAGGCGTGT TCTCCCTGCT CTTCGATGCC GGTTACGACG TCGGCACCGC GCTGGTCAAG CATCCCCTCA TCAAGGCCGT CGGCTTCACC GGCTCTCGCA AGGGTGGCCT GGCGCTGCTG CAGGCGGCAC AGTCTCGCCC CGAACCGATC CCGGTCTACG CCGAGATGAG CAGCATCAAC CCGGTCTTCC TGATGCCCAA GGCACTGGAA GCGCGCGGCA CCGATCTCGC GCAGTCGTTC GTGGGCTCCC TGTCCATGGG CGCGGGCCAG TTCTGCACCA ACCCGGGGCT GGTGCTAGGC CTCAAGAGCG ACGCGCTGGA CAACTTCATC GAGGAAGCCG GCAAGGCCCT CAAGGAAGTG CCCGCCAACA CCATGCTGAC ACCGGGCATC CATGCGGCCT ATGAGCAGAG TGTCGCCAAG CTGGCCGGCA ACCCCAAGGT CAACGAAGTG AGCCGCGGCC TGACCGGCGA CGGCGAGAAC CAGTGCCAGG CGGGGCTCTT CACCACGCAG GGCGCCGACG TGCTGGCCGA CGAGTCGCTT CAGGAGGAAG TCTTCGGTGC CTCGTCTCTG GTCGTGGTCT GCAACGATCT CGATGAAATG AAGCGTGTGG CCGAGGCCCT CGAAGGCCAG TTGACCGCGA CCCTGCAGAT GGATGAAGGC GATACGCAGG ACGCGGCGAA GCTGCTGCCG GTGCTCGAGC GCAAGGCCGG TCGCATCATG GCCAACGGTT GGCCCACCGG CGTCGAAGTC TGCCATGCCA TGGTGCACGG CGGCCCCTTC CCGTCCACGT CCGACTCGCG CACCACCTCC GTGGGCAGCG CCGCCATCTA TCGCTTCCTG CGCCCGGTGT GCTACCAGAA CCTGTCGGAT GCCTTGTTGC CCGAAGCGCT CAAGGAAGCC AACAGCCTGG GACTCAAGCG CCTGGTGGAT GGCAAGCGCG AAAGCTGA
|
Protein sequence | MILEGKQIIG NTIEAGPGAS FQAVDPSNGE TLPPEFLSAD SKQVERACQL AWDAFDAYRE TSLETRAKFL ETIAEEIEHL GGGLIERAMS ESGLPVARLE GERGRTCNQL RLFANVVRAG EWLDVRVDPA LPERTPLPRP DLRQRHIALG PVAVFGASNF PLAFSVAGGD TASALAAGCP VIVKAHAAHP GTSELVARAI QSAAEKCGMP EGVFSLLFDA GYDVGTALVK HPLIKAVGFT GSRKGGLALL QAAQSRPEPI PVYAEMSSIN PVFLMPKALE ARGTDLAQSF VGSLSMGAGQ FCTNPGLVLG LKSDALDNFI EEAGKALKEV PANTMLTPGI HAAYEQSVAK LAGNPKVNEV SRGLTGDGEN QCQAGLFTTQ GADVLADESL QEEVFGASSL VVVCNDLDEM KRVAEALEGQ LTATLQMDEG DTQDAAKLLP VLERKAGRIM ANGWPTGVEV CHAMVHGGPF PSTSDSRTTS VGSAAIYRFL RPVCYQNLSD ALLPEALKEA NSLGLKRLVD GKRES
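
The two sequences above can be cross-checked by translating the gene sequence and comparing it with the protein sequence. The following is a minimal sketch, not the database's own method: it uses only the first 30 and last 18 nucleotides of the gene sequence, and it builds the codon table by hand. It assumes the amino acid assignments of translation table 11, which match the standard genetic code; the helper name `translate` is hypothetical.

```python
from itertools import product

# Amino acid assignments for translation table 11 (same as the standard
# code), listed in TCAG order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# First three and last two 10-base blocks, copied from the gene sequence.
first_30 = "ATGATATTGG AAGGCAAGCA GATTATCGGT"
last_18 = "GGCAAGCGCG AAAGCTGA"

print(translate(first_30))  # MILEGKQIIG
print(translate(last_18))   # GKRES*
```

Both fragments reproduce the corresponding ends of the listed protein sequence, with the final TGA codon acting as the stop.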