Gene Information |
Locus tag | Caci_5100 |
Symbol | |
ID | 8336454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5857567 |
End bp | 5859009 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644958199 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_003115801 |
Protein GI | 256394237 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.482433 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0382581 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCTG TCAGCACGGA AGCCACGGAA GCGGTCGAGT ACACCGCCGA AGTCTTCATC GACGGCCGCT TCGAATCCCC CGCCTCGGCG TGGCGGCCGG TCCTGGACAA GGCGGCCGGG ACGCCGTTCG CCCGCTACGG CGACGCCTCG GCCGAGCAGG TCGACCGCGC CGTGGCCGCC GCCCGCCGCG CGCAGCCCGC CTGGGCCGAC ACCGACGCGA ACACCCGCTG CGACGTCATC CGGGCGTTCG CCGCGCAGCT CCAGCGGCGC CACGACGAGC TGATCACGCT GCTCGTCCGG GAGACCGGCG GCACGGCGGA GAAGGCCGAG GAGGAACTCG GCCAGTCGAT CAACCAGCTG CTGAACTCCG CGACGCAGCT CACTGAGAAC GCCGGCTCGA TCCTGCCGCC CTACAAGCCC GGCAAGATGT CCCTGTCGCG CGCCGTCCCG CTCGGCGTCA TCGGCCTGAT CGTGCCGTGG AACTACCCGA TGAGCCTGGC GATGCGGGCG CTGGCGCCGG GCCTGGCCTA CGGCAACGCC GTCGTCCTCA AGCCCGCCGA GCTCACCCCG ATCGCCGGCG GCCGGATCCT CGCCGAGGCC GCGCGCGCCG CCGGCGTGCC GGACGGTCTG CTGGCCGTGC TGCCCGGCGA CGGCCCGGCC ACCGGCGCCG CGCTGTCCCG CCACCGGGGT CTGGACCTGA TCCACTTCAC CGGCTCCTTC GAGGTCGGCG CGGCGATCAG GGAGCACGGC GCGCGCACCG GGACCCCGGT GATCACCGAG CTCGGCGGCG ACAACGCCTT CGTCGTGCTC GACGACGCCG ACGTCGAGCA GGCCGCGAGC TGCGCGGTCT GGACCGCCCT GTGGTACCAG GGCCAGACCT GCATCAGCGC CGGCCGCCAC ATCGTGCAGC GCGCGATCGC CGCGGAGTTC ACCGAGGCGG TCGTCGAGCG CGTCCGCAAA CTGCGGGTCG GCGACCCGCT GCGCGAAGAG GTGGACCTCG GCCCGGTGAT CAGCGCCGGG CAGCTGGCCC GCTTCCATGA GGGGCTCGTC CTGCCCTCGA TCGACGCCGG CGCGCGAGTC GCGGTCGGCG CCGAGCACGA CGGCCTGTTC TACCGCCCGA CGGTCCTCAC CGACGTCACG CCGGACATGC CGATCTTCCA GGAGGAGACG TTCGGACCGG TCATGCCGAT CACCGTCGTC GACTCCGAAC TCCAGGCTCT GGAGCTCGCC AACCGCCACC GCACGCTGAT GAACTCCGTG TTCTCCGGCG ACCCGCTGCG CGGCTACGAG TTCGCCGAGC GGCTGCACAG CAACGAGGTC CACGTCAACG ACGGCTACGC CCGCCACGGC GGCGAAGGCC AGCTCGCCGG CTTCACCCGC CGCCAGTGGA TCGGCCTGCA GACGACGCCG GTCTCCTACC CGGCCTGGGC TCAAGGTGTC TGA
|
Protein sequence | MSSVSTEATE AVEYTAEVFI DGRFESPASA WRPVLDKAAG TPFARYGDAS AEQVDRAVAA ARRAQPAWAD TDANTRCDVI RAFAAQLQRR HDELITLLVR ETGGTAEKAE EELGQSINQL LNSATQLTEN AGSILPPYKP GKMSLSRAVP LGVIGLIVPW NYPMSLAMRA LAPGLAYGNA VVLKPAELTP IAGGRILAEA ARAAGVPDGL LAVLPGDGPA TGAALSRHRG LDLIHFTGSF EVGAAIREHG ARTGTPVITE LGGDNAFVVL DDADVEQAAS CAVWTALWYQ GQTCISAGRH IVQRAIAAEF TEAVVERVRK LRVGDPLREE VDLGPVISAG QLARFHEGLV LPSIDAGARV AVGAEHDGLF YRPTVLTDVT PDMPIFQEET FGPVMPITVV DSELQALELA NRHRTLMNSV FSGDPLRGYE FAERLHSNEV HVNDGYARHG GEGQLAGFTR RQWIGLQTTP VSYPAWAQGV
|
| |
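The length fields in the record above can be cross-checked against its coordinates. The following is a minimal sketch, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that Protein Length excludes the stop codon. Both assumptions agree with the values listed in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C in a nucleotide string; spaces and other symbols are skipped."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Coordinates taken from the record (Start bp, End bp).
length_bp = gene_length(5857567, 5859009)
print(length_bp, protein_length(length_bp))
```

With the coordinates from this record the script prints `1443 480`, matching the Gene Length and Protein Length rows. `gc_percent` can be applied to the Gene sequence row as-is, since the spaces between 10-base blocks are skipped.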