Gene Caci_5100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5100 
Symbol 
ID8336454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5857567 
End bp5859009 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content73% 
IMG OID644958199 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003115801 
Protein GI256394237 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.482433 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0382581 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCTG TCAGCACGGA AGCCACGGAA GCGGTCGAGT ACACCGCCGA AGTCTTCATC 
GACGGCCGCT TCGAATCCCC CGCCTCGGCG TGGCGGCCGG TCCTGGACAA GGCGGCCGGG
ACGCCGTTCG CCCGCTACGG CGACGCCTCG GCCGAGCAGG TCGACCGCGC CGTGGCCGCC
GCCCGCCGCG CGCAGCCCGC CTGGGCCGAC ACCGACGCGA ACACCCGCTG CGACGTCATC
CGGGCGTTCG CCGCGCAGCT CCAGCGGCGC CACGACGAGC TGATCACGCT GCTCGTCCGG
GAGACCGGCG GCACGGCGGA GAAGGCCGAG GAGGAACTCG GCCAGTCGAT CAACCAGCTG
CTGAACTCCG CGACGCAGCT CACTGAGAAC GCCGGCTCGA TCCTGCCGCC CTACAAGCCC
GGCAAGATGT CCCTGTCGCG CGCCGTCCCG CTCGGCGTCA TCGGCCTGAT CGTGCCGTGG
AACTACCCGA TGAGCCTGGC GATGCGGGCG CTGGCGCCGG GCCTGGCCTA CGGCAACGCC
GTCGTCCTCA AGCCCGCCGA GCTCACCCCG ATCGCCGGCG GCCGGATCCT CGCCGAGGCC
GCGCGCGCCG CCGGCGTGCC GGACGGTCTG CTGGCCGTGC TGCCCGGCGA CGGCCCGGCC
ACCGGCGCCG CGCTGTCCCG CCACCGGGGT CTGGACCTGA TCCACTTCAC CGGCTCCTTC
GAGGTCGGCG CGGCGATCAG GGAGCACGGC GCGCGCACCG GGACCCCGGT GATCACCGAG
CTCGGCGGCG ACAACGCCTT CGTCGTGCTC GACGACGCCG ACGTCGAGCA GGCCGCGAGC
TGCGCGGTCT GGACCGCCCT GTGGTACCAG GGCCAGACCT GCATCAGCGC CGGCCGCCAC
ATCGTGCAGC GCGCGATCGC CGCGGAGTTC ACCGAGGCGG TCGTCGAGCG CGTCCGCAAA
CTGCGGGTCG GCGACCCGCT GCGCGAAGAG GTGGACCTCG GCCCGGTGAT CAGCGCCGGG
CAGCTGGCCC GCTTCCATGA GGGGCTCGTC CTGCCCTCGA TCGACGCCGG CGCGCGAGTC
GCGGTCGGCG CCGAGCACGA CGGCCTGTTC TACCGCCCGA CGGTCCTCAC CGACGTCACG
CCGGACATGC CGATCTTCCA GGAGGAGACG TTCGGACCGG TCATGCCGAT CACCGTCGTC
GACTCCGAAC TCCAGGCTCT GGAGCTCGCC AACCGCCACC GCACGCTGAT GAACTCCGTG
TTCTCCGGCG ACCCGCTGCG CGGCTACGAG TTCGCCGAGC GGCTGCACAG CAACGAGGTC
CACGTCAACG ACGGCTACGC CCGCCACGGC GGCGAAGGCC AGCTCGCCGG CTTCACCCGC
CGCCAGTGGA TCGGCCTGCA GACGACGCCG GTCTCCTACC CGGCCTGGGC TCAAGGTGTC
TGA
 
Protein sequence
MSSVSTEATE AVEYTAEVFI DGRFESPASA WRPVLDKAAG TPFARYGDAS AEQVDRAVAA 
ARRAQPAWAD TDANTRCDVI RAFAAQLQRR HDELITLLVR ETGGTAEKAE EELGQSINQL
LNSATQLTEN AGSILPPYKP GKMSLSRAVP LGVIGLIVPW NYPMSLAMRA LAPGLAYGNA
VVLKPAELTP IAGGRILAEA ARAAGVPDGL LAVLPGDGPA TGAALSRHRG LDLIHFTGSF
EVGAAIREHG ARTGTPVITE LGGDNAFVVL DDADVEQAAS CAVWTALWYQ GQTCISAGRH
IVQRAIAAEF TEAVVERVRK LRVGDPLREE VDLGPVISAG QLARFHEGLV LPSIDAGARV
AVGAEHDGLF YRPTVLTDVT PDMPIFQEET FGPVMPITVV DSELQALELA NRHRTLMNSV
FSGDPLRGYE FAERLHSNEV HVNDGYARHG GEGQLAGFTR RQWIGLQTTP VSYPAWAQGV