Gene Caci_4403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4403 
Symbol 
ID8335757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4997846 
End bp4999336 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content73% 
IMG OID644957506 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003115108 
Protein GI256393544 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGCGA TCTCCCCGGA GCAGGAGCGC CGGCTTCTGG ACTCCGTGCC CACCGGACTG 
CTGATCGACG GACGCTGGCG CCAGGCGTCC GACGGCGGAA CGCTCGACGT CAGCGACCCC
TCGACCGGCG AGGTGCTGCT GACCGTCGCC AGCGCGTCGG CCGCCGACGG CCAGGACGCG
CTGGCCGCCG CGCACGCCGC GCAGGCTTCG TGGGCGCGCA CGGCGCCGCG TCAGCGCGCG
GAGATCCTGC GCAAGGCGTT CGACCTGGTC ACCGCGCGCA CCGAGGACTT CGCGCTGCTG
ATGACGCTGG AGATGGGCAA GCCGCTGGCG GACTCGCGCG CCGAGGTCGC CTACGGCGCG
GAGTTCCTGC GCTGGTTCTC CGAGCAGACG AACCGGATCG CCGGGCGGTA CCAGACGGCG
CCGGACGGAG CGGCGCGCCT GCTGGTCGCC AAGCGGCCGG TCGGACCGTG CCTGCTGATC
ACCCCGTGGA ATTTCCCGCT GGCGATGGGG ACGCGCAAGC TGGGACCGGC GCTGGCCGCC
GGGTGCACGG TCGTGTTCAA GCCGGCCGCG CTCACCCCGC TGACCTCGCT GCTGCTCGCG
CAGACGCTGA TCGAGGCCGG GGTCCCCGAC GGCGTGGTGA ACGTGATCCC GACCCGGCAC
GCAGGAGCGG TCACCGGCCC GCTGATCCGC GACCCGCGCC TGCGCAAGCT GTCCTTCACC
GGCTCGACCG AGGTCGGGCA GCAGCTGATC GCGGACTCCG CCGAGCAGGT GCTGCGGGTG
TCGATGGAAC TCGGCGGCAA CGCGCCGCTG ATCGTCTTCG CCGACGCCGA CCTGGACCGC
GCGCTGGACG GCGCCATGCT CGCCAAGCTG CGCAACGGCG GCGAGGCGTG CACCGCGGCG
AACCGGCTGC TGGTGGAGCG CTCCGTCGCC GACGTCTTCG CCGACCGCCT GACCAGCCGT
TTCCGCGAGC ACACGCTCGG GCGCGGCACG CTGCCGGACG TCAAGATCGG CCCGCTGGTC
GACGCCGAGA CCCGCGACAA GGTCGAGCGC CTGGTCGACG CCGCGGTCGA AGGCGGCGCG
AAGGTCCTCA CCGGCGGCCA CAAGCTCCCC GGAGCGGGCT ACTTCTACGA GCCGACGGTC
CTGACCGACA TCCCCGCCGG CGCGGAGATC CTGCGCGAGG AGATCTTCGG ACCGGTCGCC
CCGATCATCG CCTTCGACAG CGAGGACGAG GCCGTCGCCC TGGCCAACGA GACGCAGTAC
GGCCTGGTCG CCTACGCCTT CACCAAGGAC CTGAACCGCG GCCTGCGCCT GGCCGAGCGC
CTCGACGCCG GCATGATCGG CCTGAACACC GGGATTGTGT CGAACCCGGC GGCGCCCTTC
GGCGGGGTGA AGCAGTCCGG GATCGGGCGC GAGGGCGGGC TGGAGGGCAT CGAGGAGTAC
TTGGAGACGC GGTACGTGGG GATCGCCGAT CCCTTCGCCG AGGGCGCCTA G
 
Protein sequence
MTAISPEQER RLLDSVPTGL LIDGRWRQAS DGGTLDVSDP STGEVLLTVA SASAADGQDA 
LAAAHAAQAS WARTAPRQRA EILRKAFDLV TARTEDFALL MTLEMGKPLA DSRAEVAYGA
EFLRWFSEQT NRIAGRYQTA PDGAARLLVA KRPVGPCLLI TPWNFPLAMG TRKLGPALAA
GCTVVFKPAA LTPLTSLLLA QTLIEAGVPD GVVNVIPTRH AGAVTGPLIR DPRLRKLSFT
GSTEVGQQLI ADSAEQVLRV SMELGGNAPL IVFADADLDR ALDGAMLAKL RNGGEACTAA
NRLLVERSVA DVFADRLTSR FREHTLGRGT LPDVKIGPLV DAETRDKVER LVDAAVEGGA
KVLTGGHKLP GAGYFYEPTV LTDIPAGAEI LREEIFGPVA PIIAFDSEDE AVALANETQY
GLVAYAFTKD LNRGLRLAER LDAGMIGLNT GIVSNPAAPF GGVKQSGIGR EGGLEGIEEY
LETRYVGIAD PFAEGA