Gene Caci_1959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1959 
Symbol 
ID8333302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2216439 
End bp2217269 
Gene Length831 bp 
Protein Length276 aa 
Translation table11 
GC content69% 
IMG OID644955108 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_003112720 
Protein GI256391156 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.222894 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCATG CCCGCACAGT CCTCATCACC GGCGCGACCG ACGGCCTCGG TCGTGCGCTG 
GCCCATCGGT TGGCCGCCGG GGGCGACACC GTTCTGTTGC ACGGACGCGA TCAGGGGCGT
CTCGATGCGA CCGCGGATGC CATCCGCGAC GAGCACGGGG TCGCGCGGCC CGCGACGTTC
CGTGCCGATC TGGCGGAGCT GGCGCAGGTG CGCGAGCTTG CGGCTGCGGT GCGGGGCGCC
ACCGAGCGCC TCGACGTCCT GGTGAGCAAC GCCGGCATCG GCAGCGGGGA GCCGGACGGC
CGCGATCGGC GCGAGAGCAA GGACGGCTAT GAGCTCCGCT TCGCGGTGAA CTACCTGGCC
GGGTTCCTGT TGACGCAGGA GTTGTTGCCG CTGCTGCGCG CCTCGGCGCC GGCGCGGGTG
GTGAACGTCG CCTCGCTCGG GCAGCAGGAG ATCGAGTTCG ACGACGTGAT GCTGGAGCAC
GGGTACAGCG GTATCCGGGC GTACTGCCAG AGCAAGCTGG CGCAGATCGC CTCGACGGTG
GAGCTGGCCG AGCGCGTGCC GGCAGCGGAG GTCACGTTCA ACAGCCTGCA TCCGGCGACG
TACATGCCGA CGAAGATGGT GCTCCAGGAA GCCGGGCACA GCATCGACAG CCTGGAGACC
GGCGTCGAGG CGACGTGGCG GCTCGTGACG GATCCGAAGC TGGCAGGGGT CAGCGGACGG
TTCTACGACC GGCAGCGGGA GTCACAGGCC TTGAAGCAGG CCTATGACAC GCGGGCGCGG
GCGGAGTTGT ATCAGCTGAG CTTGAAGTTG GTCGGGCTCA CGCAGGGATA G
 
Protein sequence
MTHARTVLIT GATDGLGRAL AHRLAAGGDT VLLHGRDQGR LDATADAIRD EHGVARPATF 
RADLAELAQV RELAAAVRGA TERLDVLVSN AGIGSGEPDG RDRRESKDGY ELRFAVNYLA
GFLLTQELLP LLRASAPARV VNVASLGQQE IEFDDVMLEH GYSGIRAYCQ SKLAQIASTV
ELAERVPAAE VTFNSLHPAT YMPTKMVLQE AGHSIDSLET GVEATWRLVT DPKLAGVSGR
FYDRQRESQA LKQAYDTRAR AELYQLSLKL VGLTQG