Gene Caci_4943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4943 
Symbol 
ID8336297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5643484 
End bp5644485 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content72% 
IMG OID644958042 
Productaldo/keto reductase 
Protein accessionYP_003115644 
Protein GI256394080 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.020989 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCCTT CCATACCCAC CGTCCGGCTC GGCTCGGACG GTCCCGCCGT CGGCGCGCAA 
GGGCTCGGCT GCATGGGGAT GAGCGAGTTC TACGGCGACA CCGACCAGGA CTCGGCCCGC
CAGACCCTCG AGGCCGCCCT GTCCGCCGGT GTCACCTTGT TCGACACCGC CGACATGTAC
GGCCGGGGCG AGAACGAACG CTTCCTCGCC CCGTTCCTCC GCGCCCACCG GGACCACGTC
GTCATCGCCA CCAAGTTCGG CAGCGTCCGC GCCGCCGACG GGCCGATGTC GGTCAGCAAC
GACCCCGCCC ACATCCGCCG CGCCGTCGAG GCCAGCCTGA CGCGGCTGGG CATCGAGGTC
ATCGACCTCT ACTACATGCA CCGCCGCGAC CCCGCGGTCC CGCTGGCCGA CTCCGTCGGA
GCGATGGCCG ACCTCGTCCA CGCCGGCAAA GTCCGCCACC TGGGCCTGTC CGAGGTCACC
GCCGACGAAC TGCGCGAGGC CCACAGCCAC CATCCGATCA GCGCGGTGCA GGCGGAGTGG
TCCCTGTTCA CCCGGGACAT CGAACGCAGC CTCGTACCCG CCGCCGCCGA ACTCGGCGTC
GGCGTGGTCG CCTACTCCCC CCTCGGCCGC GGCTTCCTCA CCGGCGCTGT GCCCAGCACC
TTGGCCGCCG ACGACGTGCG CACCCGATTC CCCCGCTTCA CCGGCGAGAA CGCCGAGCGC
AACGCGGCGC TCCTGCCCCC GATCACCTCG ATCGCCGCCG CCCGCGGCGC CACACCCGCG
CAGGTCGCGT TGGCGTGGCT GCACCAGCGG CGCGCCACAC ACCGCCTCCC CGTCGTGCCG
ATCCCCGGCA CCCGGCACCC GCACCGCTTG AAGGAGAACC TCGCCGCCCT CGAACTCACT
CTCACCGCTG AGGAACTCGC ACGCCTGGAA CCCCTCGCCG CGCACGTCGC CGGCGACCGG
TACCCCGACA TGGCCGAGAC GTCCAACGCC CGCGAAGCCT GA
 
Protein sequence
MSPSIPTVRL GSDGPAVGAQ GLGCMGMSEF YGDTDQDSAR QTLEAALSAG VTLFDTADMY 
GRGENERFLA PFLRAHRDHV VIATKFGSVR AADGPMSVSN DPAHIRRAVE ASLTRLGIEV
IDLYYMHRRD PAVPLADSVG AMADLVHAGK VRHLGLSEVT ADELREAHSH HPISAVQAEW
SLFTRDIERS LVPAAAELGV GVVAYSPLGR GFLTGAVPST LAADDVRTRF PRFTGENAER
NAALLPPITS IAAARGATPA QVALAWLHQR RATHRLPVVP IPGTRHPHRL KENLAALELT
LTAEELARLE PLAAHVAGDR YPDMAETSNA REA