Gene Caci_5004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5004 
Symbol 
ID8336358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5728247 
End bp5729461 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content72% 
IMG OID644958103 
Productputative transcriptional regulator, PucR family 
Protein accessionYP_003115705 
Protein GI256394141 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism
[T] Signal transduction mechanisms 
COG ID[COG2508] Regulator of polyketide synthase expression 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.351377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.990518 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGCAG CTCACTCGCC GGAGGTCGAG GCTGAGGCCC GGGCCCTCGT GGCACGGTTC 
GCAGAACGGC TCGATGAACT GGCTGATCTC ATGGCCGCGC ACATCCTGGC CGAGTCGGCC
GTATTGCGTT CCTTGGTCGC CGAGCACGAG CTGCGGGTCA CCTGTGCCCA GAACATCAAC
CAGCTGCTGA CCATGCTCGG CGGGCGTCCT CCGGCCGGTT CGGCGCCGGC CGATGCCGGG
CGCGCCGGAG TCCGCGACGG GGTGCCGATG TCGGCGCTGT TCGACGCCTA CCAGATCGGC
GCGACCTTCC TGTGGGAGGA ACTGGCCGAA ACCGGAGCTC ACGGCGAATG GTCAGCCGCC
GCGGTGATCC TGCTCGCGAC CAAGTCCTGG CGGCATCTGC ACGAGTTCAC CACCGCCATG
TCCGAGAGCT ACCGCGCCGA GCTGGAGACG CGGGTCCGCA AGCAGGTCCG CAGACGGTCG
GCGTTGGTGC AGGCACTGTT GGAGGGAAGC CTGGCCGAGC CCGAACTGTG GGAGGCCGCG
GACCTGCTGC GGCTCCCCCA CCGCGGCCCG TACGTGGTGA TCGCCGCGCG GGTGGCCGGT
ATCGCGGAGT CGGCACTGCC GACCATCGAG CAGACCCTCG ACGCCCTCGG CATCGGCTCA
GCCTGGCAGC TCACCCACGA CCTGGAGGTC GGCGTGGCCA GCCTGCCGCG CCCCGGCGAC
CAGTTCGACC GCCTGATCGA GAAGCTCGAC GCCGACGGCG CGAGCCGAGT CGGTGTCAGT
CCCCTGTATG AGGACCTGGC CGCCACCTCG CAGGCCGTGC GCCTGGCGCG GATCGCCCTC
CGCGGCGCCG CCAACCCGGG CCGCGTCGTG GTCTTCGGCC GCGACCCCCT GTCGGTGGCG
GCGGCCAGCG CACCGGACGT GATGGCCCGC CTGGCCCGCA CGATCCTCGC CGGCCTCGAC
GGCATGCCGC CCGAGGACCG CCTCATCCTG CTGGACACCT TCGGAGCATG GCTCGACGGC
GCCGGCTCAG CCGAGGAGGC AGCACGCCGA CTCCATGTGC ACCCGAACAC CGTGCGCTAC
CGCCTCCGCC GCCTTGAGGA ACGCACCGGC CGGGCATTGT CGGACCCGCG GCATGTGGCG
GAGCTGAGCT TGGCCTTCGA AGTTAAGCGC GGGTGGGAGT CAGGGGCGGC AGTCGCGGAG
TCGCACAGCG GGTAG
 
Protein sequence
MAAAHSPEVE AEARALVARF AERLDELADL MAAHILAESA VLRSLVAEHE LRVTCAQNIN 
QLLTMLGGRP PAGSAPADAG RAGVRDGVPM SALFDAYQIG ATFLWEELAE TGAHGEWSAA
AVILLATKSW RHLHEFTTAM SESYRAELET RVRKQVRRRS ALVQALLEGS LAEPELWEAA
DLLRLPHRGP YVVIAARVAG IAESALPTIE QTLDALGIGS AWQLTHDLEV GVASLPRPGD
QFDRLIEKLD ADGASRVGVS PLYEDLAATS QAVRLARIAL RGAANPGRVV VFGRDPLSVA
AASAPDVMAR LARTILAGLD GMPPEDRLIL LDTFGAWLDG AGSAEEAARR LHVHPNTVRY
RLRRLEERTG RALSDPRHVA ELSLAFEVKR GWESGAAVAE SHSG