Gene Caci_7082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_7082 
Symbol 
ID8338449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp8233747 
End bp8236719 
Gene Length2973 bp 
Protein Length990 aa 
Translation table11 
GC content69% 
IMG OID644960163 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003117753 
Protein GI256396189 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.379783 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGGTGC AGTACGGGCT GCTCGGACCG GTCCGCGCGC TGCGAGTGGG CGCCGCGGGA 
GAACCAGCCG AGGAGCTCAA AGTGGGGTCC CCGCAACAGC AGGCCGTGCT GGCGCTGCTG
GCGTCACGGG CCGGTCGGGT GGCGACAGCC GACGAACTCA TCGAGGGTCT GTGGGGCGAC
GAGCCGCCGG AGGGTGCGCT CGGTACGGTG CGCACGTACG CGTTCCGACT CCGTAAGGTC
TTCGGTGCCG AAGCCATAGC GTCCATCGCC GGTGGCTATG CGTTACGCGC GGAGCGTTCG
CACGTGGACT TGTTCACGTT CGAGGATCAT GTCGCGGCAG CGGAGCAGCG CCGGATGTCC
GGGGACGTCG TGGGCGCGCG TGGAGAACTC GCTAATGCGT TGGCGCTATG GCGAGGCACT
CCGTTGGCCG GGATACCGGG CCCTTATGCG GAGTCCCTGC GCGCGACGCT CTTGGAGCGG
CGTACCGCTG CGCAGGAGAG CCGCATCGCC TTGGATTTGG CGTTGGGGCG TGTAGGAGAC
GCCGTTTCCG AACTCACGGT CCTCACCGCT GAACACCCGC TGCGTGAGGG ACTGCGTGCG
CAGCTGATGC TGGCGCTCTA TCAGACCGGA CGACAGGCCG AGGCGCTAGG CGTCTACGCA
GATACTCGAC GGCTGCTGCG CAAGGAACTG GGAGTCGATC CCGGCGCAGA GCTCGGCGAA
CTGCATCAGC GGATCCTGCG TGCCGACCCG TCCTTGGCGC GTCCTTCTGC CGACAGTGCC
GAGATCGCCC CGAGAGCGGC TGAGGCTTCT GCGGGGACCG CTTTGGAGAA GCCGCCTACG
ATCCCTGCGC AGCTTCCTGC TGACACAGCC GACTTCACCG GACGCGAGAA CCTGACGCGC
GTGCTCGCGG CGCGGATATC CACGACGGTC GGGCAGTCGG TGGCGGTCTG TGCGCTGTCC
GGGCTCGGCG GAGTCGGCAA GACGGCGCTC GCCATCCACT TGGCGCACTC GGTGCGGGAG
GAGTTTCCCG ACGGGCAGCT CTATGTGGAC CTGCGCGGAG GCGACCCGAC GCCGGCAGAC
CCTGCGCCGG TTCTCGCAGC GTTCCTGCGG GGTCTGGGGA TCTCGGAGGG CGAGACTGCG
CCAGGGCTGG AGGAGAGGGC TGCGGCGTAT CGGTCGGCGC TCGCCGGTCG GAGGGTTCTG
ATCGTCCTGG ACAACGCCCG GGACGCCGCG CAAGTACGGC CTCTGCTGCC CGGCGCTCCG
GGGTGCGCGG TCATCGTCAC GAGCCGGCCG AAGCTGACCG GGTTGGCCGG GGCGACGTTC
GCTGATCTCG ACGTGCTGGA TCCCGGCGAG GCGATGAACA TGTTCACGCG GATCGTCGGC
GAGGAACGCC TAGGGATGGA GCACACCGCG GCTATAGACG TGGTGTCCCT CTGCGGATAC
TTGCCTCTGG CGGTGCGTAT AGCGGCCGCA CGGCTCGCGT CCCGACCGCG CTGGCGTATC
GGGTCGCTCG CGGCGCGGTT GTCCGACGAG CGGCGCCGGC TGGGCGAACT GGCTGTGGGC
GACCTCGCGG TGCGCGCGGC GTTCGAACTG GGCTACCACC AGTTGTCCCC GGCGCAGGCG
GATGTCTTCC GGCGGCTGTC GCAGCTGAAC AGCGCTGACG TGTCGGCTGC TGCGGCGGCC
GCGCTGCTCG GCCAGGACGA GGCCGACACC GAGGAAGTGC TGGAGTCGCT GGTGGATGCC
GCGATGCTGG AGTCTTCGTC TCCGGGGCGC TACCGCTATC ACGACTTGCT GCGGCTCTAT
GCGCGGGAGC AGTACGCGGC CGAGGGTGGC CTCGACGACG CGGGCTTCGT GAGCCTGCTC
GACTTCTACC TCGCCTCGAT GCGCCGGCTG CGGCGCATCC CGGTGGAGGA GCTCGGTCTT
TTCCCGACGC GGTCCTCTGG GAGGGAGTTC GACTCCGTGG AGGCCGGGGT GCGGTGGATC
GCCGACGAGG GCAGCTGTGT GGACGCGGTG TTCAATCGGC GGGTCCTGAC CGCGCCGCCG
TGCAGCGTCG GGGTGGCGCC GTTGACGCTC GCAGTCGAGC TGCTGGACCA CCTTGTCTCG
CTGCCGGGTA TCGAACGCTA TGCGGATGAG CTCTCTGCGG CGGCGCGTAA CGCTGCCGCA
GCTGCCGTGG AGTCTGGGGA CGTACGGAGT GAGGCGCGTG TACGGCACTC CTTGGCGCGC
ATCCTTTACG CGACGTACCA GATCGAGGCT GCTGCCGAGG AGGCTGAGCG ATCGCACCGC
GCCGCCGAGT CGGTCGGCGG CGACGAGACT CAGGCGGACG CCGTCAATCT GCTCGCGATG
ACGTATGCGG ACCTTGGGAG GGACGCGGAG GCGATCGCGC TGTACGAGCG CGCTGTCGAC
GTCAGCCGGG AGTTCGGGGA CGTGGCGTCC GAGGCGGCCG CGCGGCAGAA CATGGCGCGC
TCGCTGCTGG TGTTGGGTCG GACGGAGCAG GCACTGCAGA GCACTATGGC GGGACTTGCC
TTGTGTCGCG CCTTGGGGGA CGACGTGTCC ACCGGGTACG CGCTCTTCCA GACAGGTTCC
ATCTACCTGC AGACGCTCGC CTATAGGGAG TCCCTGACGT ACTTCACGGA GGCGGCTCGG
TACTTCGCCG ACGTGCATCC GCCTATGGAG GGTGCTGCGC ACGCCTCCAG CGCGCGGGCT
CTGTTGGGAC TCGGCGAGCC CAGCGGCGCG CTGGAGCATG CCGAGCGTGC GGTGAGTGTG
CTGCGGGGGA CCGCCGACAC CTGGCAGCAC GCGACGGCGC TCGCGGTGCT GGCTGACGTG
CTGGATGTCG CCGGGCAGCC GGAGCGGGCG CGGGGTTGTC GGGAGGAAGC GTTGGCGATG
TTCGTCGTGA TCGGCGCGCC GGAGGCTGAA CGTATCAGGG TCATGCTCTC CGGCTCGGAA
GCAACCTTCA CGGCGGTTCG AACGTCGCGC TAA
 
Protein sequence
MVVQYGLLGP VRALRVGAAG EPAEELKVGS PQQQAVLALL ASRAGRVATA DELIEGLWGD 
EPPEGALGTV RTYAFRLRKV FGAEAIASIA GGYALRAERS HVDLFTFEDH VAAAEQRRMS
GDVVGARGEL ANALALWRGT PLAGIPGPYA ESLRATLLER RTAAQESRIA LDLALGRVGD
AVSELTVLTA EHPLREGLRA QLMLALYQTG RQAEALGVYA DTRRLLRKEL GVDPGAELGE
LHQRILRADP SLARPSADSA EIAPRAAEAS AGTALEKPPT IPAQLPADTA DFTGRENLTR
VLAARISTTV GQSVAVCALS GLGGVGKTAL AIHLAHSVRE EFPDGQLYVD LRGGDPTPAD
PAPVLAAFLR GLGISEGETA PGLEERAAAY RSALAGRRVL IVLDNARDAA QVRPLLPGAP
GCAVIVTSRP KLTGLAGATF ADLDVLDPGE AMNMFTRIVG EERLGMEHTA AIDVVSLCGY
LPLAVRIAAA RLASRPRWRI GSLAARLSDE RRRLGELAVG DLAVRAAFEL GYHQLSPAQA
DVFRRLSQLN SADVSAAAAA ALLGQDEADT EEVLESLVDA AMLESSSPGR YRYHDLLRLY
AREQYAAEGG LDDAGFVSLL DFYLASMRRL RRIPVEELGL FPTRSSGREF DSVEAGVRWI
ADEGSCVDAV FNRRVLTAPP CSVGVAPLTL AVELLDHLVS LPGIERYADE LSAAARNAAA
AAVESGDVRS EARVRHSLAR ILYATYQIEA AAEEAERSHR AAESVGGDET QADAVNLLAM
TYADLGRDAE AIALYERAVD VSREFGDVAS EAAARQNMAR SLLVLGRTEQ ALQSTMAGLA
LCRALGDDVS TGYALFQTGS IYLQTLAYRE SLTYFTEAAR YFADVHPPME GAAHASSARA
LLGLGEPSGA LEHAERAVSV LRGTADTWQH ATALAVLADV LDVAGQPERA RGCREEALAM
FVVIGAPEAE RIRVMLSGSE ATFTAVRTSR