Gene Caci_4249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4249 
Symbol 
ID8335603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4820162 
End bp4823488 
Gene Length3327 bp 
Protein Length1108 aa 
Translation table11 
GC content72% 
IMG OID644957352 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003114954 
Protein GI256393390 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.319347 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTTCC GTGTTCTGGG ACCCCTGGAG ATCGTCGACG ACGGGGTGCC GATACGGCTC 
GGCGGGCTGC GGGAACAAGC GGTCATGGCC ATGTTCCTGG TGCAGCCCGA CACCATCATC
CCGGTCGAGC GCCTCGTCGA CGCGGTCTGG GGCGACCGGC CGCCGGCGAC CGCGCGGGCG
CAGATCCAGA TCTGCGTCTC GGCGCTGCGC CGGCTGCTCG GCGACCCCGA GCGGATCCGG
ACGCGCAATC CCGGCTACCT GTTCCATCTC GGGACCGACG TGCTGGACGC GCGGTTGTTC
GAGCAGATCG CGGCGAAGGG CCACACGCTG CTGGCCGAGG GCCGGCGCAC CGAGGCCGCC
GCGGAGTTCC GCAAAGCCCT GTCACTGTGG CGGGGACCGG CGCTGGCCAA CGTGGCCGGG
GACGTCGTCC AGCACAGCGT GGCGCACCTG AACGAGCGCC GGCTGAACGT GCTGGAGGTG
TGCCTGGAGG CCGAGCTGGA GGCCGGGACG CGCGGCGATC TGGTCGGGGA ACTGGTCCGG
GTCTGCCACG AATACCCGCT GAACGAACGC TTCCGGCTGC TGCTCATGAC CGCGCTGTAT
CGCGCCGGAC GGCAGGCCGA CGCGCTGGAG GTGTATCGCG CCACACGCGG AACACTGAAG
GAGGAGCTGG GGATCGAGCC CGGTCCGGAG CTACGGCGGC TGCAGCAGGC GATTCTGAAC
GGCGAGGTCC ACGGACAGGC AGCGGCTTCG ACACCATCTC AGTCGGCCAC GACGGCGCAG
GCGCGGACAC CGACATCGGC GCCGACAACA ACGCCGAAGC CCGCAGCCGA GGCGCCGCCC
CCGCCGGTAT CGCAGCCTTC GACCGCACCC GTCCCATCGC AAGACCCTGC CACGCCCGCC
GCGCGCCGTC CCACAACCCG TGCCCAGCCT CCGACGCCGC GGCTCCTGCC GCCGGCCATC
CCGGATTTCA CCGGCCGCGC CAAGGCGATC GCGCAGATCG TCTCGGAGAT GCCGGTGGTC
CACACCCTCG ACGGCCCGGC GGCGCTGCCG GTCACCGTTC TGTACGGCCA AGGCGGCGTC
GGCAAGACCA CGCTCGCGGT CCACGTCGCG CACCGGCTCG CCGAGTCCTA TCCCGACGGC
CAGCTCTACG CGCGCCTTCG CGACGGAGAC CAGTCGGTCG CGCCGGCGGA CATCCTGGAA
CGCTTCCTGC GCTCCCTCGG CGTCGCAGGG CCCTCGCTGG CCGACGGCCT GGAGGAGCGC
GCCGAGATGT ACCGCAATCT GCTGGGCGAT CGGCGCGTCC TGGTAGTACT CGACGACGCC
ATGACCGAGC ACCAGGTACA GCCGCTGCTG CCGGGCGGAT CGGGCTGCTC GGTGATCGTC
ACCAGCCGGC GGCGGCTCAC CGGCGTGCCC GCGGCGGTGC GCCTGGAGGT CGGCACGTTC
AGCGACGACA GCGCGGTGGC GCTGCTGAGC CGGGTCGCCG ATCCCGCGCG CATCCACGCC
GAACCCGAGG CGGCGGCTCA GTTGTGCCGC CTGTGCGGAC ATCTGCCGCT GGCGCTGCGC
ATCGTCGCCG CGCGGCTGGC CGCACGTCCG CACTGGAGCG TGCGGGCCTT GGTGGACCGG
CTGATCGACG AATCGCGGCA GCTCGACGAG CTGAACCACG AAGGCGTCGG GATGCGCGCC
AGCATCTCGG TCACCTACGC CGGGCTGTCG GCGGACGCGC GGCGCCTGTT CCGGCGGCTG
GCACTGTTCG GCGGACCGGA CTTCGCCGCC TGGGTCGCGG CGCCGCTGCT GGACGCCGAC
GTCTGGCGCG CGGAGGACCT GCTGGAGGAG CTGACCGAGG CCTACCTCAT CGACATCGAG
CAGGGTCCGG ACGGCGAGCC GACGCGCTAC CGGTTCCACG ACATCGTGCG CCCCTTCGCC
CGCGAGCGGC TGTTGGCCGA GGATCCGCCA GGCGATCGCC ACCAGGCGTT GGAGCGCTTG
ATCGGCGCGC ACCTTTTCCT GGCCAAGCTG GCGCACGAGC GGGAGTACTC CGGGGACCAC
CTGCTGCCCG CCGACACCGC CACGACCTGG CCGCTGCCGC CGGAGGCGGT GGCGCCGCTG
ATCGCCGACC CGCTGACCTG GTTCGAGCGC GAGCGGCTGT CGCTGGTGGC GGCGGTGCGC
CAGGCCGCCG CGCGCGGGCT CGCCGACAAG GCGTGGAGTC TGGCGCTGTC CTCGGTCGCG
CTGTTCGAGG CGCGGTCCTA TTACGGCGAC TGGCGCGAGA CCCACGAGAC CGCGCTGGAG
GCGGTGATCC AGGCCGGCGA CCGGCGCGGC GAGGCGGCGA TGCGCTACTC GCTGGGCTCG
CTGCACATGT TCGAGGTCGA CAACGCCGGC GCCCGGCATC AGTTCGGGCT GGCCGCCGCC
ATCTATCAGG AGCTTGACGA CCGGTACGGC GCGGCGCTGG TGCTGCGCAA CGTCGCGGTG
CTGGACCGCC GCGAGGGCGA TCTGGACCGT GCGCTGGAAC GCTGGACGGA TGCGCTCGCC
ACGTTCTGCG AGGCAGGAGA CCGCGTCGCG GAGGCGTACG TCCTGAACAG CATCGCGCAG
GTCCATCTGG CGCGCGGCAA CGACAGCGCG GCGTTCGACC TGCTGACGCG CGCCGAGCTG
ATCTGCGCCG AGACCGGCGT GCGCCGCGTC GCAGCACAAG TACAGCTGCG TCTGGGTCAC
ATGTATCGCC ATCGCAACGA CATGGATCGG GCGCGCGCGG CGTATCAACA GGTGCTGGCC
GCGGTCCGGG AGACCGGCGA CCGCATCGGC GAGTGCCACG CCTTGATGGG TCTGGGCGCG
ACCGAGGCGG AGGACGGACG TCCGGGCCCG GCGGTCGACG TGCTGCGTCA GGCGCTGGAG
GCCGCCGAGG CGGTCGGGGA CAAGATCCTC GGCGGACGCG CGGCGTTGAC GCTGGCTCGG
GCGGAGCTGG CGGCCGGGCT GCTGGCCGAG GCCGCCGACG ACGCCGACCA CGCGGTCGAG
GCGCTGGGTT CCGGGCTCGC CTCGGCGCAC GCGCTGGTGT TGCGCGGACG GATCCGCGAC
GAGCGCGGCG ACGTCTCCGG GGCGGTGGGC GACTGGTGGC AGGCGGCCAC GGTCGTCACC
GCGCTGACGG TGGACGAGGC TCAGGATCTG GCCGGGGAGA TCGCCGCGTT GCTGGCGGAG
GTCACCGGCG GCGGCTCCGG CGGTGGTTCG GATGGTGGTT CCGGTGATGG CTCAAATGGT
GGTTCGGGTG ACGGTTCAGA CGATGGCTCA GCCGGTGATC CCGGTGGCGC CGCGATGGTG
GTTCAGGTGA CAGAACCCTC GGCGTGA
 
Protein sequence
MDFRVLGPLE IVDDGVPIRL GGLREQAVMA MFLVQPDTII PVERLVDAVW GDRPPATARA 
QIQICVSALR RLLGDPERIR TRNPGYLFHL GTDVLDARLF EQIAAKGHTL LAEGRRTEAA
AEFRKALSLW RGPALANVAG DVVQHSVAHL NERRLNVLEV CLEAELEAGT RGDLVGELVR
VCHEYPLNER FRLLLMTALY RAGRQADALE VYRATRGTLK EELGIEPGPE LRRLQQAILN
GEVHGQAAAS TPSQSATTAQ ARTPTSAPTT TPKPAAEAPP PPVSQPSTAP VPSQDPATPA
ARRPTTRAQP PTPRLLPPAI PDFTGRAKAI AQIVSEMPVV HTLDGPAALP VTVLYGQGGV
GKTTLAVHVA HRLAESYPDG QLYARLRDGD QSVAPADILE RFLRSLGVAG PSLADGLEER
AEMYRNLLGD RRVLVVLDDA MTEHQVQPLL PGGSGCSVIV TSRRRLTGVP AAVRLEVGTF
SDDSAVALLS RVADPARIHA EPEAAAQLCR LCGHLPLALR IVAARLAARP HWSVRALVDR
LIDESRQLDE LNHEGVGMRA SISVTYAGLS ADARRLFRRL ALFGGPDFAA WVAAPLLDAD
VWRAEDLLEE LTEAYLIDIE QGPDGEPTRY RFHDIVRPFA RERLLAEDPP GDRHQALERL
IGAHLFLAKL AHEREYSGDH LLPADTATTW PLPPEAVAPL IADPLTWFER ERLSLVAAVR
QAAARGLADK AWSLALSSVA LFEARSYYGD WRETHETALE AVIQAGDRRG EAAMRYSLGS
LHMFEVDNAG ARHQFGLAAA IYQELDDRYG AALVLRNVAV LDRREGDLDR ALERWTDALA
TFCEAGDRVA EAYVLNSIAQ VHLARGNDSA AFDLLTRAEL ICAETGVRRV AAQVQLRLGH
MYRHRNDMDR ARAAYQQVLA AVRETGDRIG ECHALMGLGA TEAEDGRPGP AVDVLRQALE
AAEAVGDKIL GGRAALTLAR AELAAGLLAE AADDADHAVE ALGSGLASAH ALVLRGRIRD
ERGDVSGAVG DWWQAATVVT ALTVDEAQDL AGEIAALLAE VTGGGSGGGS DGGSGDGSNG
GSGDGSDDGS AGDPGGAAMV VQVTEPSA