Gene Caci_1981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1981 
Symbol 
ID8333324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2238021 
End bp2241122 
Gene Length3102 bp 
Protein Length1033 aa 
Translation table11 
GC content69% 
IMG OID644955130 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003112742 
Protein GI256391178 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0609683 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTACA CCGTGCTCGG GCCTGTCGGC ATTCGGAGTC ACGATGTTTT TCATGCTGCC 
GGCACGGCGA AGGAACAGGG GGTCTTGGCG ATTCTGCTGA TGGAGCGGGG GCATTCCGTC
TCTACGCAGA CACTTGCCGA TCGGCTGTGG GAGCGGCCTC CGGAGCAGTT TCGGGCGACG
TTGCAGGCGC ACATTTCGCG GTTGCGGCGG CGGTTGCGGG AGGCCTCTGA GCAGGCGGAA
GTCATCGCCA GCAATCAGGC CGGCTATCGC ATCGACGTTC CGGCGGACCA GGTCGACGTG
CATTATTTCG ATCTTTTGGT TTCGCGGGCT CAGGCGCACG CCGGGCAGGA TCCGGACTCG
GCGCGGGAGT TGCTGCGCGA GGCCGAGGGG CTGTGGAACG GCGAGCCGTT GGCGGGGTTG
CCTGGGACGT GGGCCGAGAC CATGCGGCGC GTCCTGACCG ACAAGCGCCG CACCGCGTTG
CTCAAGCGCC TCGAGCTGGA TCTCCAGACC GGCGGCAATG CCGATGACGC TGTCGCCGAG
CTGACCGAGC TGGCTTCGGG CAGCCGGATC GACCAGCGCG TCATCGAGCT GCTGATGATC
GCCTTGGACA GCGCCGGACG TCCCGGCGAT GCCCTGACGC TCTATCACGA GGTCCGCATC
CGCCTTCGCG ACGAGGCAGG CACGGACCCC CGCGCCGAAC TCCGCCAGCT CCACCAGCGC
CTGCTCAACG GCTCCGCGCA GCACCCGGCG GCGGCAGCAC CCGCCCAGCC GGTCACGCCC
CGCGCCATCG ACACCCTCGA CCCCGATCCG CCCTACGTCG CAGGACGCGA ACAAGAACTC
GCCGCGATCC TCGCTGCCGT CGCCGCGGAC CTGCGACCGG GCAAACGGGG CGCGACCTTC
CTCATCGACG GCTTGGCCGG CATCGGCAAG ACGACGCTCG CCCTCCAGGC GGCGCACCTC
CTGCGCTCAC ACTGCCCCGA CGGCGCGCTC CAGCTGAACC TGCACTCGCA CGATCCGTAC
CTCCCGCCCC TGGACCAGCG CCAAGCCCTG ACCCAGCTCC TGGACGCGAT CGGCACGCCC
TACCGCGAAC TGGCCCGCGC CGACACCGTG CCAGCGCTCG GCGCGCTATG GCGAAAACGC
ACCAGCGGCC GCCGCCTGCT GATCCTGCTC GACGATGTCC TGGACACCGC CCAGATCGAG
CTGCTGATCC CCGCGACAGC CGGCACGATC GTCCTGATCA CCTCGCGCCG CCGCCTGACC
GGGACGCCCG GCAACCGCCA GTACACCCTC GGTCCGCTGC CCGACTCCGC AGCCACAGCG
CTCCTGTCTC ACATCACTGA CAGGACCCTG CCAGAGGACG ATGACCTCGC CAGCTTCACC
CAGTGCTGCG GCGGTCTGGC TCTGGCGATC ACGGTCGCCG CCGGCCACCT GCGCAGCCGA
CCGGTCTGGA CGGTCGGCGA CCTGGTCTCG CGCCTGTCCA CGACCTCGCA GTCGCTCGCC
GACGACCCCC TGACCAGCCC GATCCACACC GCCTTCGCAA TGTCCTACCA AACCCTCAGC
CCCACACTGC GGGACCTGCT CCGCTACATC GCCGCCCACC CCGGTCCCGA CATCGGCCTG
CCGGCCGCCG CCGCGATGTC CGGCGCGGCG CTGGCCGACA CCGACATCAG GCTCGACGCC
CTGGTGGACC ACCGGCTCCT GAACCTCGCG AGCGCGCACC GCTACCGCCT GCACAACCTG
CTCCGCCAAT ACATCCTGGT CCAGGGAGAC GAGCAGCAGA ACCTTGATAA CCGCCAGGCA
GTCGGCCGCG CCATAACGTT CTACAAAGCC GCCGCCGCAC GCGCCGATCA CGCGCTCCAG
CCTCGCCGCC GCGAACTGCA CTACCCCGCA GCCTCCGCTC AGGTCGAGGG CGTCAACCTC
GACACCACCG AGCAGGCGCG CACCTGGCTC GACACCGAGC ACCTCAACCT GGCGGCGGTC
ACCACCTGGT CGGTCCAACT CGGCCGCGGA ATACAGGTCG GGCTGATCCC GCACGTGCTC
GCCCAACACC TGGACCGGCG CGGACGCTGG CCGCAAGCGC TCGAGCACAT CGATGAACTG
CTCGCCGCCC CGAAAGGCGA CTTCCCCGGC GGCAACCCGG ACGCCGTCAC CGCGTGTCTG
CTGACCGACC AGGCCGGGCT CCTCATCCGC GCCAACCAAC TCGACGTCGC AGTGGACGCC
GCCAACGCCG CGCTGGCCAT CTGGAACGCC GCCAACGACC GCTACGGCCA AGCCGACGCC
CACTTCCAGA TCGGACGCGC CCACGACGCC GCCGAACGCC ACGACGAAGC CCTGCAAGCC
TTCCGCACGG CAGCCGCGCT CTACGAGAGC CTCGGCGACC ACACCCGGGT GGCAGTAGCC
GAGGACCAAT GGGCCGTCAC CGCGTTCAAG CAAGGCCACC TGGACGAAGC CTTCTCCCGT
GGCCACCACG CGCTGGACAT CGCCCGGCAG CAGAACGACC TCGCGGCCAT CGCCGACGTC
CTCAACAACT TGGGCGAAAT GCACCGCCAG GCCGACCACG ACCAGGAGGC GCTCGCCTTC
TTCCAAGAGG CCCGCACCCT GACCGCAGCG CTCGGCGACC CGCTCATCAC CGCTGTGCTC
GGCTACAACA TCGGCGCCGT CTACGAACAC GCCGGCGACT ATCACCGTTC CCTGACATCG
ACGCGAACCG CGCTCCTGCA GTTCCGCGAA CTCAATGACC ACCGCAGCGA AATCGAGTGC
CTCATCCTGC TCGCCACCGC GCACATCAAC CTCGGAGACC GCAACGCAGC GTTCGAAGAA
ACCCGGCACG CAATCGACCT CGCCGAGCAA ACGCACGACC AGCTACGGCT GGCGCAAGTC
CGCCTGGCGC AGGGCACCAT GCTTGCCGCC CGCGGCGATA TCCAGGGCGC GATCGAGGCG
TGCGAATCAG CCCTCGATAT TGCCGAACAG ATCGGCGCCG TCGCCGAACA GAGCCAGGCG
CACCGTTCCC GTTGCGAGGC GTACACGAGT CTTGGCCTGC ACGATCGCGC CCAAAGCCAT
CTCCAGCAAG CAGAATCGCA AGGCGGCCCG ACCACTGAAT GA
 
Protein sequence
MQYTVLGPVG IRSHDVFHAA GTAKEQGVLA ILLMERGHSV STQTLADRLW ERPPEQFRAT 
LQAHISRLRR RLREASEQAE VIASNQAGYR IDVPADQVDV HYFDLLVSRA QAHAGQDPDS
ARELLREAEG LWNGEPLAGL PGTWAETMRR VLTDKRRTAL LKRLELDLQT GGNADDAVAE
LTELASGSRI DQRVIELLMI ALDSAGRPGD ALTLYHEVRI RLRDEAGTDP RAELRQLHQR
LLNGSAQHPA AAAPAQPVTP RAIDTLDPDP PYVAGREQEL AAILAAVAAD LRPGKRGATF
LIDGLAGIGK TTLALQAAHL LRSHCPDGAL QLNLHSHDPY LPPLDQRQAL TQLLDAIGTP
YRELARADTV PALGALWRKR TSGRRLLILL DDVLDTAQIE LLIPATAGTI VLITSRRRLT
GTPGNRQYTL GPLPDSAATA LLSHITDRTL PEDDDLASFT QCCGGLALAI TVAAGHLRSR
PVWTVGDLVS RLSTTSQSLA DDPLTSPIHT AFAMSYQTLS PTLRDLLRYI AAHPGPDIGL
PAAAAMSGAA LADTDIRLDA LVDHRLLNLA SAHRYRLHNL LRQYILVQGD EQQNLDNRQA
VGRAITFYKA AAARADHALQ PRRRELHYPA ASAQVEGVNL DTTEQARTWL DTEHLNLAAV
TTWSVQLGRG IQVGLIPHVL AQHLDRRGRW PQALEHIDEL LAAPKGDFPG GNPDAVTACL
LTDQAGLLIR ANQLDVAVDA ANAALAIWNA ANDRYGQADA HFQIGRAHDA AERHDEALQA
FRTAAALYES LGDHTRVAVA EDQWAVTAFK QGHLDEAFSR GHHALDIARQ QNDLAAIADV
LNNLGEMHRQ ADHDQEALAF FQEARTLTAA LGDPLITAVL GYNIGAVYEH AGDYHRSLTS
TRTALLQFRE LNDHRSEIEC LILLATAHIN LGDRNAAFEE TRHAIDLAEQ THDQLRLAQV
RLAQGTMLAA RGDIQGAIEA CESALDIAEQ IGAVAEQSQA HRSRCEAYTS LGLHDRAQSH
LQQAESQGGP TTE