Gene Caci_2572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2572 
Symbol 
ID8333921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2914536 
End bp2917571 
Gene Length3036 bp 
Protein Length1011 aa 
Translation table11 
GC content72% 
IMG OID644955725 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003113331 
Protein GI256391767 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGTGT ACTCGATCCT CGGTCCGTTG GAGGTGCGGG TTGGTGGTGT GCTGGTTGAG 
GTGGCGCGGC CTCGGCGGCG GGCTGTGCTG ACGTATTTGT TGCTGCATGC GAATGACCGG
GTTGATGTCG AGCAACTGAT TGATGCTTTG TGGGCGGAGG GGACGCCGCG GACGGCGCGG
GCGCAGATTC ACACGGCGGT GTCGGCTTTG AAGGCGGCGC TCCCGGAGGA GTTGCGGACG
GGGCTGGTTT CGGAGGCGAC GGGGTACCGG CTGTGGGTCG GAGCCGAGGA TCTGGATCTG
GCGGTTTTCC GTCAGCGGCT GGCGTCGGCG CGGGGGTGTC CGGGGATCGG GGCTGAGTCC
GAGCAGGCGC GGCGGGCGTT GCGGTCGGCG TTGGCGTTGT GGCGCGGGCC GGCGTTGGCG
GGCGTGGACG CGCCGTTCGT CGAACCGGCC CGCGCCAGGC TGGAGGAGGA GCGGTTCAGC
GCTTATGAGG CGTTGGCCGA CGGGGAGATG GCCGCCGGCC GGCACGCCGA GCTGATCCCG
CTGCTCACGG GACTGCTCAA CGAGTACCCG GCGCGGGAGT CGATCGTACG GCGGCTGGCG
CTGTCGCTCT ATCGCGCGGG CCGCAAGACG GACGCGCTGG CGGTGGTGCG GCGGCTGCGC
GTGCTGCTGG CCAAGGAGTA CGGACTCGAT CCGGAGAAAG GCATCGTGGA CCTGGAGAAC
GCGATGCTCC GCGGTGATCT GGCGCTTGAT GCCCGGGAGG TCGGGTCCGA GGGCAGATCC
GAGGGCAGGT CCGAAGGGGT CAGGTCCGAA GGAAGGCCCG ACGCCGACTT GCGCGCTGGA
GCCGCCGTCG CGAGCGAGGT CGCCGCTCCG GAAGCGGCGA CTGGCGGCGA GAAATCAGCA
CCGCCGTCCC CCGCACCGCC GAGGCAGGCC GCACCGCTTC AGCCCACCTG GCCGCGCCCG
GCCCAACTCC CCCCGCCAAC CGCGGGCTTC GTCGGCCGCG ACCACGAGCT GACCCGGCTC
GCCCGCCTGC TCACGTCGGA ATCCGACGCG CCACGCGCGG CGGCGGTCAC CGGTCCGGCC
GGCGTCGGCA AAACGTCCCT GGCGTTGATC TGGGCGCACG AGCATGCCGG CGCCTTCCCC
GACGGGCAAC TGTTCGTCGA CCTTCACGGC TACGACCACA GCGAGGCCGA GAGCCCCGAA
GGCGTGTTGG AGCGCTTCCT GCTGGCCCTG GGCATACCCG GCCACCAGAT CCCGCCGGGG
CTGCCCAAGC GCGAGGACCT GTTCCGCTCG GCGATGGCAG AGCGCCGCAT GCTCCTGGTC
CTGGACAACG CGCGTGACTA CCGCCAAATC AGCCCCCTGC TTCCTGGCTC CGCCCACACC
CGCACGCTGA TCACCAGCCG TATCCGGCTG GGCAGCCTGG TCGCCGACAC CGGCGCGCTC
CCGGTCCCCC TGGACGTCCT GCCCCTCGAG GAATCGGTCG AGGTCCTGAC GCGCATCGTC
GGCGCCGAAT CGGTGGCGGC GGCGCCGCAG TCCGCGCGTG ACCTGGCCCG CCTGTGCGGC
GGTCTCCCGC TCGCACTGCG CATCTCGGCC GTCCGGCTGC TTGAGGAACC GGCCGCCGGG
CTGACCGGGC TGGCCACCGA ACTCAGCCCG GAAGCCGACC GCCTGCACGG CTTGGGCCTG
CTCGACGGCG GCCACACCGT CTCCCACGCG CTCGAGAACT CCTGCCGCCG GCTGACCGCC
GCGCAGATCC GCCTGTTCCG CCTCCTGTGC CTGCACCCCG GCGACAGCGT CGGCGCGGCG
GCGGCGCAAG CCATGGTCGA TCAAGGCGAC CTGCGCTTCA CGGCGCATGT CGAAGTGCGC
CATCTGCTGC GTGTCCTGGA GACGGTGCAC CTGGTCGACC GCACCGCCGC CGACCGCTAC
CGGATGCACG ACCTCGTGCG GCTCTACGGC CGGGGTCTGT CAGGCCTCGA CGATGCCGAC
CAGACGCAGG ACTCCCTGGC TCTCCAACGC CTCCTCGACT GGTACATCAA CGTCGCCCAA
GCCGCCCACC GGGTTCTCGC CCCCGCCATG CCAGCGCTCC CGATGGACGT CCGCCACAGC
CTCACCGACA ACCCGACGCC TTTCCCTGAC GAGTCCGCCG CGCTGGACTG GTTCGATCAG
GAGGCTGCGA ACCTGATCGC GCTGACGAAG TCAGCAGCCG AACACGGCGA CCACCGCGGA
GTCTGGCAGC TCGCCATCGC GCTGGGCGCC TACCTTTCGC GCCGCCACCG TGTAGACGCC
CTGGTCCAGA CCCAGGCACT CGGCGAGCAG GCCGCGCTGG CCGAGGCGCA CCACGCGGCA
GCCGCGGCGC TGGCGAACAA CCTCGGCATC GCCCACGCCA TGCGCCGCGA CCCCGAGGCG
GCGCAGCAGC CGTTCGAGCG AGCCGTCGCC GCCTATCGCG ACCTCGGCGA CCGCCAGCGT
GCCGCGCAGA TCAGCGCCAA CCTCGGAAGT CTGCGCTACG ACCTGGGCAT GCCGCACGAA
GCCGCCGCCG CCCACAGCGC CGCCATCGAG ACCCTGCGCG AGTTCGGCGA CAGCCCGGCG
CTGTCCGCCG TCCTGGCCAA CCTCGGCCTG ACCGTCGGCG ACCTGGGCCG GCACGAACAA
GCCCGCGACC TGTTCCGTGA GGCGATCGGC GTCGCAGAGG CCTGCGGTTC GGACTACCGG
GCCGGCTACG CGCGCAGCCA GCTGGCCTGG ACGCTGCTGC GCCTCGGCGA GGCCGACGAA
GGCCTGGAGC TGAGCCGCGA AACGCTCGCC TACGCGTTGA CCATCGGCGA CCCGCTGCTG
GCCGGCCGGA TGCACGACCA GATCGGCATC GCCCAGGCGA TGCGCGGAGC CTGGGACGAG
GCGCGCGCCG CGTGGGAGGA AGCCGTCGCG ACGCTCACCG GGATCGGCAG CTCGGAAGCC
GACGTCGTCC GGGCCCGCCT GCGCGGCGAA CCCGACGCGC TCCCGGCCGT GGGAGCCGAC
GGGAAGCCGA ACGTCGCTGC TCTACCGAGT AGATAG
 
Protein sequence
MPVYSILGPL EVRVGGVLVE VARPRRRAVL TYLLLHANDR VDVEQLIDAL WAEGTPRTAR 
AQIHTAVSAL KAALPEELRT GLVSEATGYR LWVGAEDLDL AVFRQRLASA RGCPGIGAES
EQARRALRSA LALWRGPALA GVDAPFVEPA RARLEEERFS AYEALADGEM AAGRHAELIP
LLTGLLNEYP ARESIVRRLA LSLYRAGRKT DALAVVRRLR VLLAKEYGLD PEKGIVDLEN
AMLRGDLALD AREVGSEGRS EGRSEGVRSE GRPDADLRAG AAVASEVAAP EAATGGEKSA
PPSPAPPRQA APLQPTWPRP AQLPPPTAGF VGRDHELTRL ARLLTSESDA PRAAAVTGPA
GVGKTSLALI WAHEHAGAFP DGQLFVDLHG YDHSEAESPE GVLERFLLAL GIPGHQIPPG
LPKREDLFRS AMAERRMLLV LDNARDYRQI SPLLPGSAHT RTLITSRIRL GSLVADTGAL
PVPLDVLPLE ESVEVLTRIV GAESVAAAPQ SARDLARLCG GLPLALRISA VRLLEEPAAG
LTGLATELSP EADRLHGLGL LDGGHTVSHA LENSCRRLTA AQIRLFRLLC LHPGDSVGAA
AAQAMVDQGD LRFTAHVEVR HLLRVLETVH LVDRTAADRY RMHDLVRLYG RGLSGLDDAD
QTQDSLALQR LLDWYINVAQ AAHRVLAPAM PALPMDVRHS LTDNPTPFPD ESAALDWFDQ
EAANLIALTK SAAEHGDHRG VWQLAIALGA YLSRRHRVDA LVQTQALGEQ AALAEAHHAA
AAALANNLGI AHAMRRDPEA AQQPFERAVA AYRDLGDRQR AAQISANLGS LRYDLGMPHE
AAAAHSAAIE TLREFGDSPA LSAVLANLGL TVGDLGRHEQ ARDLFREAIG VAEACGSDYR
AGYARSQLAW TLLRLGEADE GLELSRETLA YALTIGDPLL AGRMHDQIGI AQAMRGAWDE
ARAAWEEAVA TLTGIGSSEA DVVRARLRGE PDALPAVGAD GKPNVAALPS R