Gene Information |
Locus tag | Caci_2572 |
Symbol | |
ID | 8333921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 2914536 |
End bp | 2917571 |
Gene Length | 3036 bp |
Protein Length | 1011 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644955725 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003113331 |
Protein GI | 256391767 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGTGT ACTCGATCCT CGGTCCGTTG GAGGTGCGGG TTGGTGGTGT GCTGGTTGAG GTGGCGCGGC CTCGGCGGCG GGCTGTGCTG ACGTATTTGT TGCTGCATGC GAATGACCGG GTTGATGTCG AGCAACTGAT TGATGCTTTG TGGGCGGAGG GGACGCCGCG GACGGCGCGG GCGCAGATTC ACACGGCGGT GTCGGCTTTG AAGGCGGCGC TCCCGGAGGA GTTGCGGACG GGGCTGGTTT CGGAGGCGAC GGGGTACCGG CTGTGGGTCG GAGCCGAGGA TCTGGATCTG GCGGTTTTCC GTCAGCGGCT GGCGTCGGCG CGGGGGTGTC CGGGGATCGG GGCTGAGTCC GAGCAGGCGC GGCGGGCGTT GCGGTCGGCG TTGGCGTTGT GGCGCGGGCC GGCGTTGGCG GGCGTGGACG CGCCGTTCGT CGAACCGGCC CGCGCCAGGC TGGAGGAGGA GCGGTTCAGC GCTTATGAGG CGTTGGCCGA CGGGGAGATG GCCGCCGGCC GGCACGCCGA GCTGATCCCG CTGCTCACGG GACTGCTCAA CGAGTACCCG GCGCGGGAGT CGATCGTACG GCGGCTGGCG CTGTCGCTCT ATCGCGCGGG CCGCAAGACG GACGCGCTGG CGGTGGTGCG GCGGCTGCGC GTGCTGCTGG CCAAGGAGTA CGGACTCGAT CCGGAGAAAG GCATCGTGGA CCTGGAGAAC GCGATGCTCC GCGGTGATCT GGCGCTTGAT GCCCGGGAGG TCGGGTCCGA GGGCAGATCC GAGGGCAGGT CCGAAGGGGT CAGGTCCGAA GGAAGGCCCG ACGCCGACTT GCGCGCTGGA GCCGCCGTCG CGAGCGAGGT CGCCGCTCCG GAAGCGGCGA CTGGCGGCGA GAAATCAGCA CCGCCGTCCC CCGCACCGCC GAGGCAGGCC GCACCGCTTC AGCCCACCTG GCCGCGCCCG GCCCAACTCC CCCCGCCAAC CGCGGGCTTC GTCGGCCGCG ACCACGAGCT GACCCGGCTC GCCCGCCTGC TCACGTCGGA ATCCGACGCG CCACGCGCGG CGGCGGTCAC CGGTCCGGCC GGCGTCGGCA AAACGTCCCT GGCGTTGATC TGGGCGCACG AGCATGCCGG CGCCTTCCCC GACGGGCAAC TGTTCGTCGA CCTTCACGGC TACGACCACA GCGAGGCCGA GAGCCCCGAA GGCGTGTTGG AGCGCTTCCT GCTGGCCCTG GGCATACCCG GCCACCAGAT CCCGCCGGGG CTGCCCAAGC GCGAGGACCT GTTCCGCTCG GCGATGGCAG AGCGCCGCAT GCTCCTGGTC CTGGACAACG CGCGTGACTA CCGCCAAATC AGCCCCCTGC TTCCTGGCTC CGCCCACACC CGCACGCTGA TCACCAGCCG TATCCGGCTG GGCAGCCTGG TCGCCGACAC CGGCGCGCTC CCGGTCCCCC TGGACGTCCT GCCCCTCGAG GAATCGGTCG AGGTCCTGAC GCGCATCGTC GGCGCCGAAT CGGTGGCGGC GGCGCCGCAG TCCGCGCGTG ACCTGGCCCG CCTGTGCGGC GGTCTCCCGC TCGCACTGCG CATCTCGGCC GTCCGGCTGC TTGAGGAACC GGCCGCCGGG CTGACCGGGC TGGCCACCGA ACTCAGCCCG GAAGCCGACC GCCTGCACGG CTTGGGCCTG CTCGACGGCG GCCACACCGT CTCCCACGCG CTCGAGAACT CCTGCCGCCG GCTGACCGCC GCGCAGATCC GCCTGTTCCG CCTCCTGTGC CTGCACCCCG GCGACAGCGT CGGCGCGGCG 
GCGGCGCAAG CCATGGTCGA TCAAGGCGAC CTGCGCTTCA CGGCGCATGT CGAAGTGCGC CATCTGCTGC GTGTCCTGGA GACGGTGCAC CTGGTCGACC GCACCGCCGC CGACCGCTAC CGGATGCACG ACCTCGTGCG GCTCTACGGC CGGGGTCTGT CAGGCCTCGA CGATGCCGAC CAGACGCAGG ACTCCCTGGC TCTCCAACGC CTCCTCGACT GGTACATCAA CGTCGCCCAA GCCGCCCACC GGGTTCTCGC CCCCGCCATG CCAGCGCTCC CGATGGACGT CCGCCACAGC CTCACCGACA ACCCGACGCC TTTCCCTGAC GAGTCCGCCG CGCTGGACTG GTTCGATCAG GAGGCTGCGA ACCTGATCGC GCTGACGAAG TCAGCAGCCG AACACGGCGA CCACCGCGGA GTCTGGCAGC TCGCCATCGC GCTGGGCGCC TACCTTTCGC GCCGCCACCG TGTAGACGCC CTGGTCCAGA CCCAGGCACT CGGCGAGCAG GCCGCGCTGG CCGAGGCGCA CCACGCGGCA GCCGCGGCGC TGGCGAACAA CCTCGGCATC GCCCACGCCA TGCGCCGCGA CCCCGAGGCG GCGCAGCAGC CGTTCGAGCG AGCCGTCGCC GCCTATCGCG ACCTCGGCGA CCGCCAGCGT GCCGCGCAGA TCAGCGCCAA CCTCGGAAGT CTGCGCTACG ACCTGGGCAT GCCGCACGAA GCCGCCGCCG CCCACAGCGC CGCCATCGAG ACCCTGCGCG AGTTCGGCGA CAGCCCGGCG CTGTCCGCCG TCCTGGCCAA CCTCGGCCTG ACCGTCGGCG ACCTGGGCCG GCACGAACAA GCCCGCGACC TGTTCCGTGA GGCGATCGGC GTCGCAGAGG CCTGCGGTTC GGACTACCGG GCCGGCTACG CGCGCAGCCA GCTGGCCTGG ACGCTGCTGC GCCTCGGCGA GGCCGACGAA GGCCTGGAGC TGAGCCGCGA AACGCTCGCC TACGCGTTGA CCATCGGCGA CCCGCTGCTG GCCGGCCGGA TGCACGACCA GATCGGCATC GCCCAGGCGA TGCGCGGAGC CTGGGACGAG GCGCGCGCCG CGTGGGAGGA AGCCGTCGCG ACGCTCACCG GGATCGGCAG CTCGGAAGCC GACGTCGTCC GGGCCCGCCT GCGCGGCGAA CCCGACGCGC TCCCGGCCGT GGGAGCCGAC GGGAAGCCGA ACGTCGCTGC TCTACCGAGT AGATAG
|
Protein sequence | MPVYSILGPL EVRVGGVLVE VARPRRRAVL TYLLLHANDR VDVEQLIDAL WAEGTPRTAR AQIHTAVSAL KAALPEELRT GLVSEATGYR LWVGAEDLDL AVFRQRLASA RGCPGIGAES EQARRALRSA LALWRGPALA GVDAPFVEPA RARLEEERFS AYEALADGEM AAGRHAELIP LLTGLLNEYP ARESIVRRLA LSLYRAGRKT DALAVVRRLR VLLAKEYGLD PEKGIVDLEN AMLRGDLALD AREVGSEGRS EGRSEGVRSE GRPDADLRAG AAVASEVAAP EAATGGEKSA PPSPAPPRQA APLQPTWPRP AQLPPPTAGF VGRDHELTRL ARLLTSESDA PRAAAVTGPA GVGKTSLALI WAHEHAGAFP DGQLFVDLHG YDHSEAESPE GVLERFLLAL GIPGHQIPPG LPKREDLFRS AMAERRMLLV LDNARDYRQI SPLLPGSAHT RTLITSRIRL GSLVADTGAL PVPLDVLPLE ESVEVLTRIV GAESVAAAPQ SARDLARLCG GLPLALRISA VRLLEEPAAG LTGLATELSP EADRLHGLGL LDGGHTVSHA LENSCRRLTA AQIRLFRLLC LHPGDSVGAA AAQAMVDQGD LRFTAHVEVR HLLRVLETVH LVDRTAADRY RMHDLVRLYG RGLSGLDDAD QTQDSLALQR LLDWYINVAQ AAHRVLAPAM PALPMDVRHS LTDNPTPFPD ESAALDWFDQ EAANLIALTK SAAEHGDHRG VWQLAIALGA YLSRRHRVDA LVQTQALGEQ AALAEAHHAA AAALANNLGI AHAMRRDPEA AQQPFERAVA AYRDLGDRQR AAQISANLGS LRYDLGMPHE AAAAHSAAIE TLREFGDSPA LSAVLANLGL TVGDLGRHEQ ARDLFREAIG VAEACGSDYR AGYARSQLAW TLLRLGEADE GLELSRETLA YALTIGDPLL AGRMHDQIGI AQAMRGAWDE ARAAWEEAVA TLTGIGSSEA DVVRARLRGE PDALPAVGAD GKPNVAALPS R
|
| |
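
The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is illustrative only and is not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the listed gene length includes the terminal stop codon; both assumptions are consistent with the values shown. The function names (`span_length`, `expected_protein_length`, `gc_percent`) are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information table for Caci_2572.
START_BP = 2914536
END_BP = 2917571
GENE_LENGTH_BP = 3036
PROTEIN_LENGTH_AA = 1011


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the length
    includes one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring the whitespace that
    separates the 10-base blocks in the Sequence section."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Consistency checks between the table fields.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

To compare against the GC content field, `gc_percent` can be called on the Gene sequence value pasted in as a string; the whitespace between blocks is removed before counting.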