Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5691 |
Symbol | |
ID | 8337052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 6567279 |
End bp | 6570149 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644958795 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003116390 |
Protein GI | 256394826 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAGATCG GGGTTCTCGG CCCGCTTTCG GTGGCCGCCG AGGCCGGTGA GGTCCGGGTG ACCGGTCGCC GGAGACGGGC GTTGTTGGCG TTGCTGGCTC TACACGCCAA CCAGGTCGTC CACACCGGGC GGCTGATCGA GACCGCGTGG GGTTCGGCTG CCGCTCCGAC CAGCTCCGAC AGTCTCCCGA GCTACATACT CCGCCTGCGT CGGGCGCTCG GCGCGGAAAT CGGCGCGCGG ATCCTCACCC GCCCCGGCGG CTATACGCTC GAGCTGGCCG AGGACGAGCT GGATCTGATC CGGTTCCGAA CCCTGCGGTC GGCAGCCAAG GCCAAGGCGG ACTCCGGGGA CTGGCCCGGC TTCCACGCCC TGGCCGAACA AGCCCTGGCA CAGTGGCGTG GCGATCCGCT GGCCGACATC GTCACCGACG GTGAGATGCA GGAGGAGACG GAAGCGCTGT CGGCGGCGCA CCTGGAGCTC TGGCGCGACC GGCTCGACGC CGGCCTGTTC CTGGGCAGGC ACGCCGAGGT CGCCGCCGAG ACGGCGCCGC TGGTCGCGCG TCATCCGATG GACGAGCGGT TCCGAGAGCA GCGGATGCTC GCGCTGTATC ACACCGGCCG CAGGACCCAG GCCCTGGCCG TCTTCCGCGA GGTCCGAAGG CTGTTGGTCG ACGAGGTCGG AGTCGAGCCG GGTTCCCGGC TGGCCGAGAT ACACGCCCGG ATCCTGCGTG GGGATCCGGA GCCCTCGGAA CCGCGGGCGT CCAGTCCGGC CACAGTGACC GTCGCGCATC CCGCCCCCCG GCAGCTGCCG CCCGCTCCGC TCCGCTTCAC CGCACGCCAC GAGCCGATCG CGGCTCTGGA CCAGTGGATC GCGACTGCCG GCCGGACGGC CGGCACCGTG GCGGTCGTCA GCGGGCCGCC CGGCGTCGGG AAGACCGCTT TGGCCGTGCA CTTCGCGCAC ACCATCGCCG ACCGCTTCCC CGACGGTCAG ATCTACCTGA ATCTGCGGGG ATTCGACCCC CTCGAACCGC CGGTGGCGGC GGCGACAGCC ATGCGCGACG TACTCGTTGC TCTCGGAATG CCGTCCGGGG CGGTGCCTAC CGAACCGGCC GCGCTACTGG CGTTGTACCG GAGCCGGCTC AGCGGCGGCC GGATGCTTCT GGTGTTGGAC AACGCCCGGG ACGCGGCCCA AGTCCGGTCT CTGATACCGG CCGGTCCCGG CAGCATCGTC GTGGTGACCA GCCGGGACCG GCTGTTCGGA CTGATCGCGG TGGACGGTGG AGTGGCGCTG CCGCTGGATG CGCTGACACC CGCGGAATCG GCGCAGCTGC TGGCCGGGCG GCTGGGCGCG ACCACGGTGC GAGAGCACAG GGCCGCGGCG GAGGAGATGG CTCAGCTGTG CTCGCACCTC CCGCTGGCCT TGACCATCGC GGCTGCTCGT GCTGCCGCGC ACCCCACGAT CCCGCTGGCG AACTGGGTGA GCGAGCTCCG CCGGGCCGAC AGACGTCTGG ACATGCTCAC CACCGGCGAC CGTGACTCCA ACGTCCGTAC CGTGTTCTCC TCGTCGTACC ATGCGCTCAG TACGTCTGCG GGCACTGTGT TCCGGTTCCT GAGCCTGCAC CCCGGACCGG AGATCGGCGC TGCCGCGGCC CGCGCGCTGA CCGGGCTGCC GGACCGCGAG GCCCGTGGCG CACTGGACGA ACTGACCCGG GCGAGCCTTG TGGCCGAGAC GGTCCCGGGC CGGTTCGCGG GACACGACCT GCTGCGCGAG TACGCCGCCG AACTCGGGCA AGAGCACGAT CCGGAATTCG CGCGCCGGTC CGGGATACAG CGACTGCTGG ACTACTACCT CCACAGCGCA CATGCCGTGC TGACCCCGTC CTATGCGCAG CGGCTCGCGC TCGAGCTGGC GGCTCCGGTC CCCGGCGCGC ACCCCGAGCA GTTCCTCGAC CGTCAGCAGG GCCGCGCCTG GCGCGACACC GAGTCCCGGG CATTGATCGC CGCGGTCCCG CTGGCGGCCC GAGCCGGCCT GGACCGGCAC GCCTGGCAGC TGGCCACCGT GCTGGCCGGC CACTTGGGCT TCGTCGGCCT GCGGCAGGAA CAGATGGACG TGGCCGCGGC CGGTCTGGCC GCGGCCTCCC TGGACGGGGA TCCGGTCGGG CTCGGGCTCA GCCATATGCA CGTCGCCGAG GCGCACGCCG CGCAGGGGCA GGACGTCGAA GCGCTCGAGC ACCTCGACAA GGCCCTGGAG TACTTCGTCG ATCTGGGTGA GGCATCCTGG CAGGGCATGG TCCTGCTGTA CGTCAGCCAG GCCTGTGAAC GGCGCGCGGA CTTCACGGCG GCACTTGATG CCGCTCAGCG GGCCTCAGTG CTCCTGGCAA GCGTCGACGA TCCGGACGGG CAGGCTCAGG CGTTGAACAA CGCCGGGCAT TATCACACCG AGCTGGGCCG CCCCGACCTG GGGCTGGAGC ACGCGGAACG GGCCCTGGCA CTGACTCGGG AGGTCGGTAA CCGATTCGCC GAGTTCGCCG TGCTCGACAC GCTGATCGTG GCCCACGATC GGCTGGGCGA TCCGAAGTCC GCCGTCGCCT GCGGCCGGAA AGCCGCGCAG ATCGCCGACC AGCTCGGCCC GACACCGCAT CTCGCCGTGG TTCTGGACCA CCTCGCTCAG GCTCATTGGA ACGGAGGCGA CACCGCCGAG GCCCGCACTG CCTGGCAGTC GGCGCTGGCG ATCATGGAGG AGCATCAGGA CCCGAAGGCG GACGAGCTGC GCAGACGCCT GTCCGGTCTG CGTGAGCCGA CCGGAGGGCT GGACGGAGGT GGACGGGAAT CGAACCCGCC GGACGGGGAT TCCCCGTCCC ACCCGCTTTG A
|
Protein sequence | MQIGVLGPLS VAAEAGEVRV TGRRRRALLA LLALHANQVV HTGRLIETAW GSAAAPTSSD SLPSYILRLR RALGAEIGAR ILTRPGGYTL ELAEDELDLI RFRTLRSAAK AKADSGDWPG FHALAEQALA QWRGDPLADI VTDGEMQEET EALSAAHLEL WRDRLDAGLF LGRHAEVAAE TAPLVARHPM DERFREQRML ALYHTGRRTQ ALAVFREVRR LLVDEVGVEP GSRLAEIHAR ILRGDPEPSE PRASSPATVT VAHPAPRQLP PAPLRFTARH EPIAALDQWI ATAGRTAGTV AVVSGPPGVG KTALAVHFAH TIADRFPDGQ IYLNLRGFDP LEPPVAAATA MRDVLVALGM PSGAVPTEPA ALLALYRSRL SGGRMLLVLD NARDAAQVRS LIPAGPGSIV VVTSRDRLFG LIAVDGGVAL PLDALTPAES AQLLAGRLGA TTVREHRAAA EEMAQLCSHL PLALTIAAAR AAAHPTIPLA NWVSELRRAD RRLDMLTTGD RDSNVRTVFS SSYHALSTSA GTVFRFLSLH PGPEIGAAAA RALTGLPDRE ARGALDELTR ASLVAETVPG RFAGHDLLRE YAAELGQEHD PEFARRSGIQ RLLDYYLHSA HAVLTPSYAQ RLALELAAPV PGAHPEQFLD RQQGRAWRDT ESRALIAAVP LAARAGLDRH AWQLATVLAG HLGFVGLRQE QMDVAAAGLA AASLDGDPVG LGLSHMHVAE AHAAQGQDVE ALEHLDKALE YFVDLGEASW QGMVLLYVSQ ACERRADFTA ALDAAQRASV LLASVDDPDG QAQALNNAGH YHTELGRPDL GLEHAERALA LTREVGNRFA EFAVLDTLIV AHDRLGDPKS AVACGRKAAQ IADQLGPTPH LAVVLDHLAQ AHWNGGDTAE ARTAWQSALA IMEEHQDPKA DELRRRLSGL REPTGGLDGG GRESNPPDGD SPSHPL
|
| |