Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_7082 |
Symbol | |
ID | 8338449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 8233747 |
End bp | 8236719 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644960163 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003117753 |
Protein GI | 256396189 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.379783 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGGTGC AGTACGGGCT GCTCGGACCG GTCCGCGCGC TGCGAGTGGG CGCCGCGGGA GAACCAGCCG AGGAGCTCAA AGTGGGGTCC CCGCAACAGC AGGCCGTGCT GGCGCTGCTG GCGTCACGGG CCGGTCGGGT GGCGACAGCC GACGAACTCA TCGAGGGTCT GTGGGGCGAC GAGCCGCCGG AGGGTGCGCT CGGTACGGTG CGCACGTACG CGTTCCGACT CCGTAAGGTC TTCGGTGCCG AAGCCATAGC GTCCATCGCC GGTGGCTATG CGTTACGCGC GGAGCGTTCG CACGTGGACT TGTTCACGTT CGAGGATCAT GTCGCGGCAG CGGAGCAGCG CCGGATGTCC GGGGACGTCG TGGGCGCGCG TGGAGAACTC GCTAATGCGT TGGCGCTATG GCGAGGCACT CCGTTGGCCG GGATACCGGG CCCTTATGCG GAGTCCCTGC GCGCGACGCT CTTGGAGCGG CGTACCGCTG CGCAGGAGAG CCGCATCGCC TTGGATTTGG CGTTGGGGCG TGTAGGAGAC GCCGTTTCCG AACTCACGGT CCTCACCGCT GAACACCCGC TGCGTGAGGG ACTGCGTGCG CAGCTGATGC TGGCGCTCTA TCAGACCGGA CGACAGGCCG AGGCGCTAGG CGTCTACGCA GATACTCGAC GGCTGCTGCG CAAGGAACTG GGAGTCGATC CCGGCGCAGA GCTCGGCGAA CTGCATCAGC GGATCCTGCG TGCCGACCCG TCCTTGGCGC GTCCTTCTGC CGACAGTGCC GAGATCGCCC CGAGAGCGGC TGAGGCTTCT GCGGGGACCG CTTTGGAGAA GCCGCCTACG ATCCCTGCGC AGCTTCCTGC TGACACAGCC GACTTCACCG GACGCGAGAA CCTGACGCGC GTGCTCGCGG CGCGGATATC CACGACGGTC GGGCAGTCGG TGGCGGTCTG TGCGCTGTCC GGGCTCGGCG GAGTCGGCAA GACGGCGCTC GCCATCCACT TGGCGCACTC GGTGCGGGAG GAGTTTCCCG ACGGGCAGCT CTATGTGGAC CTGCGCGGAG GCGACCCGAC GCCGGCAGAC CCTGCGCCGG TTCTCGCAGC GTTCCTGCGG GGTCTGGGGA TCTCGGAGGG CGAGACTGCG CCAGGGCTGG AGGAGAGGGC TGCGGCGTAT CGGTCGGCGC TCGCCGGTCG GAGGGTTCTG ATCGTCCTGG ACAACGCCCG GGACGCCGCG CAAGTACGGC CTCTGCTGCC CGGCGCTCCG GGGTGCGCGG TCATCGTCAC GAGCCGGCCG AAGCTGACCG GGTTGGCCGG GGCGACGTTC GCTGATCTCG ACGTGCTGGA TCCCGGCGAG GCGATGAACA TGTTCACGCG GATCGTCGGC GAGGAACGCC TAGGGATGGA GCACACCGCG GCTATAGACG TGGTGTCCCT CTGCGGATAC TTGCCTCTGG CGGTGCGTAT AGCGGCCGCA CGGCTCGCGT CCCGACCGCG CTGGCGTATC GGGTCGCTCG CGGCGCGGTT GTCCGACGAG CGGCGCCGGC TGGGCGAACT GGCTGTGGGC GACCTCGCGG TGCGCGCGGC GTTCGAACTG GGCTACCACC AGTTGTCCCC GGCGCAGGCG GATGTCTTCC GGCGGCTGTC GCAGCTGAAC AGCGCTGACG TGTCGGCTGC TGCGGCGGCC GCGCTGCTCG GCCAGGACGA GGCCGACACC GAGGAAGTGC TGGAGTCGCT GGTGGATGCC GCGATGCTGG AGTCTTCGTC TCCGGGGCGC TACCGCTATC ACGACTTGCT GCGGCTCTAT GCGCGGGAGC AGTACGCGGC CGAGGGTGGC CTCGACGACG CGGGCTTCGT GAGCCTGCTC GACTTCTACC TCGCCTCGAT GCGCCGGCTG CGGCGCATCC CGGTGGAGGA GCTCGGTCTT TTCCCGACGC GGTCCTCTGG GAGGGAGTTC GACTCCGTGG AGGCCGGGGT GCGGTGGATC GCCGACGAGG GCAGCTGTGT GGACGCGGTG TTCAATCGGC GGGTCCTGAC CGCGCCGCCG TGCAGCGTCG GGGTGGCGCC GTTGACGCTC GCAGTCGAGC TGCTGGACCA CCTTGTCTCG CTGCCGGGTA TCGAACGCTA TGCGGATGAG CTCTCTGCGG CGGCGCGTAA CGCTGCCGCA GCTGCCGTGG AGTCTGGGGA CGTACGGAGT GAGGCGCGTG TACGGCACTC CTTGGCGCGC ATCCTTTACG CGACGTACCA GATCGAGGCT GCTGCCGAGG AGGCTGAGCG ATCGCACCGC GCCGCCGAGT CGGTCGGCGG CGACGAGACT CAGGCGGACG CCGTCAATCT GCTCGCGATG ACGTATGCGG ACCTTGGGAG GGACGCGGAG GCGATCGCGC TGTACGAGCG CGCTGTCGAC GTCAGCCGGG AGTTCGGGGA CGTGGCGTCC GAGGCGGCCG CGCGGCAGAA CATGGCGCGC TCGCTGCTGG TGTTGGGTCG GACGGAGCAG GCACTGCAGA GCACTATGGC GGGACTTGCC TTGTGTCGCG CCTTGGGGGA CGACGTGTCC ACCGGGTACG CGCTCTTCCA GACAGGTTCC ATCTACCTGC AGACGCTCGC CTATAGGGAG TCCCTGACGT ACTTCACGGA GGCGGCTCGG TACTTCGCCG ACGTGCATCC GCCTATGGAG GGTGCTGCGC ACGCCTCCAG CGCGCGGGCT CTGTTGGGAC TCGGCGAGCC CAGCGGCGCG CTGGAGCATG CCGAGCGTGC GGTGAGTGTG CTGCGGGGGA CCGCCGACAC CTGGCAGCAC GCGACGGCGC TCGCGGTGCT GGCTGACGTG CTGGATGTCG CCGGGCAGCC GGAGCGGGCG CGGGGTTGTC GGGAGGAAGC GTTGGCGATG TTCGTCGTGA TCGGCGCGCC GGAGGCTGAA CGTATCAGGG TCATGCTCTC CGGCTCGGAA GCAACCTTCA CGGCGGTTCG AACGTCGCGC TAA
|
Protein sequence | MVVQYGLLGP VRALRVGAAG EPAEELKVGS PQQQAVLALL ASRAGRVATA DELIEGLWGD EPPEGALGTV RTYAFRLRKV FGAEAIASIA GGYALRAERS HVDLFTFEDH VAAAEQRRMS GDVVGARGEL ANALALWRGT PLAGIPGPYA ESLRATLLER RTAAQESRIA LDLALGRVGD AVSELTVLTA EHPLREGLRA QLMLALYQTG RQAEALGVYA DTRRLLRKEL GVDPGAELGE LHQRILRADP SLARPSADSA EIAPRAAEAS AGTALEKPPT IPAQLPADTA DFTGRENLTR VLAARISTTV GQSVAVCALS GLGGVGKTAL AIHLAHSVRE EFPDGQLYVD LRGGDPTPAD PAPVLAAFLR GLGISEGETA PGLEERAAAY RSALAGRRVL IVLDNARDAA QVRPLLPGAP GCAVIVTSRP KLTGLAGATF ADLDVLDPGE AMNMFTRIVG EERLGMEHTA AIDVVSLCGY LPLAVRIAAA RLASRPRWRI GSLAARLSDE RRRLGELAVG DLAVRAAFEL GYHQLSPAQA DVFRRLSQLN SADVSAAAAA ALLGQDEADT EEVLESLVDA AMLESSSPGR YRYHDLLRLY AREQYAAEGG LDDAGFVSLL DFYLASMRRL RRIPVEELGL FPTRSSGREF DSVEAGVRWI ADEGSCVDAV FNRRVLTAPP CSVGVAPLTL AVELLDHLVS LPGIERYADE LSAAARNAAA AAVESGDVRS EARVRHSLAR ILYATYQIEA AAEEAERSHR AAESVGGDET QADAVNLLAM TYADLGRDAE AIALYERAVD VSREFGDVAS EAAARQNMAR SLLVLGRTEQ ALQSTMAGLA LCRALGDDVS TGYALFQTGS IYLQTLAYRE SLTYFTEAAR YFADVHPPME GAAHASSARA LLGLGEPSGA LEHAERAVSV LRGTADTWQH ATALAVLADV LDVAGQPERA RGCREEALAM FVVIGAPEAE RIRVMLSGSE ATFTAVRTSR
|
| |