Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_6410 |
Symbol | |
ID | 8337773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 7393130 |
End bp | 7396042 |
Gene Length | 2913 bp |
Protein Length | 970 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644959511 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003117105 |
Protein GI | 256395541 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.112311 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTTTCG GGCTTTTGGG TCCGCTCGAC GTGCGCGTCG AGGATGACAT GCCGGTCAGT GTGGCCGCTC CTATGCAGCG GGCCGTATTG GCGGCGCTGC TGTTGCATCA GGGGCGGCCG CTGACGCTGG ACAGCTTGAT CGACGTTCTG TGGGACGGTC GGGCGCCGGC GAGTTCGCGG ATGACCGCGG TCAACTATGT CGCTCGGTTG CGGCGGTCGG TGGGTCCGGA TGTGGCCGCG CGATTACAGA CCAGTCCTGC CGGCTATCTG GTGAGGCTCG CGGGTGACGC CGAGCTGGAC AGTCTGGAGG CTTCCGGGCT GGAGCGGCGG GCTCTGGACC GGTCGCGGGT CGGGGACTGG TCCGCGGTCG CGACGGCGGC GGGGCGGGCC CTGGCTCTGT GGCGCGGTGA GCCGTTGCAG GATCTGCCCG CGACCCGGCT GCAGCGCGAC CATCTTCCGG CACTGGCCGC ACTCCGGCTG CGGCTGCGGG AACTCGCTGC CGATGCCGCG GTGCACCTCG GCGAGTACGA GCAAGCCGCC GCTGATCTCA CCGAGTTGCT GCGCGACCAT TCGCTGAACG AGCGCCTGTA TGAGCTTCTC GTGGTCGCGC TCTACGGCTC GGGGCGACGC GCCGATGCGC TGGAGGCGTT TCAGCAGGCG CGCCGGACGC TGAGCGCGGA GCTGGGGGTC GATCCGACGC CGCGGCTTCA GGCGCTGCAG CAAGGGATTC TGGCCGGTGC GAGCCACGGC ACGGTGCTGA ATCTGTTGGC CGCCGACGGG CGGTCCGCGG GCACGAGTCC GGCGCGCGCC GTCACCTCGA TCAGTGGTGC GCAGGCGGTG GTGCCCCGGC AGCTGCCCGC CGCGACCCGG TATTTCAGCG GGCGGGCGCA GAGTCTGGCG GCGTTGACGG CGCTGGCCGA CGAGGTCGTC TCCGACGAGG CTGCCCACGA CGCCGCCACC TTGGACGCCG CCGACCGCGA CGGCGCGGCC ATCGCCGTCA TCGCGGGCAT GGCCGGCATC GGCAAAACGA CGCTGGCCGT CCAGTGGGCC CACCGAGCCG CCTCCCGCTT CTCGGACGGT CAGCTCTACA TCAACCTGCG CGGCTTCGAT CCCGGCGGCG CCCCGGTCGC CCCCGACCAC GCGATTCGCG TGTTCCTGGA GGCGTTCGGC ATACCGCCGG CGCGGATTCC GACCACCGCG CAGGCGCGGG CCGGGCTCTA CCGGAGCCTG GTCGCCGATC GCAGGGTCCT GATTCTTCTC GACAACGCGC GCGACGTCGA GCAGGTCCGT CCCCTGCTGC CGGGCACCCC GGCGTGCCTG GTGCTGGTGA CCAGTCGCAA CCGGCTCACC GGGCTGGTGA CGGCCGAGGG CGCGCACTGG ATCCCGCTCG ACCTGCCGGA CCCGCCGCAG GCGCGGGAGC TGCTGGCGCG GCGGCTGGGC TCTGATGTGG TCGCCGAGCA GCCGGAGGCG ATCGCCGAAC TGGTCGAGCT CACCGCCCGG CTTCCGCTGG CGCTGAGCGT GGCCGGGGCG CGGCTGGCGA TGAACCCGCT GCTGCCGGTC TCGGCATTCC TGGCCAGCCT GCGCACCACG CGGAGCCGGC TCACCGTGCT GAACGGCGGG GATATCACCA CCGATCTGCG GGCCGTGTTC TCGTGGTCCT ACCAGCAGCT GGCACCGGCC GCGGCGCGGA TGTTCCGGCT GGTGAGCCTG TACCCCGGTC CGGACGTCTC CCTGGCGGCG GCCGCGAGCC TGGCCGGCCT GGAAGCCGCG GAGGCCAGGG CGGCGTTGGC GGAGCTGACC GCCGCGAACC TGATCACCGA GCCCGCGCCG GACCGGTTCG CCTGTCACGA CCTGCTGCGC GCCTACGCCG CCGAACTCGC CGACGCCCCG GCGGAGCGGG AGCTGCGATC GGCGGCGTTC GCGCGGATGC TCGACCACTA CCTGCACTCC GCCTATCGGG CGTCGATGGT CCTGGCGTCG CATCGGGACC CGGTCGAGGT CGGTGATCCG CAGCCTGGCG TGGCGGTGGA GGATTTCCCC GACAAGGCTG CGGCCACGGC GTGGTTCACC GCCGAGCACC TGGCGCTCCC CGCCGTCATC GCTCGGGCGG CCGATACCGG CTTCGACGTC CACGCCTGGC AGACCGCGTG GGCCGGCTAC GTGTTCTTCA ACGTGCACCG GTATTGGAAC GACATGCTGG AGACGCTGGT GATCGGTCTG GACGCCGCCG AGCGGCTGGG CGACGAATAC GCCCAAGGAC TCGTGCTGCG TCCGCTCGGC GGAGTGTCCG ACAAGCTCGG CCGGGAGGAC GAAGCGCAAG CCTACCTGGG GCGGGCTCAC GTCCTGCTGG TCAAACTGGA CGAGCCCTTG GGCCATGCGC ACGTCCACTT GAGCATGGGC CAATCCGCCT ATCGGCGCGG CCGCTACGCG GAGGCTCTCG AGCACAGCGA GAAGTGCTTG AACCACGTCA CCCGCGCCGG ATCCGGACTG GGCCAGGCGA CCGCGCTCGG GTCGCTGGTC TCCTGCCACG TCGCCCTCGG CGACCTCGAC GCGGCCCGCG CCGCCGGCGA GCGCTCGCTG GCCCTGTACC GGGAGTTCGG CTCGCCGATC AGCGCCGGCA ACACCTTTCT CAGCCTGGCG GACATCGAGG TCGCCGGCGG CGACTACCCG CGGGCCGCCG AGCTCTGCCG CCGGGCCGCC GAGGTCTTCG CCGGGCACGG CGCGCGCCAC TACGTCGCCA AAGCCCTGAC TCAGCTCGGC GACGTCTTGG ACAAGGACGG CGACCAGGGC TCCGCGCTGT CGGCGTGGCG CGAAGCCCTG GAGGCGGTCG ACCAGCTGGA CAGCTCCGAC GCGGACGGCA TCCGGGGCGG GCTGCGTGAC CGGCTGCGCG AACGCGGTCA GGTCGTGCCG TAG
|
Protein sequence | MRFGLLGPLD VRVEDDMPVS VAAPMQRAVL AALLLHQGRP LTLDSLIDVL WDGRAPASSR MTAVNYVARL RRSVGPDVAA RLQTSPAGYL VRLAGDAELD SLEASGLERR ALDRSRVGDW SAVATAAGRA LALWRGEPLQ DLPATRLQRD HLPALAALRL RLRELAADAA VHLGEYEQAA ADLTELLRDH SLNERLYELL VVALYGSGRR ADALEAFQQA RRTLSAELGV DPTPRLQALQ QGILAGASHG TVLNLLAADG RSAGTSPARA VTSISGAQAV VPRQLPAATR YFSGRAQSLA ALTALADEVV SDEAAHDAAT LDAADRDGAA IAVIAGMAGI GKTTLAVQWA HRAASRFSDG QLYINLRGFD PGGAPVAPDH AIRVFLEAFG IPPARIPTTA QARAGLYRSL VADRRVLILL DNARDVEQVR PLLPGTPACL VLVTSRNRLT GLVTAEGAHW IPLDLPDPPQ ARELLARRLG SDVVAEQPEA IAELVELTAR LPLALSVAGA RLAMNPLLPV SAFLASLRTT RSRLTVLNGG DITTDLRAVF SWSYQQLAPA AARMFRLVSL YPGPDVSLAA AASLAGLEAA EARAALAELT AANLITEPAP DRFACHDLLR AYAAELADAP AERELRSAAF ARMLDHYLHS AYRASMVLAS HRDPVEVGDP QPGVAVEDFP DKAAATAWFT AEHLALPAVI ARAADTGFDV HAWQTAWAGY VFFNVHRYWN DMLETLVIGL DAAERLGDEY AQGLVLRPLG GVSDKLGRED EAQAYLGRAH VLLVKLDEPL GHAHVHLSMG QSAYRRGRYA EALEHSEKCL NHVTRAGSGL GQATALGSLV SCHVALGDLD AARAAGERSL ALYREFGSPI SAGNTFLSLA DIEVAGGDYP RAAELCRRAA EVFAGHGARH YVAKALTQLG DVLDKDGDQG SALSAWREAL EAVDQLDSSD ADGIRGGLRD RLRERGQVVP
|
| |