Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3752 |
Symbol | |
ID | 8335105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4236608 |
End bp | 4239586 |
Gene Length | 2979 bp |
Protein Length | 992 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644956892 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003114495 |
Protein GI | 256392931 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTTTCG GGGTCCTCGG TCCGCTCCTG GCGCACGACG GCTCGGCCGA CAGACGGATC GTCGCGCCGA AGCAACAAGT CATCCTGGCC ACGATGCTCC TGAACGCGAA CCGGGTGGTC TCGGTCGAGC GCATCGCCGA GAACCTGTGG GCCGACGCCG CACCGGGCGG TGCGCACAAG ACGCTGCACA CCTATGTCAT GAGGCTGCGA CGATCGCTCG GCACCGCCGC CGACCGGGTG CGCACCGAGG CCCGGGGCTA CCGCTTCGTC GTGCAGGACG GGGAACTGGA TCTGCACCGC TTCACCGCTC TGGTCGAAGC GGCCAAGGCC AGGAGCGCGG AGCAGGCTTG GGAAAGCGCC GCAGACCTCT TACACACCGC GCTGGGAGTC TGGCGCGGGC AGCCGTTGCA GGGTGTCCAG TCCGTCGCCC TGAGCGGCGA GGTCGACCGG CTCGCCGAAC TCCGGCTGGA CGCGCTGTTG GCCCGGATCG ACGCGGACCT GCATTTGGAC CGCCACGAAC TGCTCGTGCC GGAACTGCGC GACCTCGTCG TCCGGCAACC TCTTCTGGAA CGCGCGCACT CCTTCCTCAT GCTCGCGCTC TACCGCGCGA GCCGGCGCGC CGACGCGCTG GCCGCCTTCC GGTCGGCGCG GCGCGTGCTC GCCGAGGAGC TGGGGATCGA GCCCGGCCGG GACCTGCAGC GGCTGCACGG CCGGATCCTC AACGCCGACC CGGCGTTGAA CGATCCGGCG TGGAACGACC CGGCTGGCTC CTCCCCGGAG CCGCGATTTC CGTCGCAGCC GGCACCGGAT CCCACGTCGG ATGCGGAACC GGCCGCCCCC GCGCCTCATC CCGCCGCGCC CGCACAGTTG CCGCGCGGCA TCCGTGACTT CACCGGACGC GACCACGAAC TGGAGCGCAT CAAGACGCTG CTTTCCCACG ACCCGCCCGG CAGCGTGCCG GCTCCGGGAG TCTGCGTCAT CGCCGGAGCC GGGGGCACCG GCAAATCGGC GCTCGCCGTC CAGGTCGCGC ACGCCGTCCG GGACCGGTTC CCCGACGGCC AGCTCTATCT CGACCTGCGC GGCGCCGACC GGCACCCGGT CGATCCCGGC CACGCGCTCG CCGAGTTCAT CCGCGCCCTC GGAGACGGTG GTTCCGCGCT GCCCGAGGGG GTGGCGGACC GCTCCGCGGT CTTCCGCACG ATGCTGGCCG ATCGGCGGGT GCTGATTTTG CTGGACGATG CCGGTGACGT CCAACAGGTG CGGCCTCTGC TGCCGGCCGA TCCCCGGTGC TGCGTCATCA TCACGAGCCG CAGCCGCCTG CCCGGCCTGG AGGACTGTGC GCGGCTGGAG CTCGGCTCGC TGTCACCACA GGACGGCGCC AGCCTGTTCG GCAAAGTCGT GGGCGACGAG CGGCCGCAGT CCGAGCCGGC CGCGGTGGCG CGCATCGTCG AGCTCTGCGG CGGGCTGCCG CTGGCGATCC GCATCGCCGG CTCGCGACTC GCGGTGCGGC GGACCTGGCG GCTGGAGTCG CTGGCGGCCC GGCTGGGGGA CACCGCACGG CGCCTCGACG AGCTCCGGAC GGACGACCTG CAAGTGCGGG CGACCCTCGA CATGAGCTAC CAGCACCTGA CCGGCGACCA GGCGCGAGCC TTCCGGCTGC TCGCCGTCCC CGATGTCGAC TCACTGTCCG TCTGGCACGC AGCGGTGCAC CTCGATGTGC CGGCACGGAC AGCTGAGACT CTTCTGGAGA GCCTGGTGGA CGCCTTCCTG CTGGAGCCGG CCGGCGCCGA GCGCTATCGC TACCACGATC TGACGCGCGT GTTCGCCCGC GAGGCGGCGT CCGTGACCGA GTCCGCTGAG GCGTTGGCCG GCGCCGCGGG CCGGACTCTG GCGGCGTACG CAGAGCTGCT GGCCCATGCC GCCGCCGCTG CCCGGCCGGG CTACCTCGAC GAATCCCCGC CCGCGCTCCG GTTCGGCACC GCCCACGAAG CGTTGGACTG GCTGGATCAG GAGTTCCGCG CCGTCGGCGG ACTGATCGTC CAGGCCGGCT CCGGCCCGGC CGACGCGGTC GGCGTCGCCG CGGACATGTT GTACCGCGTC CAGTGGTACC TGCGGTCCCG GGGGCACTGG CGGCTGTGGC ATGACGCGGC GGCCGCGGTG ATCGACGGCG CGGTCCGCAC CGGCGACACG GCCGCCGAGC TCGTCGGCCG CCAGAGCAGC GGCCTGCTGG CCCTGCTCAC CGGAAGATTC GAGGAATCCG ACGAGAACCT GTCGGCGGCC GTCGGGCTCG CCGAGCGGCT CGACGATTCC CTGGAAAAGG CCCGCGTCCT GAACCGGCGC GGCCTGCTGG ACTTCCAGCG TGGTTTCTAC CGCGAGGCCG TGGCAGACCA CGAGGCGGCG GCGGACCTGT TCAAGCGGCT CGGCAACCGG CTGGGCGAAT GCGCCAGCTT GGTGAACATC GGCAAGTGCT TGCGCGTGTC CGGCGAACCG GCACGGGCTC TGGCTCACCT CGAACGGGCT CTGGCGCTCA GTGAGGAACT GGGCGAATCG GAGAACGCGA CAATGGCCCG GCACCACCTG GCCGCTTGCC ACTCCGAACT CGGCAACCAT GAAACGGCCA TATCTGCGCA GTATGACTGC CTGGTCTTCA CGCGCGAACA CGGACTCCGC GAAGGCGAGG CCTTCGCTCT CGCCGAACTG GGTCGTGCGC TCCTGAGAGC GGACCGCGCG CTCGAAGCCC TGGAGAGTTT CGAGGAGGCC ATGGACCTCT TCAGCGCCCT CGGCGACCCC AATGCGGTCG CGGTGTTCCT CGCGGACTCC GGCTTCGCGC ACCAACGCCT CGGCGACCTG GCCGCCGCGA CGAGCGCATG GCGGGCGGCA CTGCCTGCGC TCCGGCCGGA CACGCGGGAG GCAGGCGCTG TTCGAGAGGT GTTGGGCGCC TATACCCATG AAGAGATTCA CACCAGTGAA TCAGGGTGA
|
Protein sequence | MRFGVLGPLL AHDGSADRRI VAPKQQVILA TMLLNANRVV SVERIAENLW ADAAPGGAHK TLHTYVMRLR RSLGTAADRV RTEARGYRFV VQDGELDLHR FTALVEAAKA RSAEQAWESA ADLLHTALGV WRGQPLQGVQ SVALSGEVDR LAELRLDALL ARIDADLHLD RHELLVPELR DLVVRQPLLE RAHSFLMLAL YRASRRADAL AAFRSARRVL AEELGIEPGR DLQRLHGRIL NADPALNDPA WNDPAGSSPE PRFPSQPAPD PTSDAEPAAP APHPAAPAQL PRGIRDFTGR DHELERIKTL LSHDPPGSVP APGVCVIAGA GGTGKSALAV QVAHAVRDRF PDGQLYLDLR GADRHPVDPG HALAEFIRAL GDGGSALPEG VADRSAVFRT MLADRRVLIL LDDAGDVQQV RPLLPADPRC CVIITSRSRL PGLEDCARLE LGSLSPQDGA SLFGKVVGDE RPQSEPAAVA RIVELCGGLP LAIRIAGSRL AVRRTWRLES LAARLGDTAR RLDELRTDDL QVRATLDMSY QHLTGDQARA FRLLAVPDVD SLSVWHAAVH LDVPARTAET LLESLVDAFL LEPAGAERYR YHDLTRVFAR EAASVTESAE ALAGAAGRTL AAYAELLAHA AAAARPGYLD ESPPALRFGT AHEALDWLDQ EFRAVGGLIV QAGSGPADAV GVAADMLYRV QWYLRSRGHW RLWHDAAAAV IDGAVRTGDT AAELVGRQSS GLLALLTGRF EESDENLSAA VGLAERLDDS LEKARVLNRR GLLDFQRGFY REAVADHEAA ADLFKRLGNR LGECASLVNI GKCLRVSGEP ARALAHLERA LALSEELGES ENATMARHHL AACHSELGNH ETAISAQYDC LVFTREHGLR EGEAFALAEL GRALLRADRA LEALESFEEA MDLFSALGDP NAVAVFLADS GFAHQRLGDL AAATSAWRAA LPALRPDTRE AGAVREVLGA YTHEEIHTSE SG
|
| |