Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_1981 |
Symbol | |
ID | 8333324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 2238021 |
End bp | 2241122 |
Gene Length | 3102 bp |
Protein Length | 1033 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644955130 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003112742 |
Protein GI | 256391178 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0609683 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGTACA CCGTGCTCGG GCCTGTCGGC ATTCGGAGTC ACGATGTTTT TCATGCTGCC GGCACGGCGA AGGAACAGGG GGTCTTGGCG ATTCTGCTGA TGGAGCGGGG GCATTCCGTC TCTACGCAGA CACTTGCCGA TCGGCTGTGG GAGCGGCCTC CGGAGCAGTT TCGGGCGACG TTGCAGGCGC ACATTTCGCG GTTGCGGCGG CGGTTGCGGG AGGCCTCTGA GCAGGCGGAA GTCATCGCCA GCAATCAGGC CGGCTATCGC ATCGACGTTC CGGCGGACCA GGTCGACGTG CATTATTTCG ATCTTTTGGT TTCGCGGGCT CAGGCGCACG CCGGGCAGGA TCCGGACTCG GCGCGGGAGT TGCTGCGCGA GGCCGAGGGG CTGTGGAACG GCGAGCCGTT GGCGGGGTTG CCTGGGACGT GGGCCGAGAC CATGCGGCGC GTCCTGACCG ACAAGCGCCG CACCGCGTTG CTCAAGCGCC TCGAGCTGGA TCTCCAGACC GGCGGCAATG CCGATGACGC TGTCGCCGAG CTGACCGAGC TGGCTTCGGG CAGCCGGATC GACCAGCGCG TCATCGAGCT GCTGATGATC GCCTTGGACA GCGCCGGACG TCCCGGCGAT GCCCTGACGC TCTATCACGA GGTCCGCATC CGCCTTCGCG ACGAGGCAGG CACGGACCCC CGCGCCGAAC TCCGCCAGCT CCACCAGCGC CTGCTCAACG GCTCCGCGCA GCACCCGGCG GCGGCAGCAC CCGCCCAGCC GGTCACGCCC CGCGCCATCG ACACCCTCGA CCCCGATCCG CCCTACGTCG CAGGACGCGA ACAAGAACTC GCCGCGATCC TCGCTGCCGT CGCCGCGGAC CTGCGACCGG GCAAACGGGG CGCGACCTTC CTCATCGACG GCTTGGCCGG CATCGGCAAG ACGACGCTCG CCCTCCAGGC GGCGCACCTC CTGCGCTCAC ACTGCCCCGA CGGCGCGCTC CAGCTGAACC TGCACTCGCA CGATCCGTAC CTCCCGCCCC TGGACCAGCG CCAAGCCCTG ACCCAGCTCC TGGACGCGAT CGGCACGCCC TACCGCGAAC TGGCCCGCGC CGACACCGTG CCAGCGCTCG GCGCGCTATG GCGAAAACGC ACCAGCGGCC GCCGCCTGCT GATCCTGCTC GACGATGTCC TGGACACCGC CCAGATCGAG CTGCTGATCC CCGCGACAGC CGGCACGATC GTCCTGATCA CCTCGCGCCG CCGCCTGACC GGGACGCCCG GCAACCGCCA GTACACCCTC GGTCCGCTGC CCGACTCCGC AGCCACAGCG CTCCTGTCTC ACATCACTGA CAGGACCCTG CCAGAGGACG ATGACCTCGC CAGCTTCACC CAGTGCTGCG GCGGTCTGGC TCTGGCGATC ACGGTCGCCG CCGGCCACCT GCGCAGCCGA CCGGTCTGGA CGGTCGGCGA CCTGGTCTCG CGCCTGTCCA CGACCTCGCA GTCGCTCGCC GACGACCCCC TGACCAGCCC GATCCACACC GCCTTCGCAA TGTCCTACCA AACCCTCAGC CCCACACTGC GGGACCTGCT CCGCTACATC GCCGCCCACC CCGGTCCCGA CATCGGCCTG CCGGCCGCCG CCGCGATGTC CGGCGCGGCG CTGGCCGACA CCGACATCAG GCTCGACGCC CTGGTGGACC ACCGGCTCCT GAACCTCGCG AGCGCGCACC GCTACCGCCT GCACAACCTG CTCCGCCAAT ACATCCTGGT CCAGGGAGAC GAGCAGCAGA ACCTTGATAA CCGCCAGGCA GTCGGCCGCG CCATAACGTT CTACAAAGCC GCCGCCGCAC GCGCCGATCA CGCGCTCCAG CCTCGCCGCC GCGAACTGCA CTACCCCGCA GCCTCCGCTC AGGTCGAGGG CGTCAACCTC GACACCACCG AGCAGGCGCG CACCTGGCTC GACACCGAGC ACCTCAACCT GGCGGCGGTC ACCACCTGGT CGGTCCAACT CGGCCGCGGA ATACAGGTCG GGCTGATCCC GCACGTGCTC GCCCAACACC TGGACCGGCG CGGACGCTGG CCGCAAGCGC TCGAGCACAT CGATGAACTG CTCGCCGCCC CGAAAGGCGA CTTCCCCGGC GGCAACCCGG ACGCCGTCAC CGCGTGTCTG CTGACCGACC AGGCCGGGCT CCTCATCCGC GCCAACCAAC TCGACGTCGC AGTGGACGCC GCCAACGCCG CGCTGGCCAT CTGGAACGCC GCCAACGACC GCTACGGCCA AGCCGACGCC CACTTCCAGA TCGGACGCGC CCACGACGCC GCCGAACGCC ACGACGAAGC CCTGCAAGCC TTCCGCACGG CAGCCGCGCT CTACGAGAGC CTCGGCGACC ACACCCGGGT GGCAGTAGCC GAGGACCAAT GGGCCGTCAC CGCGTTCAAG CAAGGCCACC TGGACGAAGC CTTCTCCCGT GGCCACCACG CGCTGGACAT CGCCCGGCAG CAGAACGACC TCGCGGCCAT CGCCGACGTC CTCAACAACT TGGGCGAAAT GCACCGCCAG GCCGACCACG ACCAGGAGGC GCTCGCCTTC TTCCAAGAGG CCCGCACCCT GACCGCAGCG CTCGGCGACC CGCTCATCAC CGCTGTGCTC GGCTACAACA TCGGCGCCGT CTACGAACAC GCCGGCGACT ATCACCGTTC CCTGACATCG ACGCGAACCG CGCTCCTGCA GTTCCGCGAA CTCAATGACC ACCGCAGCGA AATCGAGTGC CTCATCCTGC TCGCCACCGC GCACATCAAC CTCGGAGACC GCAACGCAGC GTTCGAAGAA ACCCGGCACG CAATCGACCT CGCCGAGCAA ACGCACGACC AGCTACGGCT GGCGCAAGTC CGCCTGGCGC AGGGCACCAT GCTTGCCGCC CGCGGCGATA TCCAGGGCGC GATCGAGGCG TGCGAATCAG CCCTCGATAT TGCCGAACAG ATCGGCGCCG TCGCCGAACA GAGCCAGGCG CACCGTTCCC GTTGCGAGGC GTACACGAGT CTTGGCCTGC ACGATCGCGC CCAAAGCCAT CTCCAGCAAG CAGAATCGCA AGGCGGCCCG ACCACTGAAT GA
|
Protein sequence | MQYTVLGPVG IRSHDVFHAA GTAKEQGVLA ILLMERGHSV STQTLADRLW ERPPEQFRAT LQAHISRLRR RLREASEQAE VIASNQAGYR IDVPADQVDV HYFDLLVSRA QAHAGQDPDS ARELLREAEG LWNGEPLAGL PGTWAETMRR VLTDKRRTAL LKRLELDLQT GGNADDAVAE LTELASGSRI DQRVIELLMI ALDSAGRPGD ALTLYHEVRI RLRDEAGTDP RAELRQLHQR LLNGSAQHPA AAAPAQPVTP RAIDTLDPDP PYVAGREQEL AAILAAVAAD LRPGKRGATF LIDGLAGIGK TTLALQAAHL LRSHCPDGAL QLNLHSHDPY LPPLDQRQAL TQLLDAIGTP YRELARADTV PALGALWRKR TSGRRLLILL DDVLDTAQIE LLIPATAGTI VLITSRRRLT GTPGNRQYTL GPLPDSAATA LLSHITDRTL PEDDDLASFT QCCGGLALAI TVAAGHLRSR PVWTVGDLVS RLSTTSQSLA DDPLTSPIHT AFAMSYQTLS PTLRDLLRYI AAHPGPDIGL PAAAAMSGAA LADTDIRLDA LVDHRLLNLA SAHRYRLHNL LRQYILVQGD EQQNLDNRQA VGRAITFYKA AAARADHALQ PRRRELHYPA ASAQVEGVNL DTTEQARTWL DTEHLNLAAV TTWSVQLGRG IQVGLIPHVL AQHLDRRGRW PQALEHIDEL LAAPKGDFPG GNPDAVTACL LTDQAGLLIR ANQLDVAVDA ANAALAIWNA ANDRYGQADA HFQIGRAHDA AERHDEALQA FRTAAALYES LGDHTRVAVA EDQWAVTAFK QGHLDEAFSR GHHALDIARQ QNDLAAIADV LNNLGEMHRQ ADHDQEALAF FQEARTLTAA LGDPLITAVL GYNIGAVYEH AGDYHRSLTS TRTALLQFRE LNDHRSEIEC LILLATAHIN LGDRNAAFEE TRHAIDLAEQ THDQLRLAQV RLAQGTMLAA RGDIQGAIEA CESALDIAEQ IGAVAEQSQA HRSRCEAYTS LGLHDRAQSH LQQAESQGGP TTE
|
| |