Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_6515 |
Symbol | |
ID | 8337879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 7507099 |
End bp | 7510113 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644959612 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003117205 |
Protein GI | 256395641 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.409149 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.360927 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACTT TTTCGATTCT CGGGCCTGTG GAGGTGCGGG TCGATGGTTA CCCGGTGAGC GTTCCGCGGC CGCGTCGGCG GGCCGTGCTC GCCTACCTGC TGCTGCATGC GAATGACCGG GTGGAGACCG AACAGCTGAT CGGTGCGTTG TGGGACGCCG AACCGCCCAG AACGGCGCGG GCACAGGTGC ATTCGGCAGT TTCGGCTCTG CGGACCGGGC TGCCGGGGAC GCTCGGCGAA GGCCTGGTGT CGCAGACCAG TGGCTACCGG TTGACTGTCG CCGCCGATCA ACTCGACCTG ACGCTCTTTC GCCGGCTGAC AAGCGCGGCG AGAGCGCAGA TGGCGAGCAG ACAGTTCGAC ACCGCGGTGG GGTCGCTGCG TGCGGCCCTG GCATTGTGGC GGGGCGCGGC GTTGGCGGGG GTGGAGGCGC CTTTCGTCGA GCCGGCGCGC ACGCGGCTGG AGGAGGAGCG GTTCAGCGCG TATGAAGCGC TGGCCGATCT GGAGATGGCA GCGGGTCGGC ACGTGGATCT GGTGCCGTTG CTGACCGGAT TGCTGAACGA GTACCCGACC CGGGAGTCGC TCGTGGAGCG GCTCGCCCTG GCCCTGTACC GGGGCGGGCG GAAGACGGAT GCGCTGGCTC TCGTGCGAAA GCTGCGGACC CTGCTCGCCG AGGAATACGG CCTCGACCCG GAGCGCAGCA TCGTGGAGCT GGAAAACTCC ATGCTGCGAG GGGATCCGGC TCTGGAGCAT CAGTGGCCTG CGATGTCGGC ACAACCACTG AGCCGATCCG AAGCGGGTGC CGAAACCGGT GCCGCCGCGG GCGCCAGGCC CGGGGCTTCG ACCACCGGGG CCGCGCCGGC CGCGCCCGCC GCGCCCACGC TCACCGTCCC CACTCCGGTC CCCACTCCGG TCCCCACTCC GGCGCCCGTT CCAGTTCCGG CCGCACGCAC CCTGCCCCGG CCCGCCCAGC TGCCCCGCGC CAGCGCGGGC TTCGTCGGCC GCGCCGCCGA GTTGAGCCGC CTGACCCGGC TGCTCGGCTC CGACAACGAT TCCCCCTACA TCGCCGTGGT CACCGGGCCG GCCGGAGTCG GGAAGACTGC CCTGGCCCTG CTGTGGGCAC ACCGCCAGGC CGACGCGTTC CCCGACGGAC AGCTCTTCGT CGACCTCCAC GGCTACGACC GCATCGAGGC CGAGAACGCC GACAGCGTGC TGGAGCGTTT CCTGCTCGCT CTGGGCATCC CGGGGCACGA CATACCCTCG GGGCTCCCCA AACGCGAAGA TCTGTTCCGC TCGGCGGTGG CCAACCGCCG TATGCTGCTG GTTCTGGACA ACGCGCGGGA CTACCAGCAG ATCAGCCCTC TGCTTCCGGG TTCAGCACTC AGCCGCACCG TCATCACCAG CCGTGCGCGC ATGGGCAGCC TCGTGGCCGA CACCGGCGCA CTGACCGTCC AGCTCGGCGT ACTGCCGCTG GATGAATCAG TCGAAGTACT CGCGCGCATC GTCGGGCCCG ACGCCGTTGC CGGCGCGCCC GATGCGTCGC GCGAGCTGGC GCGCCTATGC GCAGGACTCC CCTTGGCATT GCGTATCTCG GCAGTCCGCT TGCTCGAGGA ATCCACCGCC GGAATCGCCG GCCTCGCAGC CGAGCTGACG TCCGAAGAGC ACCGCCTTTC GGCTCTGGAT CTGCTCGACG ACGGCCGCAC CGTCTCCCAG GCCCTGGAAC ACTCGCACCG CGGGCTCTCC GCCGAACAGG GCCGCCTGTT CCGCCTCCTG AGCCGACACC CCGTCGGCAC CGTGAGCGCG GCCGCAGCGC ATGCTCTCGC AGACCACGGC GAGCTGCGCT TCGCAGCGAC GCACGCCACG CAGCGCCTGT TGCGTGCGCT GGAAGCTGTT TACCTGCTTG ACCAGAGCAC TGATGGCTAC CAGATGCACG ACCTGGTTCG GCTGTACGGT CGCAGCCGCC GGGAACCGGA TGACGACGCC GCGTACGGCC GCGTCATCGA CTGGTACATC AGCGTCGCCG CCGCGGGGTT CGCCGTGCTG GCCCCGACGC AGCCGCGGCT GCCCGCCGAC GTCACACACC GGTTGACCAC CTCCGACGCG GAGCCCTTCG ACGGCGAAGC AGCGGCCCTG GCCTGGTTCG AAGAACAGAT CGACAACCTC GTGGCGGTGC TCAGGCACGC CGCCGAACGC GAAGAGCACC GTGTGGTGTG GCAGATCGCC GTCGGCGTCA GCCCCTATCT GCTGCGCCGC CACCGTTTGG ACGCCCTCGT GGAAACCCAG CGGCTCGGTG AGCGGGCAGC CCTCGGCGCG GACAACCCGC CGGCCGCCGC CGTGCTGGCG AACAACCTCG GCATCGCGTA CGCCATGCGT CAGGACCCGC AGGCCCAGGA ACCTTTCGAG CGGGCAGCGG CGATCCACCG CGCGCTGGGC GACCGTCGCC GAGCCGCCGG GGTCATGGCG AACCTCGGCA GCCTGCGTTA CGGCCTCGGC ATGCTGTCGA AGGCCGCCGA GACGCAGCGC AGGAGCCTGG AGGAGTTGCG TGAGTTCGGG CACAGCCAAG CGCTGTCCGC CGCGCTCGCC AACCTCGGAT TGACGCTGAG CGATCTCGGG CGGCACGACG AGGCCTGCAA GCTGTACGCC GAGGCCATAG ATGTCGCCGA GGAATGCGGC GCAGCGCAGC GGGCCGGGTA CGCGCGCAGC CAGCTGGCGT GGGTTCTGCT GCGAGTCGGC CAGGTCGACG AAGGGCTCGC GATGAGCCGC GCCGCCCTGC GGTACGCACA GGATCAAGAA GACGCGCTGC TGCTCGGCCG GATGCAGGAC CAGGTGGGGA TCGGCCTGGC GATGCTAGGG GCTTGGCCGG AGGCCCAGGC GGCCTGGCAG CAGGCTGTGG AGACGCTCGA CGGCATCGGC AGCCCCGAGG CGGAGGCTGT GCGGGCGCGC CTGCGAGCAG AACCGGAGGC TCTGCCGGTC GTCGGTCCGG ACGGAGGCCT GATCACACCG ATACGCAGCC GGTGA
|
Protein sequence | MTTFSILGPV EVRVDGYPVS VPRPRRRAVL AYLLLHANDR VETEQLIGAL WDAEPPRTAR AQVHSAVSAL RTGLPGTLGE GLVSQTSGYR LTVAADQLDL TLFRRLTSAA RAQMASRQFD TAVGSLRAAL ALWRGAALAG VEAPFVEPAR TRLEEERFSA YEALADLEMA AGRHVDLVPL LTGLLNEYPT RESLVERLAL ALYRGGRKTD ALALVRKLRT LLAEEYGLDP ERSIVELENS MLRGDPALEH QWPAMSAQPL SRSEAGAETG AAAGARPGAS TTGAAPAAPA APTLTVPTPV PTPVPTPAPV PVPAARTLPR PAQLPRASAG FVGRAAELSR LTRLLGSDND SPYIAVVTGP AGVGKTALAL LWAHRQADAF PDGQLFVDLH GYDRIEAENA DSVLERFLLA LGIPGHDIPS GLPKREDLFR SAVANRRMLL VLDNARDYQQ ISPLLPGSAL SRTVITSRAR MGSLVADTGA LTVQLGVLPL DESVEVLARI VGPDAVAGAP DASRELARLC AGLPLALRIS AVRLLEESTA GIAGLAAELT SEEHRLSALD LLDDGRTVSQ ALEHSHRGLS AEQGRLFRLL SRHPVGTVSA AAAHALADHG ELRFAATHAT QRLLRALEAV YLLDQSTDGY QMHDLVRLYG RSRREPDDDA AYGRVIDWYI SVAAAGFAVL APTQPRLPAD VTHRLTTSDA EPFDGEAAAL AWFEEQIDNL VAVLRHAAER EEHRVVWQIA VGVSPYLLRR HRLDALVETQ RLGERAALGA DNPPAAAVLA NNLGIAYAMR QDPQAQEPFE RAAAIHRALG DRRRAAGVMA NLGSLRYGLG MLSKAAETQR RSLEELREFG HSQALSAALA NLGLTLSDLG RHDEACKLYA EAIDVAEECG AAQRAGYARS QLAWVLLRVG QVDEGLAMSR AALRYAQDQE DALLLGRMQD QVGIGLAMLG AWPEAQAAWQ QAVETLDGIG SPEAEAVRAR LRAEPEALPV VGPDGGLITP IRSR
|
| |