Gene Information |
Locus tag | Caci_4249 |
Symbol | |
ID | 8335603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4820162 |
End bp | 4823488 |
Gene Length | 3327 bp |
Protein Length | 1108 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644957352 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003114954 |
Protein GI | 256393390 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
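The coordinate and length fields in this table are related by simple arithmetic. The following minimal sketch checks that relationship, assuming that Start bp and End bp are 1-based and inclusive and that Gene Length includes the stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4820162, 4823488)
print(length, protein_length(length))  # prints "3327 1108"
```

Under these assumptions the result agrees with the Gene Length (3327 bp) and Protein Length (1108 aa) rows.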
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.319347 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGACTTCC GTGTTCTGGG ACCCCTGGAG ATCGTCGACG ACGGGGTGCC GATACGGCTC GGCGGGCTGC GGGAACAAGC GGTCATGGCC ATGTTCCTGG TGCAGCCCGA CACCATCATC CCGGTCGAGC GCCTCGTCGA CGCGGTCTGG GGCGACCGGC CGCCGGCGAC CGCGCGGGCG CAGATCCAGA TCTGCGTCTC GGCGCTGCGC CGGCTGCTCG GCGACCCCGA GCGGATCCGG ACGCGCAATC CCGGCTACCT GTTCCATCTC GGGACCGACG TGCTGGACGC GCGGTTGTTC GAGCAGATCG CGGCGAAGGG CCACACGCTG CTGGCCGAGG GCCGGCGCAC CGAGGCCGCC GCGGAGTTCC GCAAAGCCCT GTCACTGTGG CGGGGACCGG CGCTGGCCAA CGTGGCCGGG GACGTCGTCC AGCACAGCGT GGCGCACCTG AACGAGCGCC GGCTGAACGT GCTGGAGGTG TGCCTGGAGG CCGAGCTGGA GGCCGGGACG CGCGGCGATC TGGTCGGGGA ACTGGTCCGG GTCTGCCACG AATACCCGCT GAACGAACGC TTCCGGCTGC TGCTCATGAC CGCGCTGTAT CGCGCCGGAC GGCAGGCCGA CGCGCTGGAG GTGTATCGCG CCACACGCGG AACACTGAAG GAGGAGCTGG GGATCGAGCC CGGTCCGGAG CTACGGCGGC TGCAGCAGGC GATTCTGAAC GGCGAGGTCC ACGGACAGGC AGCGGCTTCG ACACCATCTC AGTCGGCCAC GACGGCGCAG GCGCGGACAC CGACATCGGC GCCGACAACA ACGCCGAAGC CCGCAGCCGA GGCGCCGCCC CCGCCGGTAT CGCAGCCTTC GACCGCACCC GTCCCATCGC AAGACCCTGC CACGCCCGCC GCGCGCCGTC CCACAACCCG TGCCCAGCCT CCGACGCCGC GGCTCCTGCC GCCGGCCATC CCGGATTTCA CCGGCCGCGC CAAGGCGATC GCGCAGATCG TCTCGGAGAT GCCGGTGGTC CACACCCTCG ACGGCCCGGC GGCGCTGCCG GTCACCGTTC TGTACGGCCA AGGCGGCGTC GGCAAGACCA CGCTCGCGGT CCACGTCGCG CACCGGCTCG CCGAGTCCTA TCCCGACGGC CAGCTCTACG CGCGCCTTCG CGACGGAGAC CAGTCGGTCG CGCCGGCGGA CATCCTGGAA CGCTTCCTGC GCTCCCTCGG CGTCGCAGGG CCCTCGCTGG CCGACGGCCT GGAGGAGCGC GCCGAGATGT ACCGCAATCT GCTGGGCGAT CGGCGCGTCC TGGTAGTACT CGACGACGCC ATGACCGAGC ACCAGGTACA GCCGCTGCTG CCGGGCGGAT CGGGCTGCTC GGTGATCGTC ACCAGCCGGC GGCGGCTCAC CGGCGTGCCC GCGGCGGTGC GCCTGGAGGT CGGCACGTTC AGCGACGACA GCGCGGTGGC GCTGCTGAGC CGGGTCGCCG ATCCCGCGCG CATCCACGCC GAACCCGAGG CGGCGGCTCA GTTGTGCCGC CTGTGCGGAC ATCTGCCGCT GGCGCTGCGC ATCGTCGCCG CGCGGCTGGC CGCACGTCCG CACTGGAGCG TGCGGGCCTT GGTGGACCGG CTGATCGACG AATCGCGGCA GCTCGACGAG CTGAACCACG AAGGCGTCGG GATGCGCGCC AGCATCTCGG TCACCTACGC CGGGCTGTCG GCGGACGCGC GGCGCCTGTT CCGGCGGCTG GCACTGTTCG GCGGACCGGA CTTCGCCGCC TGGGTCGCGG CGCCGCTGCT GGACGCCGAC 
GTCTGGCGCG CGGAGGACCT GCTGGAGGAG CTGACCGAGG CCTACCTCAT CGACATCGAG CAGGGTCCGG ACGGCGAGCC GACGCGCTAC CGGTTCCACG ACATCGTGCG CCCCTTCGCC CGCGAGCGGC TGTTGGCCGA GGATCCGCCA GGCGATCGCC ACCAGGCGTT GGAGCGCTTG ATCGGCGCGC ACCTTTTCCT GGCCAAGCTG GCGCACGAGC GGGAGTACTC CGGGGACCAC CTGCTGCCCG CCGACACCGC CACGACCTGG CCGCTGCCGC CGGAGGCGGT GGCGCCGCTG ATCGCCGACC CGCTGACCTG GTTCGAGCGC GAGCGGCTGT CGCTGGTGGC GGCGGTGCGC CAGGCCGCCG CGCGCGGGCT CGCCGACAAG GCGTGGAGTC TGGCGCTGTC CTCGGTCGCG CTGTTCGAGG CGCGGTCCTA TTACGGCGAC TGGCGCGAGA CCCACGAGAC CGCGCTGGAG GCGGTGATCC AGGCCGGCGA CCGGCGCGGC GAGGCGGCGA TGCGCTACTC GCTGGGCTCG CTGCACATGT TCGAGGTCGA CAACGCCGGC GCCCGGCATC AGTTCGGGCT GGCCGCCGCC ATCTATCAGG AGCTTGACGA CCGGTACGGC GCGGCGCTGG TGCTGCGCAA CGTCGCGGTG CTGGACCGCC GCGAGGGCGA TCTGGACCGT GCGCTGGAAC GCTGGACGGA TGCGCTCGCC ACGTTCTGCG AGGCAGGAGA CCGCGTCGCG GAGGCGTACG TCCTGAACAG CATCGCGCAG GTCCATCTGG CGCGCGGCAA CGACAGCGCG GCGTTCGACC TGCTGACGCG CGCCGAGCTG ATCTGCGCCG AGACCGGCGT GCGCCGCGTC GCAGCACAAG TACAGCTGCG TCTGGGTCAC ATGTATCGCC ATCGCAACGA CATGGATCGG GCGCGCGCGG CGTATCAACA GGTGCTGGCC GCGGTCCGGG AGACCGGCGA CCGCATCGGC GAGTGCCACG CCTTGATGGG TCTGGGCGCG ACCGAGGCGG AGGACGGACG TCCGGGCCCG GCGGTCGACG TGCTGCGTCA GGCGCTGGAG GCCGCCGAGG CGGTCGGGGA CAAGATCCTC GGCGGACGCG CGGCGTTGAC GCTGGCTCGG GCGGAGCTGG CGGCCGGGCT GCTGGCCGAG GCCGCCGACG ACGCCGACCA CGCGGTCGAG GCGCTGGGTT CCGGGCTCGC CTCGGCGCAC GCGCTGGTGT TGCGCGGACG GATCCGCGAC GAGCGCGGCG ACGTCTCCGG GGCGGTGGGC GACTGGTGGC AGGCGGCCAC GGTCGTCACC GCGCTGACGG TGGACGAGGC TCAGGATCTG GCCGGGGAGA TCGCCGCGTT GCTGGCGGAG GTCACCGGCG GCGGCTCCGG CGGTGGTTCG GATGGTGGTT CCGGTGATGG CTCAAATGGT GGTTCGGGTG ACGGTTCAGA CGATGGCTCA GCCGGTGATC CCGGTGGCGC CGCGATGGTG GTTCAGGTGA CAGAACCCTC GGCGTGA
|
Protein sequence | MDFRVLGPLE IVDDGVPIRL GGLREQAVMA MFLVQPDTII PVERLVDAVW GDRPPATARA QIQICVSALR RLLGDPERIR TRNPGYLFHL GTDVLDARLF EQIAAKGHTL LAEGRRTEAA AEFRKALSLW RGPALANVAG DVVQHSVAHL NERRLNVLEV CLEAELEAGT RGDLVGELVR VCHEYPLNER FRLLLMTALY RAGRQADALE VYRATRGTLK EELGIEPGPE LRRLQQAILN GEVHGQAAAS TPSQSATTAQ ARTPTSAPTT TPKPAAEAPP PPVSQPSTAP VPSQDPATPA ARRPTTRAQP PTPRLLPPAI PDFTGRAKAI AQIVSEMPVV HTLDGPAALP VTVLYGQGGV GKTTLAVHVA HRLAESYPDG QLYARLRDGD QSVAPADILE RFLRSLGVAG PSLADGLEER AEMYRNLLGD RRVLVVLDDA MTEHQVQPLL PGGSGCSVIV TSRRRLTGVP AAVRLEVGTF SDDSAVALLS RVADPARIHA EPEAAAQLCR LCGHLPLALR IVAARLAARP HWSVRALVDR LIDESRQLDE LNHEGVGMRA SISVTYAGLS ADARRLFRRL ALFGGPDFAA WVAAPLLDAD VWRAEDLLEE LTEAYLIDIE QGPDGEPTRY RFHDIVRPFA RERLLAEDPP GDRHQALERL IGAHLFLAKL AHEREYSGDH LLPADTATTW PLPPEAVAPL IADPLTWFER ERLSLVAAVR QAAARGLADK AWSLALSSVA LFEARSYYGD WRETHETALE AVIQAGDRRG EAAMRYSLGS LHMFEVDNAG ARHQFGLAAA IYQELDDRYG AALVLRNVAV LDRREGDLDR ALERWTDALA TFCEAGDRVA EAYVLNSIAQ VHLARGNDSA AFDLLTRAEL ICAETGVRRV AAQVQLRLGH MYRHRNDMDR ARAAYQQVLA AVRETGDRIG ECHALMGLGA TEAEDGRPGP AVDVLRQALE AAEAVGDKIL GGRAALTLAR AELAAGLLAE AADDADHAVE ALGSGLASAH ALVLRGRIRD ERGDVSGAVG DWWQAATVVT ALTVDEAQDL AGEIAALLAE VTGGGSGGGS DGGSGDGSNG GSGDGSDDGS AGDPGGAAMV VQVTEPSA
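The sequences above are printed in blocks of 10 characters. The sketch below shows one way to strip that spacing, compute GC content, and translate the gene sequence. It assumes the sequence contains only A, C, G and T, and it uses the standard amino acid assignments, which translation table 11 shares. Table 11's alternative start codons are not handled, which does not matter for this gene because it starts with ATG. All names are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGACTTCC GTGTTCTGGG ACCCCTGGAG"))  # prints "MDFRVLGPLE"
```

The printed residues match the first block of the protein sequence. Applying gc_percent to the full gene sequence gives a value to compare against the GC content row.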