Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5051 |
Symbol | |
ID | 8336405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5797498 |
End bp | 5800395 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644958150 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003115752 |
Protein GI | 256394188 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAATTC GCTTGCTCGG TCCGGTCGAG GCGTGGGGCG ACCAGGACCG GCTGGACATC GGCTCGGCGA AATCGTGTCT GGTGCTGGCG GCCCTGACCA TCGCGCCGGG GCACATCGTG CCCTGGGACG TCCTGGTCGA CCGGGTCTGG GGCGAGCAGC TTCCCGGCGA TCCGAAGGCC TCGCTGTACG CGTACGTCGC ACGGCTGCGG CGAACGCTGG ACCCGGCAGG CGTCCACATC CTGAGTCGCC CCGGCGGGTA CCTGTGCGAC GTGCCGCCGG AATCGGTCGA TCTGGCGCGG TTCCAACAGC GGGTCGCAGA GCTGCGAAGC ATCGAGGCGG CCGATGCCGG CGGGCCCGAC ACCGCCGATC GGCTCACCGA GGCACTCGCC TGGTGGCAGG GCACGCCGCT GGCGAACCTG ACCGGCGAAT GGGTCACCCG GACCCGCCGG ACGCTGAACG AGGAACGCCT TGCCGCGTTG CTCCTGCTCG CCGACGTCCA GGCCCGACAC GGCCGGCTCG CAGACCTCGC CGCCGACCTG CTGGCCGCAT CCGCCGAGTA TCCGCTGTCC GAACCGCTTG CCGGGTACGT CATCCGCGCC CTGGCAGCGG CCGGCAGGCG CGCTGAAGCG CTCGACTACT ACGCCGACGT TCGCAGCCAT CTGGTCGACG AGCTCGGCGA GGAGCCCGGC GCGGCGTTGC AGCAGTTGCA CGTCCGGCTG CTCCGCCGCG ACCCGAGCCT GGCCGACGAG GCACCACCGG CCGCGGCCGC ATCGCTCGTG CCGCGTCAGC TGCCGTCAAT CGCACGCCAC TTCGTCGGCC GGCGCGCCGA GCTGAAGGCG CTCGACGGCG TGCTCACGGG CAGCCAAGCC GCCTCGGCGG TCCTCATCTC GGCGATCTCC GGCACGGCCG GCATCGGCAA GACGACGACC GTCGTGTATT GGGCGCACCA CGCGGCGCGG CAGTTCCCGG ACGGCCAGCT CTACGTGAAC CTGCGTGGCT TCGACCCGAC CGGACCGCCG ATGAAGCCGG AGGAGGCGAT CCGCGGCTTC CTCGACGTCT TCGCCGTCCC GAAAGAGCGG ATTCCGCACG GCCTGGACGC CCAAGCCGCG CTGTATCGCA GCCTGCTCGC CGGGCGCCGG ATGCTGGTGG TGCTCGACAA CGCCCGCGAC GCCGACCACG TCCGACCGCT GCTCCCCGGC TCGCCGGGCT GCCTGGTCCT GGTCACCAGC CGCAGCCGGC TCACCGGGCT GGTCGTCGGC CACGGCGCCA CGCCGATCAC GCTGGGTCTG CTCGACGACG CGGAGGCCGA GCACCTGCTG AGCCGCTACC TCGGAGCCGA ACGCGTCGCC GCCGAACCGG ACGCGGTACG TGTCCTGATT CAGCGGTGTG CCCGCCTGCC GCTGGCGTTG GCGGTCGCCG CCGCTCGGGC GCTGATGGAT CCCGCGATGC CGCTGGGCGC GCTCGCCGCC GAGTTGGCCG CCGCCCCCGG ACAGCTCGAC GCGCTGGACA CCGGCGATCC CTCGACGACA GCGCGAGCCG TGTTCTCGTG GTCGTATCTC GCGCAACGGC CTGAGGCGCA ACGACTTTTC CGGCTATTGG GACTGCACCC CGGACCCGAC ATCTCGGTTC CGGCAGCGGC AAGTCTCACC GGACTGAGCA CCGAAGAGGC AGCCGCGTTG CTCAGTGAGC TGACGCGAGC CCACCTGCTC ACCGAGCACG CATCGGGTCG ATACAGCAGC CACGACCTTC TCCGCTCCTA CGCCGCGGAG CTTGTGCAGA CAGAGAGCTC CAATGCCGAA CGCGACACCG CGTTCCGTCG GATGCTGCAT CACTACCTGC ACAGCTCATA CCTCGCCGGC CGGCTGCTGG ACCCGCATCG CAAGCCGATC ACCCCGGCGG CTTTGGTCGA CGGAGTCATC CCCGAGTCCT TCGCCGACCA AATGCAGCAG GCGCTGCGCT GGTTCGAGGC TGAACGCGAG GTGCTGCTCG CGGTGATCCG GCGCGCGGCA GCCGCTGAGC CGGCAGCCGA CAAGACGCTG GCCGACGAGC CGCACGCCGC CGACGTCGAC ACCCTCACCT GGGAACTCGC CTGGACGGTC ACTGACTACC TCGACAGGCG CGGGCACTGG CAAGACTGGC TCGCCACCCA GCAAGTCGCG ATGCAGGCAG CGCAGCGGCT TGGCGACCAA GCCAAACAGG CGCACTCCCA CCGTCTTCTT GCCAACGCGT ACATCGGGCT CGTCCACTAC GAGGCAGCCG CCGATCATCT GAGCCACGCA CTCGACTACC ACGACCGCCT CGGCGATCTC GAGGGCACCG CCAACTGCCG GCGGTCGCTG TGCCGCGTCC GCGAACTCCA AGGCAGGTAT CCCGAAGCAC TCGCCCACGC CGAGGAATCC CTGCGCCTCT TCCGCGCCAC CGACAACACC ATCGGCCAAG CCCGCGCCCT GAACGCGGTC GGCTGGCTGC ACATCCTGCT CGATGATCCC CAGCCCGCGC TCGAGTACTG CCAAAGCGCT CTGGCCTTGT TCCAGGAACT CGGCAGCACC TACGGGGAGG CGGTGACCTG GGACAGCGTC GGCTCAGCGC ACCACCGGCT CGGACAGACG GACCAAGCCA TCGCCTGCTT CCGGCGGTCC ATCGACCTGC TCCGGACCGT CGGCGACCGC CACACCGAAG CCGAGACCCT CACCAATCTC GGCGACGCCC AGCACGACAT CGGCCAGGAC GAGGCAGCCC GCACCACCTG GCAGCAGGCG CTGGAGATCT GCGAGCACCT CGATCATCCC GACGCCGAAA AGGTGCGGAC CAGGCTTCAC GCCTTGCGGC CGACGCCTCC CCAAACCCCT ACCTCAGCAG GGCTTCCGAA GGGGTTGGGA TCGGGCCGCC AGTCCTGA
|
Protein sequence | MRIRLLGPVE AWGDQDRLDI GSAKSCLVLA ALTIAPGHIV PWDVLVDRVW GEQLPGDPKA SLYAYVARLR RTLDPAGVHI LSRPGGYLCD VPPESVDLAR FQQRVAELRS IEAADAGGPD TADRLTEALA WWQGTPLANL TGEWVTRTRR TLNEERLAAL LLLADVQARH GRLADLAADL LAASAEYPLS EPLAGYVIRA LAAAGRRAEA LDYYADVRSH LVDELGEEPG AALQQLHVRL LRRDPSLADE APPAAAASLV PRQLPSIARH FVGRRAELKA LDGVLTGSQA ASAVLISAIS GTAGIGKTTT VVYWAHHAAR QFPDGQLYVN LRGFDPTGPP MKPEEAIRGF LDVFAVPKER IPHGLDAQAA LYRSLLAGRR MLVVLDNARD ADHVRPLLPG SPGCLVLVTS RSRLTGLVVG HGATPITLGL LDDAEAEHLL SRYLGAERVA AEPDAVRVLI QRCARLPLAL AVAAARALMD PAMPLGALAA ELAAAPGQLD ALDTGDPSTT ARAVFSWSYL AQRPEAQRLF RLLGLHPGPD ISVPAAASLT GLSTEEAAAL LSELTRAHLL TEHASGRYSS HDLLRSYAAE LVQTESSNAE RDTAFRRMLH HYLHSSYLAG RLLDPHRKPI TPAALVDGVI PESFADQMQQ ALRWFEAERE VLLAVIRRAA AAEPAADKTL ADEPHAADVD TLTWELAWTV TDYLDRRGHW QDWLATQQVA MQAAQRLGDQ AKQAHSHRLL ANAYIGLVHY EAAADHLSHA LDYHDRLGDL EGTANCRRSL CRVRELQGRY PEALAHAEES LRLFRATDNT IGQARALNAV GWLHILLDDP QPALEYCQSA LALFQELGST YGEAVTWDSV GSAHHRLGQT DQAIACFRRS IDLLRTVGDR HTEAETLTNL GDAQHDIGQD EAARTTWQQA LEICEHLDHP DAEKVRTRLH ALRPTPPQTP TSAGLPKGLG SGRQS
|
| |