Gene Information |
Locus tag | Caci_5472 |
Symbol | |
ID | 8336830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 6310277 |
End bp | 6313261 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 644958574 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003116172 |
Protein GI | 256394608 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
| |
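The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length includes the terminal stop codon; the record does not state either convention, although both agree with the listed values. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no amino acid.
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 6310277, 6313261

length_bp = gene_length_bp(start_bp, end_bp)
protein_aa = expected_protein_length_aa(length_bp)
print(length_bp, protein_aa)  # 2985 994
```

Both results match the Gene Length (2985 bp) and Protein Length (994 aa) fields.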
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.959943 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.441932 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGGTC CAATACTGTC CGGTCCCATG TCAGGCACAC GGTTCGGCAT CCTGGGTCCC CTTCTGGTCG AGGACCCGGC CGGACCGCGT CCCATCGCCG CGGCGCGGCA GCGCGCGGTG CTGGCCGCCC TGCTGCTCAC CGCGCCGCGC ACGGTCGCCG CCGGCGAGCT GGCCGAGCAG GTCTGGAACC TGGAGCCGCC GGCCGGCGCC GCCGGGACGC TGCACAGCTA CCTGAGCCGG CTGCGGGCGG CGCTCGGTCC GCTCGGGGAC CGGATCCGCA CGCACAGCTC CGGGTACACC GTCGAGCTGC ACGACGGCGA GCTGGACATC GAGGTGTTCC GCGCGCTGCG CAACCGCGCG AGGGCGGCGA TGACGCGCGG CGACCTGGAG AGCGCGACCG CCTCCTACAG CGAGGCGCTG GCGTTGTGGC GGGGCGCGCC GTTGGCGGAT GTGCCGAGCG GACCCTGGCG CGACGATGCC GTCCGCTATT GGGACGAGCA AGAGCTGCAG ACGCGCGAGG AGTTGTTCGA GGCCGAGGTC CGGCTCGGGC GGGCGGCGGC GGTTGTGCCA CAACTGCGGG TCTTGGTCGC CGAGAACCCC TTCCGGGAGC GTCCCGCCGC GCTGCTGCTC GGCGCCCTGG CGGCGGACGG CAGGCGCGCC GAGGCGCTGG CCGAGTACCA GCGCGTGCGC CGCGTCCTGG TCGAGGAAGC GGGCATCGAG CCGGGCGAGC AGCTGAAGGC CGCCTTCCTG GAGATTCTGC GCGAGGACGA CGACCGCCCG GCCGCCCCGC GTCTGCTGCC CGCCGACCTG CCGGACTTCA CCGGACGCGA GGACCAGCTC GCCGCTCTCG CGAAGGTGCT CACCGCGCCG GAGCCCGGAA ATCCGCCGGC GGTCGTGGTG GTCACCGGTC CCGGCGGGAT CGGCAAGACC TCGTTCGCGG TGCGGCTCGG GCAGCGGCTG CGTCCGGACT TCCCGGACGG CCAGGTCTTC GTGCGGCTCG GCGGGCTGCG CGCGCCGCGT CGGCCGACCG AGCTGGTCGC CGAGGTGCTG CGCGCGCTCG GCGTCGCCGA GATCCCGGGC GACCCGGACC GGCGCACGGC GTTGCTGCGC AGCACGCTGG CTGATCGGCG GGTCCTGCTG GTCCTCGACG ATGCCACGGA CCCGGCGCAG ATCCGGCGGC TGCTGCCGGC GAGCGCGCCG GCCGCGGTCG TGGTGACCAG TCGGCGGCGG CTGCCGGGGC TGGCCGGACA CGTTCCGGTG GAGCTCGGGC GGCTGTCGGC GGAACAGGCC GCTGCGATGG TGGGGAACAT CATCGGCGCC GACCGGACCG CCGCCGAGCC CGAGGCGCTC GCGCGGCTGG TCGAGGCGTG CGGCGGGCTG CCGATCGCGC TGCGGATCTG CGGCGCGCGG CTGGCACTGC GGCGGGGACG GAGCATCGCG TCGCTGGTCG CGCGGCTGGA GGCGGTCGGG AAGCGGCTGG AGGGGATCGA CGCGCTGCAT CAGGAGGCTG CGGCAGCGAG CGTTGAGGGC GGCGTGGCGC TGCATCAGGA GACTGCGGCG CACAAAGAGG CTGGCGCGCA CGAAGAGACT GCGGCGCATG AAGAGGCAGA GCCGGCGGAT GTCGCGGGCG GCGTGGCGCT GCGCGAGGAG TCTCCGGCAG CTGATGCCGA GCGGTCTGCG CTGCGCGGGC CGCTGGAGGA GAGCTATCTG GCGCTCAACT ACGGCGCGGC GGGTCGTGAC GTGGATCTGG CGCGCGCCTT CCGGCTGCTG AGCCTGATCG GCGGCGAGCG GTTCAGCCTG 
CCGGCGGCCG CGGCGGTGCT GGACGTCGAC GAGTTCGACG CCGACTCGGC GGTGGAGCAC CTGGTCCAGG TCTCGCTGCT GGAGGCGGCG GCGCCGGACC GGTTCGCGTT CCATCCGCTG ATCCAGGAGT TGGCGCGCGA GCACGCCGCG GCGACCGACG CCGACGACGC GCGCTCGGCG GCCGTCGGGC GCTGGACGGC GTGGTGTCTG GCCGGCGCGG CGGCTGCGGA CCGATTGTTC GACCCCAACC GACCGAAGCT GGCGTGGGAG GAATGGATTC CGGACGCGTC CCCGGCGCCG TTCGCCGACC GGACCGAAGC CGGGGACTGG TTCGACCGGG AGTCCGCGGG ATTGCTCGAG GCCGCTGCCG CCGCCATGGC CCAGGACGAC TTCGCGACCG CCGCGGCACT GCCGATGGTG CTGCTGCAAA GCTTCCGAAC CCGAGGGCGC GTCGAAGAAC TCGAAGAACC GCTGCGGGCC GGCGTCGAGG CGGCGGTGAA GCTCGGCGAG CCGGAGGTGG CCGGCGTGCA GTTGAACAGC CTGGCGATCG TGTACGGCGC GCTCGGGCGG TTCGACGAGG CGATCGCCAC GTTCGCCGAG GCGGTGCCGC ACTACGAGGC GGCCGGGCTC GCCGAGCGCG TGGCGCAGGC GCGCATCAAC GCGGCGATCA CGGTGGCGCA GAGCGGCCGG CCCGGCGAGG CGGCCGAGCG ACTGACCGCG TCGCTGGCAG AACTGGACGC GCTGCCGACG ACGCCTTTCC TGGCCAGCCT GCGGGTGTCG GTGATGCTGG CGCTGACCGA GTCGTTGCGC GACTCCGGAC AACCGGAGGC GGCGCTGGAG CTGTATCCGC GGTTGCTCGC CGCCGCCGAG GAGGTCGGGG ATACGCCGCG TCTGGCGATC GCCTGGGGCA ACCTCGGGAA GCTGCACGCG AAGAACGGTC GCGCCGAGGA GGGCATCCCC TGCATCGACA AGGCTTTGGA GCTCCATCGC TTCATCGGCA ACCGGGACGG CGAGGGGTAC GCGCTGTGGG CGCTCGGCGA GGCACGGGCT CTGTTGGGGC AGCGTGATCA CGCGCGGTGC GCGTGGTCGG AGGCTCGCGA GATCTTCCTG ACGCTCGGCC GGCACGGCTA TGCCGCCGAT CTCGCGGCGT CGATCGCCGA GCTGGACGAG GCGGCGCAGG GCTGA
|
Protein sequence | MSGPILSGPM SGTRFGILGP LLVEDPAGPR PIAAARQRAV LAALLLTAPR TVAAGELAEQ VWNLEPPAGA AGTLHSYLSR LRAALGPLGD RIRTHSSGYT VELHDGELDI EVFRALRNRA RAAMTRGDLE SATASYSEAL ALWRGAPLAD VPSGPWRDDA VRYWDEQELQ TREELFEAEV RLGRAAAVVP QLRVLVAENP FRERPAALLL GALAADGRRA EALAEYQRVR RVLVEEAGIE PGEQLKAAFL EILREDDDRP AAPRLLPADL PDFTGREDQL AALAKVLTAP EPGNPPAVVV VTGPGGIGKT SFAVRLGQRL RPDFPDGQVF VRLGGLRAPR RPTELVAEVL RALGVAEIPG DPDRRTALLR STLADRRVLL VLDDATDPAQ IRRLLPASAP AAVVVTSRRR LPGLAGHVPV ELGRLSAEQA AAMVGNIIGA DRTAAEPEAL ARLVEACGGL PIALRICGAR LALRRGRSIA SLVARLEAVG KRLEGIDALH QEAAAASVEG GVALHQETAA HKEAGAHEET AAHEEAEPAD VAGGVALREE SPAADAERSA LRGPLEESYL ALNYGAAGRD VDLARAFRLL SLIGGERFSL PAAAAVLDVD EFDADSAVEH LVQVSLLEAA APDRFAFHPL IQELAREHAA ATDADDARSA AVGRWTAWCL AGAAAADRLF DPNRPKLAWE EWIPDASPAP FADRTEAGDW FDRESAGLLE AAAAAMAQDD FATAAALPMV LLQSFRTRGR VEELEEPLRA GVEAAVKLGE PEVAGVQLNS LAIVYGALGR FDEAIATFAE AVPHYEAAGL AERVAQARIN AAITVAQSGR PGEAAERLTA SLAELDALPT TPFLASLRVS VMLALTESLR DSGQPEAALE LYPRLLAAAE EVGDTPRLAI AWGNLGKLHA KNGRAEEGIP CIDKALELHR FIGNRDGEGY ALWALGEARA LLGQRDHARC AWSEAREIFL TLGRHGYAAD LAASIAELDE AAQG
|
| |
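The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. A minimal sketch of that step, plus a GC-fraction calculation that could be compared against the GC content field, follows. The helper names are hypothetical, and the example string holds only the first two blocks of the gene sequence; the full sequence would have to be pasted in to reproduce the reported figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used in the record and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Only the first two ten-base blocks of the gene sequence, for illustration.
first_blocks = "ATGTCCGGTC CAATACTGTC"

seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))
```

Running `gc_fraction` on the complete cleaned gene sequence gives a value that can be set against the 74% listed in the record; the short excerpt here is not representative of the whole gene.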