Gene Caci_5472 details

Gene Information

Locus tag: Caci_5472
Symbol:
ID: 8336830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 6310277
End bp: 6313261
Gene Length: 2985 bp
Protein Length: 994 aa
Translation table: 11
GC content: 74%
IMG OID: 644958574
Product: transcriptional regulator, SARP family
Protein accession: YP_003116172
Protein GI: 256394608
COG category: [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family
TIGRFAM ID:
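
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record.
start_bp = 6310277
end_bp = 6313261
reported_gene_length = 2985      # bp
reported_protein_length = 994    # aa

# Assumption: coordinates are 1-based and both ends are inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, gene_length == reported_gene_length)
print(protein_length, protein_length == reported_protein_length)
```

Under these assumptions both computed values agree with the reported lengths.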


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.959943
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.441932
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGGTC CAATACTGTC CGGTCCCATG TCAGGCACAC GGTTCGGCAT CCTGGGTCCC 
CTTCTGGTCG AGGACCCGGC CGGACCGCGT CCCATCGCCG CGGCGCGGCA GCGCGCGGTG
CTGGCCGCCC TGCTGCTCAC CGCGCCGCGC ACGGTCGCCG CCGGCGAGCT GGCCGAGCAG
GTCTGGAACC TGGAGCCGCC GGCCGGCGCC GCCGGGACGC TGCACAGCTA CCTGAGCCGG
CTGCGGGCGG CGCTCGGTCC GCTCGGGGAC CGGATCCGCA CGCACAGCTC CGGGTACACC
GTCGAGCTGC ACGACGGCGA GCTGGACATC GAGGTGTTCC GCGCGCTGCG CAACCGCGCG
AGGGCGGCGA TGACGCGCGG CGACCTGGAG AGCGCGACCG CCTCCTACAG CGAGGCGCTG
GCGTTGTGGC GGGGCGCGCC GTTGGCGGAT GTGCCGAGCG GACCCTGGCG CGACGATGCC
GTCCGCTATT GGGACGAGCA AGAGCTGCAG ACGCGCGAGG AGTTGTTCGA GGCCGAGGTC
CGGCTCGGGC GGGCGGCGGC GGTTGTGCCA CAACTGCGGG TCTTGGTCGC CGAGAACCCC
TTCCGGGAGC GTCCCGCCGC GCTGCTGCTC GGCGCCCTGG CGGCGGACGG CAGGCGCGCC
GAGGCGCTGG CCGAGTACCA GCGCGTGCGC CGCGTCCTGG TCGAGGAAGC GGGCATCGAG
CCGGGCGAGC AGCTGAAGGC CGCCTTCCTG GAGATTCTGC GCGAGGACGA CGACCGCCCG
GCCGCCCCGC GTCTGCTGCC CGCCGACCTG CCGGACTTCA CCGGACGCGA GGACCAGCTC
GCCGCTCTCG CGAAGGTGCT CACCGCGCCG GAGCCCGGAA ATCCGCCGGC GGTCGTGGTG
GTCACCGGTC CCGGCGGGAT CGGCAAGACC TCGTTCGCGG TGCGGCTCGG GCAGCGGCTG
CGTCCGGACT TCCCGGACGG CCAGGTCTTC GTGCGGCTCG GCGGGCTGCG CGCGCCGCGT
CGGCCGACCG AGCTGGTCGC CGAGGTGCTG CGCGCGCTCG GCGTCGCCGA GATCCCGGGC
GACCCGGACC GGCGCACGGC GTTGCTGCGC AGCACGCTGG CTGATCGGCG GGTCCTGCTG
GTCCTCGACG ATGCCACGGA CCCGGCGCAG ATCCGGCGGC TGCTGCCGGC GAGCGCGCCG
GCCGCGGTCG TGGTGACCAG TCGGCGGCGG CTGCCGGGGC TGGCCGGACA CGTTCCGGTG
GAGCTCGGGC GGCTGTCGGC GGAACAGGCC GCTGCGATGG TGGGGAACAT CATCGGCGCC
GACCGGACCG CCGCCGAGCC CGAGGCGCTC GCGCGGCTGG TCGAGGCGTG CGGCGGGCTG
CCGATCGCGC TGCGGATCTG CGGCGCGCGG CTGGCACTGC GGCGGGGACG GAGCATCGCG
TCGCTGGTCG CGCGGCTGGA GGCGGTCGGG AAGCGGCTGG AGGGGATCGA CGCGCTGCAT
CAGGAGGCTG CGGCAGCGAG CGTTGAGGGC GGCGTGGCGC TGCATCAGGA GACTGCGGCG
CACAAAGAGG CTGGCGCGCA CGAAGAGACT GCGGCGCATG AAGAGGCAGA GCCGGCGGAT
GTCGCGGGCG GCGTGGCGCT GCGCGAGGAG TCTCCGGCAG CTGATGCCGA GCGGTCTGCG
CTGCGCGGGC CGCTGGAGGA GAGCTATCTG GCGCTCAACT ACGGCGCGGC GGGTCGTGAC
GTGGATCTGG CGCGCGCCTT CCGGCTGCTG AGCCTGATCG GCGGCGAGCG GTTCAGCCTG
CCGGCGGCCG CGGCGGTGCT GGACGTCGAC GAGTTCGACG CCGACTCGGC GGTGGAGCAC
CTGGTCCAGG TCTCGCTGCT GGAGGCGGCG GCGCCGGACC GGTTCGCGTT CCATCCGCTG
ATCCAGGAGT TGGCGCGCGA GCACGCCGCG GCGACCGACG CCGACGACGC GCGCTCGGCG
GCCGTCGGGC GCTGGACGGC GTGGTGTCTG GCCGGCGCGG CGGCTGCGGA CCGATTGTTC
GACCCCAACC GACCGAAGCT GGCGTGGGAG GAATGGATTC CGGACGCGTC CCCGGCGCCG
TTCGCCGACC GGACCGAAGC CGGGGACTGG TTCGACCGGG AGTCCGCGGG ATTGCTCGAG
GCCGCTGCCG CCGCCATGGC CCAGGACGAC TTCGCGACCG CCGCGGCACT GCCGATGGTG
CTGCTGCAAA GCTTCCGAAC CCGAGGGCGC GTCGAAGAAC TCGAAGAACC GCTGCGGGCC
GGCGTCGAGG CGGCGGTGAA GCTCGGCGAG CCGGAGGTGG CCGGCGTGCA GTTGAACAGC
CTGGCGATCG TGTACGGCGC GCTCGGGCGG TTCGACGAGG CGATCGCCAC GTTCGCCGAG
GCGGTGCCGC ACTACGAGGC GGCCGGGCTC GCCGAGCGCG TGGCGCAGGC GCGCATCAAC
GCGGCGATCA CGGTGGCGCA GAGCGGCCGG CCCGGCGAGG CGGCCGAGCG ACTGACCGCG
TCGCTGGCAG AACTGGACGC GCTGCCGACG ACGCCTTTCC TGGCCAGCCT GCGGGTGTCG
GTGATGCTGG CGCTGACCGA GTCGTTGCGC GACTCCGGAC AACCGGAGGC GGCGCTGGAG
CTGTATCCGC GGTTGCTCGC CGCCGCCGAG GAGGTCGGGG ATACGCCGCG TCTGGCGATC
GCCTGGGGCA ACCTCGGGAA GCTGCACGCG AAGAACGGTC GCGCCGAGGA GGGCATCCCC
TGCATCGACA AGGCTTTGGA GCTCCATCGC TTCATCGGCA ACCGGGACGG CGAGGGGTAC
GCGCTGTGGG CGCTCGGCGA GGCACGGGCT CTGTTGGGGC AGCGTGATCA CGCGCGGTGC
GCGTGGTCGG AGGCTCGCGA GATCTTCCTG ACGCTCGGCC GGCACGGCTA TGCCGCCGAT
CTCGCGGCGT CGATCGCCGA GCTGGACGAG GCGGCGCAGG GCTGA
 
Protein sequence
MSGPILSGPM SGTRFGILGP LLVEDPAGPR PIAAARQRAV LAALLLTAPR TVAAGELAEQ 
VWNLEPPAGA AGTLHSYLSR LRAALGPLGD RIRTHSSGYT VELHDGELDI EVFRALRNRA
RAAMTRGDLE SATASYSEAL ALWRGAPLAD VPSGPWRDDA VRYWDEQELQ TREELFEAEV
RLGRAAAVVP QLRVLVAENP FRERPAALLL GALAADGRRA EALAEYQRVR RVLVEEAGIE
PGEQLKAAFL EILREDDDRP AAPRLLPADL PDFTGREDQL AALAKVLTAP EPGNPPAVVV
VTGPGGIGKT SFAVRLGQRL RPDFPDGQVF VRLGGLRAPR RPTELVAEVL RALGVAEIPG
DPDRRTALLR STLADRRVLL VLDDATDPAQ IRRLLPASAP AAVVVTSRRR LPGLAGHVPV
ELGRLSAEQA AAMVGNIIGA DRTAAEPEAL ARLVEACGGL PIALRICGAR LALRRGRSIA
SLVARLEAVG KRLEGIDALH QEAAAASVEG GVALHQETAA HKEAGAHEET AAHEEAEPAD
VAGGVALREE SPAADAERSA LRGPLEESYL ALNYGAAGRD VDLARAFRLL SLIGGERFSL
PAAAAVLDVD EFDADSAVEH LVQVSLLEAA APDRFAFHPL IQELAREHAA ATDADDARSA
AVGRWTAWCL AGAAAADRLF DPNRPKLAWE EWIPDASPAP FADRTEAGDW FDRESAGLLE
AAAAAMAQDD FATAAALPMV LLQSFRTRGR VEELEEPLRA GVEAAVKLGE PEVAGVQLNS
LAIVYGALGR FDEAIATFAE AVPHYEAAGL AERVAQARIN AAITVAQSGR PGEAAERLTA
SLAELDALPT TPFLASLRVS VMLALTESLR DSGQPEAALE LYPRLLAAAE EVGDTPRLAI
AWGNLGKLHA KNGRAEEGIP CIDKALELHR FIGNRDGEGY ALWALGEARA LLGQRDHARC
AWSEAREIFL TLGRHGYAAD LAASIAELDE AAQG
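
The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its codons translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence, copied from the record.
first_line = "ATGTCCGGTC CAATACTGTC CGGTCCCATG TCAGGCACAC GGTTCGGCAT CCTGGGTCCC"
dna = clean(first_line)

print(len(dna))        # 60 bases
print(translate(dna))  # MSGPILSGPMSGTRFGILGP
print(round(gc_fraction(dna), 2))
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence through `clean` and `gc_fraction` would give a value to compare against the reported GC content.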