Gene Caci_6887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6887 
Symbol 
ID8338253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7956804 
End bp7959611 
Gene Length2808 bp 
Protein Length935 aa 
Translation table11 
GC content67% 
IMG OID644959975 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003117566 
Protein GI256396002 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.691511 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.899507 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACCTGA TCGCACGCCT TTCCAAGGCG GCTTTCTCTC TGGCCCTTAC CGCCGGGCTG 
TCCTGTGGCG TTCCGTGGTT GCTGCTGACC TATATCGGAA GTCCTGTTCC GAAGCAGTTC
CCGGGCTTGC ATGAGATCGT GCCTGCTCTG TCGGCACGGT TCGACATCCA CGTCTTCATC
ACGATCATCG TCTATCTGCT GTGGGCTTGC TGGGCTGTGC TCGTTGTGCA GCTGCTCGTC
CAGGTTCCTG GGACGTTCGT CGACATCGTC AGAATCCTGA GGCGTCGCGA GCCGGTCCGT
CGAGGCGCAG CTTGGGGACC GGGCGGCGCT CTGGCCCGAG GCTTGATCGC GGCCTTCACT
ATCGCGCTGC TCGCGCCGCG CGCAGCCGAC ACCGCGGCGG CTGCGGCGAA GGCAACGGGC
TACGTTCTCA GCCCCACAGC CAAGATTGCC TCCGTTGCGC CGGCCGTCCC AGGCGCCCGA
TCGGCGAGCG GAGATGACTA TGTCGTTCGA TCTGGCGACA CCTTGTGGGA TATCGCGGCG
CTGCACCTCG GTGAACCTGA GCGCTGGCAC GACATCTATA GCCTGAACGT CGGCCGGGTC
CAGCCTGATG GTGGCGTGCT GTCCGATCCG CAGCTGATCC AGCCGGGCTG GCAGCTGCGT
CTCCCCGAGA ACGTCGTAGC GCCTGCCTCG GCGGTACCGG CTCCGGCGGT GTCAACCACA
AGGCCGAGCA ACGCAGGTAC CGTGCTGGTC CCGGAGCCAG TAGAGACGCC ATCTCTGATC
GCTGCGGGTG GCTCTCCGGC AGCTTCGGCA ATTCCATCCC GGGCTGAACA CCCGATCCCC
GCGCAGCGGA TCGCACCGCG CGATCGTGCA GCGGTCCGAC TGCCTGGTGG CGGAGTGGTT
CCCGTCAGCC TCGCTTCCGG TGTCGCCGCC GCGCTCGCGT TGGCGCGCCT TCGCACGCGT
GCCAGGAGCC GGATCCAGCC GGTCGATGCC CCGGGCGATT CCGAACCTCT CGCACCGCTC
CTCGAGCCCG TTCGCGCAGA GCTGTTGCGT GCACACCACG CGACGGTGTG CGCCCCGGGG
AAGGGTCTCT TCCAGGATGA CGAGGACTTT GGCGACGACC CGTTCGCCGA TGACGAACCG
CTGCCCAGCG AGCCCGAATC CGCCTCGGTG ACCGAGGACG CCCTGTGGTC GTTGAAGTCC
GAACCCGGTA CCCCACTCAC CACTCCAGAG TTCGCGCCGA GCCTTCGGGC TGTCTTGGAC
GGCGCCGATG CCCCTGATGA CGTCCACGTG GCGATCTGTG GCAACTCCCC GATTCCGTTG
GCTTCCGTCA CGACAGCAGG ACTTGGGTTG ACCGGAGACG GGGCGGCCGA CGCTGCGCGG
TCTTTGCTGG TCAGCGCACT GGCGGCCGGT GGTCCGAGGG CCATTGATCA GGCCGTCGAA
GTCCAGACTA CGGTCCAGGC GCTGTCGGTG CTGCTGGGCC CTGGCGCCCT GACCGCCGAT
TCCGATCGGC TGACGGTTTT CGACTCCTTG ACCACGTTGC TGGACGACGC TGAGCGTGAG
CAGTCGGCGC GAGCGGCGGA GGTCGCAGAG TATGGACAGG CGAACGCTGC TGACGTCCGT
CGCTTCGATA ACGTGGAGCC CTTCCGCCCC CGCATCCTGC TGGTCCATCT CGAAGCCGGC
GAACGGCACC GGCTGGAGGC GATCGCCCGT GCCGGCGGCA AGGTCGACAC TCATCTCGTG
CTGCTCGGGC CGTGGGCCGC AGGTACTTCC GTCACAATCG GCGCTGACCG CCTCCTGACA
GCCACGGGTC CCGACGCCGA GTTGCTGCGT GACGCATCGG CCTTCGGCAT CACGATGGAC
GAGGCCGAAC AGATTCTCTC TGCTCTGGGC ACTGCTGCTG AGTCGCCCGC CACTCGCGAA
CCGTTCACCG ATACCGAAGC CGATGGCGAG CCGCCGGTTG AGGCCTCGCG TCAGGACGAA
TCGGTCGTGC CGTCAGCGGC GACGCCCGAC GATACAGTCC CGGTCGTGGC CCGAGCGCAC
GTCGACCAGG GTGTTCTCCT CTTGAAGGTC ATCGGTCCCT TTACCGCCGA GATCGACGGC
CGAGACGTCA CTGGATGCTT CAACCCGAGC CACAGGACAC TTCTTCTCTA CCTCGCCCTC
CGGGAGCGCC CCGTTCGGCG AGCCGAGATC ATCGAAGCTC TGTGGACGGA CGACCAGGCC
GACGGCAAGA ACGCGGAGAA GAAGCGCAGG ACACGGTTCG ACACCCGGCT GTATCAGACC
AAGAAGGCTC TGGCCGATGC CGTCGGGCAC GACTCGGAGT TCATCTCCTC GGATCGCGCC
TCCGGCTTCA TCACGCTCAA CCGGACCCTC ATCCTCACCG ATCTCGCCTG CTTCGACCAG
CTCGTCGGCC GCGCTTCCCG AGCCGCCGAC GACGCAGAGA AGACCGCACA CCTCGAAGCA
GCATGCGCGC TGTACCGCGG ACCACTGGAC GAGAGCATCC GCGGCGACTG GCTGCTGGAG
CATCGCGAGG ACCGGCTGCG CCGCTACCGC GATGCAGCCG GCGACCTCGC CCGCATCGTC
GGCCGAACCG ACCCCGACCG CGGACTTGCC ATCCTCAACC AGCTGCTGGA GCACGACCTC
TTCAACGAGG ACCTCTACCG CCGGATCATG CGTGGGCAGG CACGGCTGGG GCGGCACGAC
GCGGTGCGAC GGACCTTCAA TCTTCTGGAG ACCCGCTTCG AAGCGGTCGA GCTCGTGGTC
GATGCATCCA CCAGGGCGCT CGTACGGACA TTGACTCGTA ACGTTTGA
 
Protein sequence
MNLIARLSKA AFSLALTAGL SCGVPWLLLT YIGSPVPKQF PGLHEIVPAL SARFDIHVFI 
TIIVYLLWAC WAVLVVQLLV QVPGTFVDIV RILRRREPVR RGAAWGPGGA LARGLIAAFT
IALLAPRAAD TAAAAAKATG YVLSPTAKIA SVAPAVPGAR SASGDDYVVR SGDTLWDIAA
LHLGEPERWH DIYSLNVGRV QPDGGVLSDP QLIQPGWQLR LPENVVAPAS AVPAPAVSTT
RPSNAGTVLV PEPVETPSLI AAGGSPAASA IPSRAEHPIP AQRIAPRDRA AVRLPGGGVV
PVSLASGVAA ALALARLRTR ARSRIQPVDA PGDSEPLAPL LEPVRAELLR AHHATVCAPG
KGLFQDDEDF GDDPFADDEP LPSEPESASV TEDALWSLKS EPGTPLTTPE FAPSLRAVLD
GADAPDDVHV AICGNSPIPL ASVTTAGLGL TGDGAADAAR SLLVSALAAG GPRAIDQAVE
VQTTVQALSV LLGPGALTAD SDRLTVFDSL TTLLDDAERE QSARAAEVAE YGQANAADVR
RFDNVEPFRP RILLVHLEAG ERHRLEAIAR AGGKVDTHLV LLGPWAAGTS VTIGADRLLT
ATGPDAELLR DASAFGITMD EAEQILSALG TAAESPATRE PFTDTEADGE PPVEASRQDE
SVVPSAATPD DTVPVVARAH VDQGVLLLKV IGPFTAEIDG RDVTGCFNPS HRTLLLYLAL
RERPVRRAEI IEALWTDDQA DGKNAEKKRR TRFDTRLYQT KKALADAVGH DSEFISSDRA
SGFITLNRTL ILTDLACFDQ LVGRASRAAD DAEKTAHLEA ACALYRGPLD ESIRGDWLLE
HREDRLRRYR DAAGDLARIV GRTDPDRGLA ILNQLLEHDL FNEDLYRRIM RGQARLGRHD
AVRRTFNLLE TRFEAVELVV DASTRALVRT LTRNV