Gene Caci_4465 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4465 
Symbol 
ID8335819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5086877 
End bp5089951 
Gene Length3075 bp 
Protein Length1024 aa 
Translation table11 
GC content77% 
IMG OID644957567 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003115169 
Protein GI256393605 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.435845 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATACGA TCCGGTTTCA GGTCCTCGGC CCGCTGCGGG GCTGCCGCGG CGAGGAGGAA 
CTGGCCACGG GCAGCCCGCA GCAGCAGGCG ATGCTGGCCG CGCTGTTGCT GCTGCCGGGG
CGGACCGCGA GTTCGGCGGA GCTGATCGAC GCCCTGTGGG GCGACCAGCC GCCGAGCCGG
GCGCGCTCGA TCCTGCGCAC GTATGCCTGG CGCTGGCGGC GGGTCCTGGA TCCGGACGCC
GCGGACGGCG CCGCCTCGGA AGTGCTGGTC TCGCTGGCCG GCGGCTACCG GCTGGCTCTG
CCGCCCTTCG GCGGCGGTGG CGGCGGGGCA GGCTCCGCAC GAGGGCGCGC GAACGGAAAC
GCCGCGCCGG CCGGTGCTTC CGCCGCCGCT GTCAACGGCG TCAAAGATCC TGGCGCCTCC
CCCACCGGCG AGAAGCCCGG CGGGTCCAGC GGGTCCCGCG AGTCCGGTTG GATCGACTGG
TCCAGCGGCG ACAGCCTGCC GGTGGACGCC GAGCGCGCCG AGCACTGGGC CGCCGAGGCC
GACAAGGCAA GCGCTGCCGG ACAGCCGGAG CAGGCTCGGG AGCTGCTGCG GCGCGCCGTG
GACCTGTGGA CCGGGGTGCC GCTGGCGGGG GTCCCGGGAC CGTTCGCCGA ACGGCAGCGG
CGGCGCTTCG CCGAGCTGCG GCTGAGCCTG CTGGAGCGGC GGATCGCGCT GGACGTGGAG
CTGGGACGCG GCGCTTCGTG CGTGCCGGAG CTGCGGGCGC TCACCGACGA ACACCCGCTG
CGGGAACGGT TGTACGCGCT GCTGATGCGG GCGCTGTCCC AGTCCGGACG GCAGGCCGAC
GCGCTGGCGG CGTTCACCGC GGCGCGGCGG CTGCTGATCG GCGAACTCGG CGTCGAGCCG
GGCGCCGAGC TGCGCGCGAT GCACGCCGAG GTGCTGGCCG GCGGTCCGCC GAGCCCGCCG
GCCGGAGCCG CGGCGACGGC GGCCGCGGCG TCGCGGATGC GGCGCGTCCC GGGTCAGGGC
CCGCAGAATC CGGTCGTGCC GCGTCCGGCG CAGCTGCCGC CGACCGAACC GGACTTCGTC
GGGCGCGGCG CGCTGGCCGA ACGCCTCGGC GCCGAGCTGA GCGCGGGCGC CGCCGGCACC
ACGCCGACGG TGCTGGCGAT CGCCGGGATG GGCGGCGTCG GCAAGAGCAC GCTGGCGCTG
CACGTCGCAC ACCGCGCGCG CCCGGCCTTC CCCGACGGCC AGCTCTACGC CGATCTGCGC
GGCACCGGTG CCACGCCGGT CCCGCCGCAG GCCGTGCTGG AGGACTTCCT GCACGCCCTC
GGCGTGGCGA CCGAGCAGAT CCCGGAGGGT ACGGCCGCGC GCTCCTCGCT GTTCCGCACT
CTGCTCGACG GCCGCCGGCT GCTCGTCGTG CTCGACGACG CCGCCAACGC CGCGCAGGTC
AGACCCCTGC TCCCGGGCGC CGGCGGCTGC GCGGTGCTGG TCACCAGCCG GGCCCGGCTG
GTCGCGCTGC CGAAGTCGGC GCAGGTGTGG CTGGACGTGT TCGACGACGA GGAAGCCCTC
GGGCTGCTCG GGCGGGTCGC CGGCCCGGAG CGTCCGCACG CCGAGCCGGA GGCCGCGCGC
CTGCTGGTCG ACGCCTGCGG ACGGCTGCCG CTGGCGGTGC GGATCGTGGC CGCCCGGCTC
GCCGCGCGCC CGGCGTGGAC CGTCGCCTCG CTGGCCGGAC GGCTGGCGGA CGAGCGCTCG
CGGCTGCGGG AGCTGCGGAT CGGGGAGCTG GCGGTCGCGC CGGCCTTCGA GGTCGGCTAC
CAGCAGCTGA CCGCCGCGCA GGCACAGGCC TTCCGGCTGC TCGGCGCGGT CGAGGCGGCG
GAGATCGGGC TGCCCGCCGC GGCCGCGGTA CTAGAGCTCC CGGTGGCCGA CGCCGAGACG
GTGCTGGAGT CGCTGGTCGA CGTGGCGATG CTGGAATCGC CGGCCGAGCA CCGGTACCGG
CATCACAGCC TGCTGCGGGA CTTCGCGCAC GGCGCTGTCG GGGACGCCGA CCGCGCGGCG
GCCGAAGGGC TGGCGGCGCG CTCGCGGCTG GCCCGGTTCC TGCTCGCCGG GGCGTGTGCG
GCCTTCGAGA CGGCGGTCCC CGGCGACCCG ATCCGGGAGA CGCTGGCGCC GCAGGGCATC
GGCGACTTCG CGTTCGACTC CCCGGCCGCC GCCCGCGCCT GGGCGCGCGG CGAGGCCGCG
ACCGCCGCGG AGCTGACGGC TCGGATCGCC GCCGAGGCGC TGCGCGAGGG CCCGCGCGCC
GCGGAGTACC GGGAGCTGAT CCCGGCTTCC ATCAACCTGC TGATAGCGAT GAGTCCCTTC
GGTCCGGGAC CGTGGGGACG CCGGACCGCG GCGGCCGTGC AGGACCTGGC GCGCGCCGCG
GAGCGAGCCG GGGACGTGCG CGGGCAGGGC CGGGCGTGGT TCCTGGCGGG CAACACCGCG
CTGGCGGCGG GGCGGCTGGA CGAGGCCTCG CAGCACGGCC GGTGGGCGCT GGAGCTGTGC
ACCCTGGCCG AGGACCCGGT GATCGCCCGG CAGGTGCTGA ACGACCTGGG GGTGATCGCG
CACGGCCGCG GCGCGTACGA CGAGGCGGCG AGCCTGTTCG GCGAGGCGGT GGCGCTGGCG
CGGTCCCTGG GACACCGCAG CGGCGAGGCC AGTTCGCTGC TGAACATGGC GGTCTCCCGC
CTGCGCGCCG GACGCGCCGC CGAAGTCCTC GCCGACTGCG ACGGAATGCT GGCCTCGGCC
CACGAACGCG GCGACGCGGC TTCAGAGGCG CAGACCCGCT ATGTCAGCGG CCTGGCGCTG
GCCGCCCTGG AGCGGTCGGC CGAGGCGGCG GAGCGGTTCG AGACGGCAGC GATCGACTGG
ACCGCGCTGG GCGCCCTGGA CCGTGCGGCC CGCGCCCGAT TCCAGCTGGC GAAGGCACTG
CACAAGCTCG GCGCGGACGA ATCCGCCCGC GATCACGCCC ACGCCGCGCT GGCCGAGTTC
GAGTTCGACG GCCGCGTGGC GGATCAGCGA GCGGTGCGGG CGCTGCTCGA CGAGCTGGAC
GCGCCGCCGG GCTGA
 
Protein sequence
MNTIRFQVLG PLRGCRGEEE LATGSPQQQA MLAALLLLPG RTASSAELID ALWGDQPPSR 
ARSILRTYAW RWRRVLDPDA ADGAASEVLV SLAGGYRLAL PPFGGGGGGA GSARGRANGN
AAPAGASAAA VNGVKDPGAS PTGEKPGGSS GSRESGWIDW SSGDSLPVDA ERAEHWAAEA
DKASAAGQPE QARELLRRAV DLWTGVPLAG VPGPFAERQR RRFAELRLSL LERRIALDVE
LGRGASCVPE LRALTDEHPL RERLYALLMR ALSQSGRQAD ALAAFTAARR LLIGELGVEP
GAELRAMHAE VLAGGPPSPP AGAAATAAAA SRMRRVPGQG PQNPVVPRPA QLPPTEPDFV
GRGALAERLG AELSAGAAGT TPTVLAIAGM GGVGKSTLAL HVAHRARPAF PDGQLYADLR
GTGATPVPPQ AVLEDFLHAL GVATEQIPEG TAARSSLFRT LLDGRRLLVV LDDAANAAQV
RPLLPGAGGC AVLVTSRARL VALPKSAQVW LDVFDDEEAL GLLGRVAGPE RPHAEPEAAR
LLVDACGRLP LAVRIVAARL AARPAWTVAS LAGRLADERS RLRELRIGEL AVAPAFEVGY
QQLTAAQAQA FRLLGAVEAA EIGLPAAAAV LELPVADAET VLESLVDVAM LESPAEHRYR
HHSLLRDFAH GAVGDADRAA AEGLAARSRL ARFLLAGACA AFETAVPGDP IRETLAPQGI
GDFAFDSPAA ARAWARGEAA TAAELTARIA AEALREGPRA AEYRELIPAS INLLIAMSPF
GPGPWGRRTA AAVQDLARAA ERAGDVRGQG RAWFLAGNTA LAAGRLDEAS QHGRWALELC
TLAEDPVIAR QVLNDLGVIA HGRGAYDEAA SLFGEAVALA RSLGHRSGEA SSLLNMAVSR
LRAGRAAEVL ADCDGMLASA HERGDAASEA QTRYVSGLAL AALERSAEAA ERFETAAIDW
TALGALDRAA RARFQLAKAL HKLGADESAR DHAHAALAEF EFDGRVADQR AVRALLDELD
APPG