Gene Caci_3752 details


Gene Information

Locus tag: Caci_3752
Symbol:
ID: 8335105
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4236608
End bp: 4239586
Gene Length: 2979 bp
Protein Length: 992 aa
Translation table: 11
GC content: 71%
IMG OID: 644956892
Product: transcriptional regulator, SARP family
Protein accession: YP_003114495
Protein GI: 256392931
COG category: [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family
TIGRFAM ID:
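
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted as a residue. Both assumptions are consistent with the listed values but are not stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 4236608
end_bp = 4239586

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2979, matching "Gene Length"
print(protein_length)  # 992, matching "Protein Length"
```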


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTTTCG GGGTCCTCGG TCCGCTCCTG GCGCACGACG GCTCGGCCGA CAGACGGATC 
GTCGCGCCGA AGCAACAAGT CATCCTGGCC ACGATGCTCC TGAACGCGAA CCGGGTGGTC
TCGGTCGAGC GCATCGCCGA GAACCTGTGG GCCGACGCCG CACCGGGCGG TGCGCACAAG
ACGCTGCACA CCTATGTCAT GAGGCTGCGA CGATCGCTCG GCACCGCCGC CGACCGGGTG
CGCACCGAGG CCCGGGGCTA CCGCTTCGTC GTGCAGGACG GGGAACTGGA TCTGCACCGC
TTCACCGCTC TGGTCGAAGC GGCCAAGGCC AGGAGCGCGG AGCAGGCTTG GGAAAGCGCC
GCAGACCTCT TACACACCGC GCTGGGAGTC TGGCGCGGGC AGCCGTTGCA GGGTGTCCAG
TCCGTCGCCC TGAGCGGCGA GGTCGACCGG CTCGCCGAAC TCCGGCTGGA CGCGCTGTTG
GCCCGGATCG ACGCGGACCT GCATTTGGAC CGCCACGAAC TGCTCGTGCC GGAACTGCGC
GACCTCGTCG TCCGGCAACC TCTTCTGGAA CGCGCGCACT CCTTCCTCAT GCTCGCGCTC
TACCGCGCGA GCCGGCGCGC CGACGCGCTG GCCGCCTTCC GGTCGGCGCG GCGCGTGCTC
GCCGAGGAGC TGGGGATCGA GCCCGGCCGG GACCTGCAGC GGCTGCACGG CCGGATCCTC
AACGCCGACC CGGCGTTGAA CGATCCGGCG TGGAACGACC CGGCTGGCTC CTCCCCGGAG
CCGCGATTTC CGTCGCAGCC GGCACCGGAT CCCACGTCGG ATGCGGAACC GGCCGCCCCC
GCGCCTCATC CCGCCGCGCC CGCACAGTTG CCGCGCGGCA TCCGTGACTT CACCGGACGC
GACCACGAAC TGGAGCGCAT CAAGACGCTG CTTTCCCACG ACCCGCCCGG CAGCGTGCCG
GCTCCGGGAG TCTGCGTCAT CGCCGGAGCC GGGGGCACCG GCAAATCGGC GCTCGCCGTC
CAGGTCGCGC ACGCCGTCCG GGACCGGTTC CCCGACGGCC AGCTCTATCT CGACCTGCGC
GGCGCCGACC GGCACCCGGT CGATCCCGGC CACGCGCTCG CCGAGTTCAT CCGCGCCCTC
GGAGACGGTG GTTCCGCGCT GCCCGAGGGG GTGGCGGACC GCTCCGCGGT CTTCCGCACG
ATGCTGGCCG ATCGGCGGGT GCTGATTTTG CTGGACGATG CCGGTGACGT CCAACAGGTG
CGGCCTCTGC TGCCGGCCGA TCCCCGGTGC TGCGTCATCA TCACGAGCCG CAGCCGCCTG
CCCGGCCTGG AGGACTGTGC GCGGCTGGAG CTCGGCTCGC TGTCACCACA GGACGGCGCC
AGCCTGTTCG GCAAAGTCGT GGGCGACGAG CGGCCGCAGT CCGAGCCGGC CGCGGTGGCG
CGCATCGTCG AGCTCTGCGG CGGGCTGCCG CTGGCGATCC GCATCGCCGG CTCGCGACTC
GCGGTGCGGC GGACCTGGCG GCTGGAGTCG CTGGCGGCCC GGCTGGGGGA CACCGCACGG
CGCCTCGACG AGCTCCGGAC GGACGACCTG CAAGTGCGGG CGACCCTCGA CATGAGCTAC
CAGCACCTGA CCGGCGACCA GGCGCGAGCC TTCCGGCTGC TCGCCGTCCC CGATGTCGAC
TCACTGTCCG TCTGGCACGC AGCGGTGCAC CTCGATGTGC CGGCACGGAC AGCTGAGACT
CTTCTGGAGA GCCTGGTGGA CGCCTTCCTG CTGGAGCCGG CCGGCGCCGA GCGCTATCGC
TACCACGATC TGACGCGCGT GTTCGCCCGC GAGGCGGCGT CCGTGACCGA GTCCGCTGAG
GCGTTGGCCG GCGCCGCGGG CCGGACTCTG GCGGCGTACG CAGAGCTGCT GGCCCATGCC
GCCGCCGCTG CCCGGCCGGG CTACCTCGAC GAATCCCCGC CCGCGCTCCG GTTCGGCACC
GCCCACGAAG CGTTGGACTG GCTGGATCAG GAGTTCCGCG CCGTCGGCGG ACTGATCGTC
CAGGCCGGCT CCGGCCCGGC CGACGCGGTC GGCGTCGCCG CGGACATGTT GTACCGCGTC
CAGTGGTACC TGCGGTCCCG GGGGCACTGG CGGCTGTGGC ATGACGCGGC GGCCGCGGTG
ATCGACGGCG CGGTCCGCAC CGGCGACACG GCCGCCGAGC TCGTCGGCCG CCAGAGCAGC
GGCCTGCTGG CCCTGCTCAC CGGAAGATTC GAGGAATCCG ACGAGAACCT GTCGGCGGCC
GTCGGGCTCG CCGAGCGGCT CGACGATTCC CTGGAAAAGG CCCGCGTCCT GAACCGGCGC
GGCCTGCTGG ACTTCCAGCG TGGTTTCTAC CGCGAGGCCG TGGCAGACCA CGAGGCGGCG
GCGGACCTGT TCAAGCGGCT CGGCAACCGG CTGGGCGAAT GCGCCAGCTT GGTGAACATC
GGCAAGTGCT TGCGCGTGTC CGGCGAACCG GCACGGGCTC TGGCTCACCT CGAACGGGCT
CTGGCGCTCA GTGAGGAACT GGGCGAATCG GAGAACGCGA CAATGGCCCG GCACCACCTG
GCCGCTTGCC ACTCCGAACT CGGCAACCAT GAAACGGCCA TATCTGCGCA GTATGACTGC
CTGGTCTTCA CGCGCGAACA CGGACTCCGC GAAGGCGAGG CCTTCGCTCT CGCCGAACTG
GGTCGTGCGC TCCTGAGAGC GGACCGCGCG CTCGAAGCCC TGGAGAGTTT CGAGGAGGCC
ATGGACCTCT TCAGCGCCCT CGGCGACCCC AATGCGGTCG CGGTGTTCCT CGCGGACTCC
GGCTTCGCGC ACCAACGCCT CGGCGACCTG GCCGCCGCGA CGAGCGCATG GCGGGCGGCA
CTGCCTGCGC TCCGGCCGGA CACGCGGGAG GCAGGCGCTG TTCGAGAGGT GTTGGGCGCC
TATACCCATG AAGAGATTCA CACCAGTGAA TCAGGGTGA
 
Protein sequence
MRFGVLGPLL AHDGSADRRI VAPKQQVILA TMLLNANRVV SVERIAENLW ADAAPGGAHK 
TLHTYVMRLR RSLGTAADRV RTEARGYRFV VQDGELDLHR FTALVEAAKA RSAEQAWESA
ADLLHTALGV WRGQPLQGVQ SVALSGEVDR LAELRLDALL ARIDADLHLD RHELLVPELR
DLVVRQPLLE RAHSFLMLAL YRASRRADAL AAFRSARRVL AEELGIEPGR DLQRLHGRIL
NADPALNDPA WNDPAGSSPE PRFPSQPAPD PTSDAEPAAP APHPAAPAQL PRGIRDFTGR
DHELERIKTL LSHDPPGSVP APGVCVIAGA GGTGKSALAV QVAHAVRDRF PDGQLYLDLR
GADRHPVDPG HALAEFIRAL GDGGSALPEG VADRSAVFRT MLADRRVLIL LDDAGDVQQV
RPLLPADPRC CVIITSRSRL PGLEDCARLE LGSLSPQDGA SLFGKVVGDE RPQSEPAAVA
RIVELCGGLP LAIRIAGSRL AVRRTWRLES LAARLGDTAR RLDELRTDDL QVRATLDMSY
QHLTGDQARA FRLLAVPDVD SLSVWHAAVH LDVPARTAET LLESLVDAFL LEPAGAERYR
YHDLTRVFAR EAASVTESAE ALAGAAGRTL AAYAELLAHA AAAARPGYLD ESPPALRFGT
AHEALDWLDQ EFRAVGGLIV QAGSGPADAV GVAADMLYRV QWYLRSRGHW RLWHDAAAAV
IDGAVRTGDT AAELVGRQSS GLLALLTGRF EESDENLSAA VGLAERLDDS LEKARVLNRR
GLLDFQRGFY REAVADHEAA ADLFKRLGNR LGECASLVNI GKCLRVSGEP ARALAHLERA
LALSEELGES ENATMARHHL AACHSELGNH ETAISAQYDC LVFTREHGLR EGEAFALAEL
GRALLRADRA LEALESFEEA MDLFSALGDP NAVAVFLADS GFAHQRLGDL AAATSAWRAA
LPALRPDTRE AGAVREVLGA YTHEEIHTSE SG
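
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the method used by the source database, that strips the block spacing, computes a GC fraction, and translates a coding sequence. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons. Alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in a cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First row of the gene sequence, copied from the record above.
first_row = clean(
    "ATGCGTTTCG GGGTCCTCGG TCCGCTCCTG GCGCACGACG GCTCGGCCGA CAGACGGATC"
)
print(translate(first_row))  # MRFGVLGPLLAHDGSADRRI
print(round(gc_content(first_row), 2))
```

The translated first row matches the first twenty residues of the protein sequence listed above. Running `gc_content` on the full cleaned gene sequence would give the value to compare against the listed GC content.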