Gene Caci_5039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5039 
Symbol 
ID8336393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5776269 
End bp5779250 
Gene Length2982 bp 
Protein Length993 aa 
Translation table11 
GC content73% 
IMG OID644958138 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003115740 
Protein GI256394176 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTCG GGGTGCTGGG ACCGGTGAGG GCCGGCGGCG GCCAAGAGGT CCCGGCGCTG 
ACGCCGATGG TTCGGAGTCT GCTGGCCGTG TTGTTGGTGG AGGCCGGACG GCCGGTGTCC
GAAGCCCGGC TGACCGAGGC GTTGTGGGGC GGCAGTCCGC CGCAGACGTC GAAAGCCGCC
CTCCAGAACC ATGTGCTCGG CTTGCGGCGC GCGCTCGGTG TGGACGAAGC CGCGCGCGTC
CGCAGAACCT ACGACGGCTA CCTGATCGAG GTCGAGGCCG GCGAACTGGA CCTGCGGGAG
TTCGAGCAGC TCTCCGGCGA GGGATCCGAG GACCTCGTGG CGGGCCGATG GCAGGCGGCG
GCTGACGCCC TGACCGGGGC GTTGGCGCTC TGGCGCGGCG ATCCGCTCGC CGACGCGCTC
CCCGCGACGC GGGACGCGGT GGACGTCGGC CGGATCCACG AGGCCCGGCT TCAGACCGTC
GAGCAGCTGG CCCGGGCCCG ACTCGAACTC GGACACTACG ACCGGGTCAT CGGCGAGATC
GAGCCGCTGC TGCGGGAGCA TCCCTGGCGG GAGGCCATGC ACGGGCAGCT GATGCACGCG
CTGCACGGCG CCGGACGGCA AGCCGAAGCG CTCACCGTCT ACCAGCGGCT GCGCACCGGC
CTGGTCACCG AACTCGGCGT CGAGCCCTCG GCCGGGCTGG CGGACCTGCA CCGGCGGATC
CTGGCCGGCG ATCCCGCGCT GATCAGGACG ATCACGCCGG CCGGCGCTAC TCAGGGTCCC
AGTCGCGTCA TCGGCGCGCC TGACGCCGGT CAGTCCGGTC ACTCCCGTGA GTCCGGTGAG
TCCGACCACA CAGCCCACAC AGCCCAAGGC TCCGGCGCGG ATTCCGCCCA CGGCGCCGAC
CACCGCCGGC CGCAGAACCC TGATCCGCGC GCCGCCGACG CCGCCAACCC CGTCACCCCC
GCCAACGCCG TCATACCCCG CCAACTGCCG GCGAAGATCA GCCACTTCAC CGGACGCACC
GCCGCCCTGG CGGTGCTGGA GGAGTTCCTC GCGGCGGCGG GCGAGGGCGA CCAGCCGCTG
ATCGCGCTCG TCGGCACCGC CGGCGTGGGC AAGACGGCGC TGGCGGTGCA CTGGGCCCAC
CGGATCGCCT ACCGATACCC GGACGGCTGC CTTTATGTGA ACCTGCGCGG CTTCGACCCC
TCACAGGAGC CTGTGACCCC CGAGCAGGCG ATCCGCGGCT TCCTCCAAGC CCTCGGGCTG
CCCCGGCAGG AGCTGCCGGC CCTGTTCGCC GACCAGGTCG GCCGCTACCG CAGCCTCGCC
GCCGAGCGCC GGCTCCTGAT CGTGCTGGAC AACGCCCGCG ACGCCGAGCA GGTGCGCGAG
CTGCTGCCCG GCAATCCGGC GTGCCTGACG CTGGTCACCA GCCGCGACCG GCTCACCGGG
CTGGTCGCCG TCGACGGCGC CCGGCCGCTC CGGCTCGACA CGCTGCCCGC CGACGAGGCG
TTCGACCTGC TGGCCCGCCG GCTCGGCGGG CGGCACGCGG CCGAGGAGCC GGACGCGATC
CGGGAGATCG CAGAGCTCTG CGCGCGGCTC CCCCTCGCCC TGAACATCGC CGCCGCCCGG
ATCGCCACGA ACCCGCATCT GCCGATCGAG ATGTTCGTCC AGGAGCTGCG CGAGGCCGGC
GCCACACTGA GGACCCTGGA CGCCGGCGAC CGGGCGGCCA GCGTCCGGAC CGTCTTCTCC
TGGTCCTACC GGCAGCTCGG CGGGCCCGCG GCCCGGCTGT TCCGGCTGCT GGGCGTACAT
CCGGGCCCTG ATCTGGGCCT GTCGGTCTGC TCGGCGCTGA CGGCCCGGCC GCGCGCCGCG
ACGCTGGCCA CCCTGGAGGA GCTCACCGGC CTGCACCTGC TCGACCAGCA CGCGCCGGGC
CGGTACGTCC AACACGACCT GCTGCGGGTC TTCGCCGGCG AACTCGGGCA GGCGGTGGAC
GGCCGGGACG CCAGCCGCGA CGCCGAGCTG CTCACCCTCG ACCACTACCT GCACAGCGCC
TTCGCCGCCG AACGCCTGCT CCAGCCGGCC CGGCCGCCGA TCGCGCTCGC GCCGCCGCAC
GCCGGCTCCG CGCCCCTGGA CTTCGCCGAC CTGGCCGAGG CGCTGCGCTG GTACGACGCG
GAGTACCCGG TGCTGCTGGC CGCCGCGCGG CGGGCCGGCG CCGTCCCGGA CCCGCACGCC
TGGCAGCTGC CCTGGTCGAT GGTCACCTAC CTCGACCGGG CCGGGTTGTG GCACGACCTC
ACCGAGACGC TGACCGGGTC GCTGGCGGCA CTGCGGCGGA TCGGCGACAT CCCGAACCTC
GTCGCCGCCC ACATGTGCCT GGCACAGGTC CTGGGCCACA GGCTCAACGA GGCTGAGGCC
GCGGAGACCC ACTTCCAGGC GGCCCTGGAC CTCGACCGCG AGACCGACGA CGCCACCACC
GAGGTGAGGG TCATGGCGAA TCTCATGACC CTGCGAGGAA GGCAAGGACG CTGGGCGGAG
TCGGTGGTCT TCGGACTGCG GGCTCTGAAG CTCCTGCGCG AGAAGGGAGA GACCACTGTC
CTCCTGCCGA CCGTCCTCAA CAAGGTGGGC TGGAGCCACG TCCACCTCGG CCGGTACGAG
GAGGCGCTCG CCTGCTCCAC CGAAGCGCTC GAGTTGTTCC GGGAGACCGG ATTCCGCATC
GGCCAGGCCG ACGCCCTGGA CACCCTCGGC CTGGCCCGCC ACCGGCTCGG CGACACCGCC
GGCGCTGTGG CCTGCTACGA AGCGGCCGAG GCGGTCTTCA TCGAGGTCGG CGAACGGTTC
CTGCTCGCCG AGACGCTGAT GCGCCTCGGC GACGTCCACC TCACCGACCA CGCCGAGGCC
GCCGCCCGCG AGGTCTGGAC CCGGTCGCTG GCGATCCTCA GCGATATCGG CCATCCGACG
GCCGAGCAGG TCGAGGAGCG GCTCCGGTCC CTGGACCGCT GA
 
Protein sequence
MKFGVLGPVR AGGGQEVPAL TPMVRSLLAV LLVEAGRPVS EARLTEALWG GSPPQTSKAA 
LQNHVLGLRR ALGVDEAARV RRTYDGYLIE VEAGELDLRE FEQLSGEGSE DLVAGRWQAA
ADALTGALAL WRGDPLADAL PATRDAVDVG RIHEARLQTV EQLARARLEL GHYDRVIGEI
EPLLREHPWR EAMHGQLMHA LHGAGRQAEA LTVYQRLRTG LVTELGVEPS AGLADLHRRI
LAGDPALIRT ITPAGATQGP SRVIGAPDAG QSGHSRESGE SDHTAHTAQG SGADSAHGAD
HRRPQNPDPR AADAANPVTP ANAVIPRQLP AKISHFTGRT AALAVLEEFL AAAGEGDQPL
IALVGTAGVG KTALAVHWAH RIAYRYPDGC LYVNLRGFDP SQEPVTPEQA IRGFLQALGL
PRQELPALFA DQVGRYRSLA AERRLLIVLD NARDAEQVRE LLPGNPACLT LVTSRDRLTG
LVAVDGARPL RLDTLPADEA FDLLARRLGG RHAAEEPDAI REIAELCARL PLALNIAAAR
IATNPHLPIE MFVQELREAG ATLRTLDAGD RAASVRTVFS WSYRQLGGPA ARLFRLLGVH
PGPDLGLSVC SALTARPRAA TLATLEELTG LHLLDQHAPG RYVQHDLLRV FAGELGQAVD
GRDASRDAEL LTLDHYLHSA FAAERLLQPA RPPIALAPPH AGSAPLDFAD LAEALRWYDA
EYPVLLAAAR RAGAVPDPHA WQLPWSMVTY LDRAGLWHDL TETLTGSLAA LRRIGDIPNL
VAAHMCLAQV LGHRLNEAEA AETHFQAALD LDRETDDATT EVRVMANLMT LRGRQGRWAE
SVVFGLRALK LLREKGETTV LLPTVLNKVG WSHVHLGRYE EALACSTEAL ELFRETGFRI
GQADALDTLG LARHRLGDTA GAVACYEAAE AVFIEVGERF LLAETLMRLG DVHLTDHAEA
AAREVWTRSL AILSDIGHPT AEQVEERLRS LDR