Gene Caci_5833 details


Gene Information

Locus tag: Caci_5833
Symbol:
ID: 8337194
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 6737218
End bp: 6740031
Gene Length: 2814 bp
Protein Length: 937 aa
Translation table: 11
GC content: 73%
IMG OID: 644958937
Product: transcriptional regulator, SARP family
Protein accession: YP_003116532
Protein GI: 256394968
COG category: [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family
TIGRFAM ID:
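
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length(6737218, 6740031)
protein = expected_protein_length(length)
print(length, protein)
```

Under these assumptions the computed values agree with the Gene Length (2814 bp) and Protein Length (937 aa) fields.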


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGTAC GCGCCTCGGA CACCGATCCG ATCGCCATCC CCGCCGCCTT GCAGCGCGGC 
GTGCTGGCGA TCCTGCTCTC GCGCGCGAAC CTGGTCGTCT CCCGCGACGA ACTGGCCGAA
ACCCTGTGGG ACGGCCAGAA TCCCGGCACG GCGCGGACCA CGCTGGCCGC GTACGTCTCG
CGGCTGCGGC GCGTCCTGGG CCCGGACGCC GGCCGGCGGA TCGGCACCCG GCCGCCGGGA
TACGTGATCG AGATCGACGC CGAGGAGTAC GACTGCGCCC GCGCCGGGGA TCTGCACGCC
CGGGCGCGCG CGGCGGCCGA GCGCGGTGAT TGGGCGGCGG TGCAGGGCCT GGCCGCCGAC
GGGCTGCGGC TGTGGCGCGG GACGCCGTAC CAGGACGTGC CGGTGCCGCG GCTGCACCGG
GAGGACGCCG CCGCGCTGGA CGCGGTGCGG GTGCAGCTGA CCGAGCTGAC CGTGGAAGCG
GATCTGCGGC TGGGGCGCGG CGACAGCGCC ATCAGTACCC TGATGCGGCT GACCGAGCAG
GAGCCGCTGC GGGAGAGCTT CTCCGAGAAA CTGATGATGG CGTTGGCCGC GCAGGGCAGA
CGCGCCGAAG CCCTCGCCGT GTTCCACCGG GCGCGCAAGG TGCTCCGCGA GAATCTGGGT
ATTGATCCGG GACCGGATCT GGCTCGCGCG CATCACGACG TGCTGGCGGC TTCCGGCGGG
GACCGCCCGG CGTCGGCGGC GGCGTCGGCA GCAGCAGCGT CGGCAACGGG ATCGGCGGCG
GCCTCACCGG AGCGGATCCG AGTCCGCGGA CCGCGACAGC TCCCGCCGGC CGCCCGGCAC
TTCACCGGCC GGGAAGCGGA GCTGGCGGCG ATGGACGAGG CCGCGGCGTC CGGCGAGGTG
CTGGTCGTCA GCGGCCTGGC CGGCGTCGGC AAGACCGCGC TGACCACGCA CTGGGCCCAC
CAAGCCGCGC CGCGCTACCC CGACGGCCAG GTGTTCGTCG GGCTGCACGG TTTCGACCCG
CACAGCGTGC CGATGACCGC GCACACCGCG TGTTCGATCC TGCTGGAGTC CCTGGGTCTG
GCCACCTCGG AGATCCCCGC CGATCCCGAC GCGCGCACGG CGCTGTACCG GACGGTCGTC
GCCGGCCGGC GGCTGCTCCT GGTGCTGGAC GACGCGTGGG ACGCCGCGCA GATCAGACCA
CTGATTCCGG GTACGGCCGG CAGCCAGGTC GTGGTGACCA GCCGCAACCG GCTCGCCGGG
CTGGTCGCCG CCGACGGCGC GCGGCCGATC CTGCTGGCGC CGCTGGACAA CGGCCGCTCG
ATGGAGTTGC TCGCGCGGCG CTCGGGAATC CGGCCGCGCC CTGACGAGCC GGCCGACACC
GCCGCCGCCG AGGCGCTGGC TGCGGCATGC GCCGGATTGC CGCTGGCGCT GACGATCGCG
GCAGCCCGGC TTCAGCTGGA CCCGGACCTG TCCTGGTCGG CGCTCACCGA GCGGCTGCAC
GACCGCCGCG GCGCGCTGTC GACGCTGGAC GTCGGGGAGG CCAGCGGGAG CCTGCGTGCC
GTGTTCTCGA TGTCCTATCA GCGGTTGAGC CGATCGGCAG CGGCGTTGTT CCGACTGCTC
GGCATCCATC CGGGCCCTGA CATCGCCATG GCCGCGGCCG TGTCGTTGGC GGGCAGCGCC
GACACCTCGA GCACCGAGCA GGCGCTCGAG GAACTCCTGA GCGCCAGCCT GCTGCACCGG
CACGCCGGCC GGTTCCGGTT CCACGATCTG GTCCGTGCCT ACGCCGCCGA AACAGCACTC
GAAGACTCCC CCGAGGTACG CACCGCCGCG CGTGTGCGCA TCCACGACCA TTGCCTGCGC
TCGGCGATCG AGGCCGACCG TATGCTGCGC CCGACGCGCG AACCGCTGAG CCTGCCGGCT
CCGGTGCCCG ACGTGCGCCC GGAGTCCTTC GCCGACGTCG CCGCAGCCAC GGCGTGGTTC
CAGGAGGAAC GCCAGGTGCT CGCAGCGGTG GTCGCCTTGG CCTCGCGCGA GCAGGACGAC
GAGTACGCGC ACCGCCTGCC GTGGGCTATC AGCACCTACC TGAACCGCCG GGGCGAATGG
CCCGCGCTGG CCGAACTCCA CCTCCTGGGC GCCGAGGCCG CCGACCGCAC CGGCGACCCC
CGAGCCCGGG CGCGCACGCA CACCGACGCC TCCAGCTTCC TGCTCCAGAT CCTCTCCTTC
GACGAGGCGC GGCGGCATCT GGAACTGTCG ATGGCGCTGT GGCAGGAACT CGGCGACCTG
CGCGGGGCAT GGCTGTCCGA GCACAACATG GGCCACCTGT GCCACAAGGA GGGCCGGCAC
GCCGAAGCCG CCGAGCACGC ACGCCGCGCG CTGGAGCACG CCCGCGCCCG CGGCTGGGCA
CCGGACGTCG CGCTGGCCCT GTCCACCGCC GCGTGGAGTC TGACCCACTG CGACGACCAC
GCCGGGGCGC TGCTGTTGGC CGCCGAGGCG ATCGAGCTGG ACCGGTGCGC CGGGGACCGC
AACGGCGAGG CGCACGCGTA CGACACCGTA GGGCTGGCGT CGTTCCGCCT CGGCCGGTAT
CCCGAGGCCC TGGCCGCGTA TCACAAGGCA TTGCGGTTGT TCGAGGACCT CGGCGACATA
CGGTTCCGGG GCAGGGTCCT GATGCGCATC GGCCAGGTGC GGCGGGAAAG CGGGGAGACT
GCGGCGGCGC ACAGCGCCTG GAGCCAGGCG TTGAAGCTTT TCGAGCAGGT CGGGGCGCCG
GAGTCCGACG AGCTCCGGGA GATGCTGGAC GGGCTGGACG GTTCGGACTC CTGA
 
Protein sequence
MVVRASDTDP IAIPAALQRG VLAILLSRAN LVVSRDELAE TLWDGQNPGT ARTTLAAYVS 
RLRRVLGPDA GRRIGTRPPG YVIEIDAEEY DCARAGDLHA RARAAAERGD WAAVQGLAAD
GLRLWRGTPY QDVPVPRLHR EDAAALDAVR VQLTELTVEA DLRLGRGDSA ISTLMRLTEQ
EPLRESFSEK LMMALAAQGR RAEALAVFHR ARKVLRENLG IDPGPDLARA HHDVLAASGG
DRPASAAASA AAASATGSAA ASPERIRVRG PRQLPPAARH FTGREAELAA MDEAAASGEV
LVVSGLAGVG KTALTTHWAH QAAPRYPDGQ VFVGLHGFDP HSVPMTAHTA CSILLESLGL
ATSEIPADPD ARTALYRTVV AGRRLLLVLD DAWDAAQIRP LIPGTAGSQV VVTSRNRLAG
LVAADGARPI LLAPLDNGRS MELLARRSGI RPRPDEPADT AAAEALAAAC AGLPLALTIA
AARLQLDPDL SWSALTERLH DRRGALSTLD VGEASGSLRA VFSMSYQRLS RSAAALFRLL
GIHPGPDIAM AAAVSLAGSA DTSSTEQALE ELLSASLLHR HAGRFRFHDL VRAYAAETAL
EDSPEVRTAA RVRIHDHCLR SAIEADRMLR PTREPLSLPA PVPDVRPESF ADVAAATAWF
QEERQVLAAV VALASREQDD EYAHRLPWAI STYLNRRGEW PALAELHLLG AEAADRTGDP
RARARTHTDA SSFLLQILSF DEARRHLELS MALWQELGDL RGAWLSEHNM GHLCHKEGRH
AEAAEHARRA LEHARARGWA PDVALALSTA AWSLTHCDDH AGALLLAAEA IELDRCAGDR
NGEAHAYDTV GLASFRLGRY PEALAAYHKA LRLFEDLGDI RFRGRVLMRI GQVRRESGET
AAAHSAWSQA LKLFEQVGAP ESDELREMLD GLDGSDS
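
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. All function names are hypothetical, and only the first row of the gene sequence is used as sample input; the stop codons listed are those of translation table 11.

```python
# Stop codons of NCBI translation table 11 (the table named in the record).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons and ends in a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First row of the gene sequence as printed above (sample input only).
first_row = "ATGGTGGTAC GCGCCTCGGA CACCGATCCG ATCGCCATCC CCGCCGCCTT GCAGCGCGGC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_content(seq), 2))
```

To check the whole gene, paste every row of the gene sequence into one string and pass it through `clean_sequence` before calling `gc_content` and `ends_with_stop`.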