Gene Caci_5355 details


Gene Information

Locus tag: Caci_5355
Symbol:
ID: 8336709
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 6171080
End bp: 6174496
Gene Length: 3417 bp
Protein Length: 1138 aa
Translation table: 11
GC content: 74%
IMG OID: 644958453
Product: transcriptional regulator, SARP family
Protein accession: YP_003116055
Protein GI: 256394491
COG category: [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family
TIGRFAM ID:
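The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 6171080
END_BP = 6174496
GENE_LENGTH_BP = 3417
PROTEIN_LENGTH_AA = 1138


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon
    that is not counted as a residue."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length:", span_length(START_BP, END_BP))
    print("expected protein length:", expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions the computed span matches the listed Gene Length, and the derived residue count matches the listed Protein Length.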


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.610748
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACATAT ATCTGACGGC GTGTCAGATC AATACTCGCA GGCTTCGCGC GTATCCTGCC 
CAGGTGGCAG ACAGCGCGTG GCGGTTCAGC GTCCTCGGCC CGGTCCGCGG CTGGGGTCCG
CACGGCGAAC TCGACCTCGG CTCGCCCCAG CAGCGCGAAC TCCTGGCCCT GCTGGCGCTG
CGCCCCGGCC GCGCCGCGAT GGCCGAGGAG CTGATCGACG CGCTGTGGGG CGGGGACGCC
CCGAAAGGCG CGCTGAGCAC CATCCGGACC TACGCCTCGC GGCTGCGGCG CGCGCTGGGC
GGGATGGACG GAACAGGCGG GACGGGCGGG ACGGGCGGGA TGAACGCCAA GACCGGTCAC
ACAGGTCGGT TCGGGCAAGC AGATCAGGCG AGCCAGGCCG GTCACGCGGC GCCGTCCGAG
TACGCCGATC ACGCCGACCA CGCCAAGCGC CCCGGACAAG CGACGCACGC CGTCCAGCCC
AGCCAGGGAC CCCACGCCGG CCACGCGCAC CACGCCGGCG CCGACAGCCC GATCAGCTCG
GTGGCCGGCG GCTACGCGCT GACCGTCGAC CCCTCGGCGG TGGACGCGCT CGCCTTCGAG
GCCGGACAGG CGGTCGCGGC GGGTCTGCGC GCCACCGGCG ACCTGGTCGG GGCGGCGCGC
CGGCTGCACG AGGCGCTCGC GCTGTGGCAG GGCGAGCCGC TGGCCGGGCT GACCGGGCCG
TTCGCGGTCG CGCAGCGTGC GCAGTGGACC GAGCGCCGGC TGGGCGCGCT GGAAGCCCGG
ATCGCCTTGG ACCTGGAGGT CGGCCGGTAC AACGCGGTGA CCGCCGAGCT CGCCGCGCTG
GCGACGGAGT TCCCGCTCCG GGAACGTTTT CGAGAGCTGC AGATGCTCGC GCTGGCCCGC
AGCGGCCGCA AAGCCGAGGC GCTGGCGGTC TACGACCAGA CGCGGCGCAC GCTCGCCGAG
GAGCTGGGCG TCGATCCCGG ACCGGATCTG CGGGCCCTGT ATCGGCAGGT CGCGGCCGGT
TCCTCGGACG GCGGCAGCCG GGTGGACGGA GCGGTTCCCA GAGCTCTGAG GAAACCGTCC
GGCCCACTGG CCGAGGGACA GACCGGCCCT TCCGCGAAGC CCGTCCCCAG GCCGGTTTCG
CAGACCTCTG TCACTCTGCC CGGTCCGGCG CAACTCCCGG CGGACGTCGC CGACTTCACC
GGACGCGCCG TCTGGTCCCG GCAGCTGCTC GACCTGATGC AGCCCGGCGA GGCGATGCCG
CTGGCGGTGC TGTCCGGTAT CGGCGGCGCG GGCAAGTCCA CGCTCGCCGT CCACACCGCG
CACCTGGCTG CCGCCAAGTT CCCCGACGGC CAGCTGTTCG CCGCCCTGCG CGGGACCGAC
CGCGAACCGG CCGATCCCGG CGGTGTGCTC GGCGGCTTTC TCAGGGCTCT GGGGACCGAT
CCGGGCGCCG TGCCGGACAC GGTGCGGGAA CGTTCCGAGC TGTTCCGCAG CACCCTGGCC
GGACGCCGGG TGCTGATCGT GCTCGACGAC GTGCGCGACG CCGAGCAGAT CCGTCCGCTG
CTGCCCGGCA CGCCTGGCTG CGCCGTGCTG GCGACCAGCC GCTCCCGCCT GACCGGCGTC
CCCGGCGCGC GGCTGCTGGA ACTCGGCGCC TTCCACCCCG ATGAGGCGCT CGCGCTGTTC
CGCGCCGTCG CCGGTCCGGA CCGGGTGACC GGCTCCGAGC CGGCGGTGCG GCGCGCGGTC
GCGGTGTGCG GCCACCTGCC GCTGGCGGTG CGCATTCTGG CCTCGCGCCT GGCCGCCCGG
CCGCACTGGA CCGCCGAGAC GCTGGCCGGC CGGCTGTGCG ACGAGGCTCG GCGGCTGGAC
GAGCTGCGCG CCGGGGATCT GGCCGTGGAG GCGACGTTCC GGCTCGGCTA CGAGCATCTG
CGCCCAGATC AGGCGCACGC CTTCCGGCTG CTGGCCGTGC CCGACGGCCC GGACATCGGC
GTCGAGGCGG TCGCGGCGCT GTTGTGCTGT CCGCAGCAGG AGGCTGAGGA CCTCGCCGAG
GAACTGGTCG ATCTGTGTCT GGTCGAGTCG CCGAGCCCGG GACGCTACCG GCTGCATTCG
CTCCTGCGGA TGTTCGCGCG GCGGTTGGCG GCCGACATCG ACGGTCCCGG TGCGCCGCGC
GCGGCACTGG ACCGCTTAAT CCAGCACTAC CTGGGGTACT GCGCCGCCGC GGCGACCCGG
GTGAACCGGT GCGCGGCCAA CCTCGTCGGG CTGATAGCCG AGCCGGAGAC GGTCGACGTC
CCGGACGATC CCGGGGAGTG CATGGCGTGG GGCGAGCGTG AGCGGGAGGC GGTGTGTGCG
GCGGCTGCGC AGGCGCTGCT GAGCCGCGAT TCCGGCGGCT CCGCGGTCGT CGATGCCTGC
GCCGATCTCC TGTACTGGCT GGGCATCTTG TGCCAGTCCG GACCCGGTGA CGAGGATCTG
GCACGGCTGG CGGCCGTCGT CATCGACGAG GCCGACCGGG ACGACAACCG CCGCGCGGAG
GTCATCGCAC GATCGCAGCT CGCGTATTCG CTGTCGCAGT CCTGGTACTT GGCGGAGGCG
GATGTCCAGG CGGCTGCGGC GGTTTCGGTG GCGCGCGGTC TCCCCGAGCC CGGCTACCTG
CTGGAAGCGT TGTCGGTGCT GTCCGCGAAC CACTGGCGCG CCGGACGCGA CGAGGAGTCC
ATGCGCGCCG CCTCCGAGGC GCTGCGTCTG CTGGAGGAGA CCGGCGCCGG CTGGGAGCAG
CTGTCGGAGG CGCAGCTCAA TCTGGCGCAG AGCCTGTGCC GCGTCCAGCA GCCCCAGGAG
GCGCGGGAAC TGGCCGAGCG GGGTCTGGCC CTGCGGCGGG CCCACGGGGA CGCCAGCGCC
CTGGCGTACG GCCTGCACGC CACCGGCATG GTGCTGCGCG AGGCCGGCTT CCCCGACCGC
GCCGCCGCCT GCCACCGCGA GGCCTCGGAC ATCCACTGCG GCATCGGGCA GCTCCGGCGG
CTGGGCTGGT CGCATCTGCG GCTCGGCGAG GCCCTGTTCG ACCAGGGCGT GCACGACGAG
GCGCTGGAGG CCGCCGGGCA CGCCGCCGAG ACGCTCACCG AACTCGGCGA CCGTCCCGGG
CTGGGATTGG CGCTGGTGCT GATCGGCCGG ATCCACGAGC GGCTCGGGGA TCCGGCCGGG
GCGCGGGCGG CGTGGCAGGA TGCTTATACG GTGTTCGACG GGACGTCGTC GCCGGTGACG
GTCGAGCTGC GCAAGCTGCT CGGGCTCGAC GCCGGTCCCG GCGGGGTGGT GGGTGTGGTG
GATGGTGCGG TGGATGCTGC GGTGGGCGGC GCGGCCGAGC CCGAACCGGG CGGCGCGCGC
CCGGCCGAGC AGTGGCCGCC GTGGATGCCG CGGCATCCGG TACCGCCGCG TGGTTAG
 
Protein sequence
MNIYLTACQI NTRRLRAYPA QVADSAWRFS VLGPVRGWGP HGELDLGSPQ QRELLALLAL 
RPGRAAMAEE LIDALWGGDA PKGALSTIRT YASRLRRALG GMDGTGGTGG TGGMNAKTGH
TGRFGQADQA SQAGHAAPSE YADHADHAKR PGQATHAVQP SQGPHAGHAH HAGADSPISS
VAGGYALTVD PSAVDALAFE AGQAVAAGLR ATGDLVGAAR RLHEALALWQ GEPLAGLTGP
FAVAQRAQWT ERRLGALEAR IALDLEVGRY NAVTAELAAL ATEFPLRERF RELQMLALAR
SGRKAEALAV YDQTRRTLAE ELGVDPGPDL RALYRQVAAG SSDGGSRVDG AVPRALRKPS
GPLAEGQTGP SAKPVPRPVS QTSVTLPGPA QLPADVADFT GRAVWSRQLL DLMQPGEAMP
LAVLSGIGGA GKSTLAVHTA HLAAAKFPDG QLFAALRGTD REPADPGGVL GGFLRALGTD
PGAVPDTVRE RSELFRSTLA GRRVLIVLDD VRDAEQIRPL LPGTPGCAVL ATSRSRLTGV
PGARLLELGA FHPDEALALF RAVAGPDRVT GSEPAVRRAV AVCGHLPLAV RILASRLAAR
PHWTAETLAG RLCDEARRLD ELRAGDLAVE ATFRLGYEHL RPDQAHAFRL LAVPDGPDIG
VEAVAALLCC PQQEAEDLAE ELVDLCLVES PSPGRYRLHS LLRMFARRLA ADIDGPGAPR
AALDRLIQHY LGYCAAAATR VNRCAANLVG LIAEPETVDV PDDPGECMAW GEREREAVCA
AAAQALLSRD SGGSAVVDAC ADLLYWLGIL CQSGPGDEDL ARLAAVVIDE ADRDDNRRAE
VIARSQLAYS LSQSWYLAEA DVQAAAAVSV ARGLPEPGYL LEALSVLSAN HWRAGRDEES
MRAASEALRL LEETGAGWEQ LSEAQLNLAQ SLCRVQQPQE ARELAERGLA LRRAHGDASA
LAYGLHATGM VLREAGFPDR AAACHREASD IHCGIGQLRR LGWSHLRLGE ALFDQGVHDE
ALEAAGHAAE TLTELGDRPG LGLALVLIGR IHERLGDPAG ARAAWQDAYT VFDGTSSPVT
VELRKLLGLD AGPGGVVGVV DGAVDAAVGG AAEPEPGGAR PAEQWPPWMP RHPVPPRG
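As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the following sketch translates a CDS and computes its GC fraction. It is a minimal example, not the database's own pipeline. It assumes that translation table 11 shares the standard codon-to-amino-acid assignments, that the listed alternative start codons are read as Met at the first position, and that the 10-base blocks are separated only by whitespace. All function names are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; table 11 is assumed to use the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons assumed for table 11; at the first position they are
# read as Met regardless of their usual meaning.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a complete CDS, dropping a terminal stop codon."""
    seq = clean(seq)
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    if codons[0] in START_CODONS:
        protein[0] = "M"  # initiator codon is read as Met
    if protein[-1] == "*":
        protein.pop()     # the stop codon is not a residue
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases of the gene sequence listed above.
    print(translate_cds("GTGAACATAT ATCTGACGGC G"))
```

Passing the full Gene sequence block to `translate_cds` and `gc_fraction` gives values that can be compared with the Protein sequence block and the GC content field. Any mismatch would point to a copy error or to one of the stated assumptions not holding.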