Gene Caci_6283 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6283 
Symbol 
ID8337646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7225603 
End bp7228671 
Gene Length3069 bp 
Protein Length1022 aa 
Translation table11 
GC content69% 
IMG OID644959384 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003116978 
Protein GI256395414 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACCACG GGGCCACAGA AAAAGGTACC CGACCAGACC TCCGCTTCGC CATCCTCGGC 
TCGTTCGAAT GCTGGGCCGA CGGCAGCATC ATCGCTTTGC GCGGTCCGCT TCAGGAACGC
CTCGCCGTCT CCTTGTTATT GAACCGGAAC CGCGTGGTGC CCGTCTCCCG TCTTGTGGAG
ACGATCTGGG AGGACGGCGG TCCCGAGACC GCCGCGCACC AGGTCCGCAA GATGACCTCG
GACCTGCGAA AACGCGTGCC GGGCATGGCT TCTCTGCTGA CGACCTCCGG TGTGGGCTAC
CGCATCAGGG TCCCGGAGGA GACGCTGGAC CTCGGGATCT TCACCGCGGC GTTGAGCGAC
GCGCGCCGCG CGACCGAAGA GGGCGATCTG CGCTCCGCCG CGGCCAAGTT ACGCCTGGCA
CTGGACCAAT GGCGCGGTCC GGTGCTCGGA GGCGAGGGCG GGCCGGTGAT CCAGGCGATG
AGCACCGCCT TGGAGGAACG CCGGCTGACC GCTCTGGAGC AGTTGTTCGC GATCCGGCTG
GACCTCGGCG AGGCCAGCCG CCTGGTCGGC GACCTGCGGG AGGCGGTGGG TGCGAACCCG
GTCCGGGAGC ACTTGCGCGG GCAGTTGATG CGAGCCTTGT ATCTGTCCGG ACGCCAGACC
GACGCGCTGG CCGAATACCA GAGCCTGCGG CAGTATCTCG ACGAGGAACA CGGGGTCGAG
CCTGGCGCGG AGCTGGCCGA GTTGCAGCAA CGCATTCTGC GCAACGACGC GAGCTTGGGC
AGCCATGGGG AGGCTCGCAG GGAGACGCAG GTCGCGGTCC CGGCGCCGGC TTCCACGGCT
GCGGCTGGCG GCTTTCCCAC GACTCTGCCT TATGACCTGC CCGATTTCAC GGGGCGCCAG
GAGGCGCTGG ACGCCGTGTG CGCGGCGGGG CAGCCGTCCG GGTCCGGAGC GCACCGGCTG
CGGATCGTGC TGATCGACGG GATGGCGGGC AGCGGCAAGA CGGCGTTGGC TGTTCATGCC
GCGCACTTTC TGCAGGAGTC GCATCCCGAC GGTCAGCTGT ATCTGGACCT GCATGGGTTC
ACTCCCGAGC GCGATTCCGT GGACACCCAT GAGGCGCTGG GCATTCTGCT CGGCGCGCTG
GGCATCTCAG GGTCCGATGT TCCGGTCGAT CCGGAGGCGC GGATCGCGCG CTGGCGTACG
GCGACCGTCA ATCGGCGCAT GCTGCTGGTG TTCGACGACG CGGAGAGCGC CGCGCAGATC
CGCGGGCTGC TGCCGAGTTC TTCCGAGAGC ACGGTGCTGA TCACCAGCCG CGTGCGCATC
AAGGGGATCG ACGGCGCGCG GGCTGTTTCG CTGGGCGTGC TGTCTCCGGC GGAGAGCTTG
TCCCTGCTGG AAAGGGTGCT CGGACGGGAG CGCGTGGCCA ACGAGTCCGA GGCCGCGATG
CGGCTGGCGG ACTTGTGCGG GCATCTGCCT TTGGCGCTGC GAATCAGTGC GGCGCGTCTG
GCGAGCCGGG ACCACTGGAC CATCGCGCGG TTGGCGAGCC GGCTGTCGAA CGAGTCGCGC
AAGCTGGTCG AGCTGGCGGT GGAGGATCGC AGCATCAGGG CTTGCATCAA GTCCTCCCTC
GAGGCGCTCG ATCAGGAGCA TCTGGAGTAC CTGCGGTATT TCTGCCTGCA TCCCGGCGAC
GATGTGGAGA TCCACGCGGC GGCGGCGCTC ACCGGTCTGG ACGTGTACGG CGCCGAGGAC
ATCGTCGAGC TCTTGCTCGA TCGGCATCTG ATGGAGCCGC GGTTGGCGGA GCGGTACACG
GTGCACAGTT TGGTGCGTGC CTATGTGCTG GAGCAGTACA CCGACCCGGA GGGCGAGGGT
CCCGCGTTCG AGCGGCTGAC CAACTACTAC CGGACGGCGA TCTGCCGGGC TACCGACGCC
GCCTTCCCGG GGCGGAGCCC GTTCTCGGCT GGGCTGCCCG CTTACGAGGG CGACGTCCCG
GAGCTGGAGG ACCGGAGCCT GGCTCTGGCG TGGGTCGACG CCGAGTATCG CAATATTCTG
CCTTCGCTGG CCGAGGCGGT GCGGCGCGGC AGCTATCCGC CGGTCGTGGA GAGTGCTCGC
AACCTGATTT ACCACCTGCA CTTGCGCGGG CGCAGCGATG CGTTCCTTGA GGCGGCCACG
CTTGGTGTCG CCTCGTCGCG TCAGACCGGG GATCCGGCGT CGATCCGGAT GAGTCTGGTG
AACCTCGCGG TGGCTTATTG GCATCGGGGG GATCTGCCGG CGGGTCTGGT TCAGTTGCGC
GAGGCGCTGG AGGTCGCGAC CGCATCCGGG GATCGGGCGG GGCAGGCTGT GTGCCTGGGC
CGTCTCGGGA CGTTCTACTC CACGCTGGGC GATCTGCGGG CGGCTGTGGA GCACCTTGAG
CGCAGCATCC CGATGCATCT TGAGACCGGG AATCTCAAGG AGGTGGCCGA GGCTCGGTTC
CATCTCAGCT CGGCGTTCAA CACCCTCGGC CGCTACGAGG ACGCCGCCGC GCAGGCCGAG
GCGGCCGTCG CGGTGAATCA GGAGCTGAGC AGTCCCAGCA CGCTGATCAT GGCGCTGGTG
AACCTCGGGA CCGCGCAGAC CGGGCTGGGC TTGCACCATG AGGCGCTCGG GACGTTGAGC
GAGGCGTTGG AGTGGGAGGG CCGGCTCGGC GGGTCGATGT CGAAGGCGCT CATCCTGGCG
CGGATGGTTC CGGCGTGCCG GTCAACTGGC GCTCGGGACG CGGCGGCGCA GTTCGCGCGT
CAGGCCGCCG AGGCGGTGCG CGCTCCGCAT ACCTCGCCGA TGCACCGGGT CGCCGTGCTC
GATCTGCTCG GAGAGCACGC GCTGAAGGAG AACAAGCTCA GCGCGGCGCG GGAGCTGTTC
ACCGAGGCGA TCGAGCTGAG CGAGGATCTC GGGCTGCGCC TGGACGCGGC GTGGGCGACC
GCTGGGCTGG CCGGGGCGTT GGCGGCGGCC GGCGAGGACG GCGACGCGCA GGTGCTGCTG
CACCGCAGCG ATCCGATCCT GGCCGACATC GGGCTTCCGG AGAAGCTGCG GCGGCGCTCC
CGGGGCTGA
 
Protein sequence
MDHGATEKGT RPDLRFAILG SFECWADGSI IALRGPLQER LAVSLLLNRN RVVPVSRLVE 
TIWEDGGPET AAHQVRKMTS DLRKRVPGMA SLLTTSGVGY RIRVPEETLD LGIFTAALSD
ARRATEEGDL RSAAAKLRLA LDQWRGPVLG GEGGPVIQAM STALEERRLT ALEQLFAIRL
DLGEASRLVG DLREAVGANP VREHLRGQLM RALYLSGRQT DALAEYQSLR QYLDEEHGVE
PGAELAELQQ RILRNDASLG SHGEARRETQ VAVPAPASTA AAGGFPTTLP YDLPDFTGRQ
EALDAVCAAG QPSGSGAHRL RIVLIDGMAG SGKTALAVHA AHFLQESHPD GQLYLDLHGF
TPERDSVDTH EALGILLGAL GISGSDVPVD PEARIARWRT ATVNRRMLLV FDDAESAAQI
RGLLPSSSES TVLITSRVRI KGIDGARAVS LGVLSPAESL SLLERVLGRE RVANESEAAM
RLADLCGHLP LALRISAARL ASRDHWTIAR LASRLSNESR KLVELAVEDR SIRACIKSSL
EALDQEHLEY LRYFCLHPGD DVEIHAAAAL TGLDVYGAED IVELLLDRHL MEPRLAERYT
VHSLVRAYVL EQYTDPEGEG PAFERLTNYY RTAICRATDA AFPGRSPFSA GLPAYEGDVP
ELEDRSLALA WVDAEYRNIL PSLAEAVRRG SYPPVVESAR NLIYHLHLRG RSDAFLEAAT
LGVASSRQTG DPASIRMSLV NLAVAYWHRG DLPAGLVQLR EALEVATASG DRAGQAVCLG
RLGTFYSTLG DLRAAVEHLE RSIPMHLETG NLKEVAEARF HLSSAFNTLG RYEDAAAQAE
AAVAVNQELS SPSTLIMALV NLGTAQTGLG LHHEALGTLS EALEWEGRLG GSMSKALILA
RMVPACRSTG ARDAAAQFAR QAAEAVRAPH TSPMHRVAVL DLLGEHALKE NKLSAARELF
TEAIELSEDL GLRLDAAWAT AGLAGALAAA GEDGDAQVLL HRSDPILADI GLPEKLRRRS
RG