Gene Caci_4039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4039 
Symbol 
ID8335392 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4567354 
End bp4570479 
Gene Length3126 bp 
Protein Length1041 aa 
Translation table11 
GC content74% 
IMG OID644957145 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003114748 
Protein GI256393184 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.039203 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0409603 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGATAC TCCAGGCGCT CGGCGCAACC GCGCGCGGCA TCACCCGCGT GGTGAAGGCG 
CTGCTGGCCG CCGCGATCAC CATCGGCGTG CTCGCCGGCA TCCCGGCGGC CCTGCTGCAC
TACGCCGGGG ACCCGATCCC GCACTCGGTA CCGACCATGG ACGCGGTCAA GCACACCCTG
ACCAACCCCA TGACCCCGCA GATGCTGCTC AAGGCCCTGT CGGTCGTCGG CTGGTACCTG
TGGGCGATCC TGGCCGTCAG CTTCCTCGTC GAGCTGGTCT ATGCCGCACG GCGGGTCAAC
GCCCCGCACA TCCCCACCCT CGGGCCGACC CAGGCGCTGG CCGCCGCCCT GATCGCCGCC
ATCGGCATCA CCACCCTCCT GCGTGCCGCC CCCGCCCACG CCGCCGAAAC CTTCTCGGCC
TCCGCTCCGA CTGGCGGCCG GGTCGCAGCC ACCGCACCCG CGCTCGCCGG GACCGGCAGT
CTGTCCACCG CGCACCTCGC GGTCGGCAAC GCCAGCTCGG CCGCGCCACG CGAAAGCGTC
CACACAGTGA AGCCCGGCGA GTCGCTGTAC TCGATCGCCA AGGAAGACCT CGGCAGCGGC
GACGACTGGC CGGCCCTCTA CAAGATGAAC GCCGGCGTCG TCCAAGCAGA CGGCGACCAG
CTCACCAACC CAGACCTGAT CCGCCCCGAC TGGAAGATCC GCATCACGCC GCCCAGCGCC
GATCAGGCTC CCGCGACCAC TAGCACCACC ACCGCCCCGC CCACCAAAGC TCCCGCGCCG
TCGGCCCCGA AGGCGTCCGC CCCGGCGACG AGTGGCCCCA GCGCGCTGCC TTCGCCCGCT
GCTTCGCATA CTGCCGCCCC ATCCCCCGTC ACCCACGCAA CGCCTACGAA CGACGACCAC
CGGCAGGGCG ACCGCGCCGC GAAGCGCCAC GGCGGAGTCG CGGTCTCGCT GCCCGACGGC
GGAGCCATCG GCATCACCCT CGCCGTGGCG CTCGGGTCGG CGCTGGTCCT GGCCCGGCGC
TGGCGCACGC GGCGCGCCGA CCCGCGCCTG CCGATCGCCG AGCCGCCGCT TCCCGGGGCG
CTGCTGGCCG CCCGCCGTGC CCAGCGGTCC CTGGCCGCCG CCCAGCACAG CCTGGCCGCA
TCGCCCGAAG ACGCAGTCCA CGACGAGGAA CACGACGACC TCTTCGACGC CAAGGGGATC
GAAGACCTTG ACGAGGACGC CTTCGGCGCT GATCAGAACC TCATCGGCGA CGACGCTGAC
AGCTGGGACG ACGCCGAGGA GCTGGACGAG TTCGGCGCGC CGGCCGGTCC CTTGCCGGAG
CCTACGGTGA CGCGCTTTGC CGAGCCGCTG CCGCCGGGCT CGATCGGCGC CGCCGAGCGC
GACGGCGTCG AGCTGCCGCT GACCCCGACG GGCACCGGCC TGGGCCTGGT CGGCCCCGGA
GCCGCCGGGG CCGCCCGCGC GATCGCCGCC TCGGTGCTGT CAGCGGGCTC GCCCGAGCGC
ACCGCCGACC TGGCCAGGCT CATCATCCCG GCCGCGGACC TGGCCGCGCT GCTCGGCGTC
GACGAGTCCG AGCTGCCAGC CATCACCCGC GGGCTTCCCG AACTATTCGC CACCAAGGAC
CTGGCCACCG CGATCGGCGA AGCCGAAGTC CACGCGCTGC TGCGCACCCG GCTCCTGGAG
GAGTACGAGC AGCCGGACCT GGACGCGCTC GCCGCGGCGC ACCCGGACGT CGAGGACTGC
CCGCCGCTGG TGCTCATGGC CTCCCCGGCG CGCGCTCTGA GCGCGCAGAT CGCCGCCCTG
ATGAACACCT CCGCGTGTCT GCGGATCACC ACCGTCCTGC TCGGCGCGCA CCCGGACGGG
CCGACCGCCT TCGTCGAAGC CGACGGCACC GCCACCGGCC CCGCTGTCCG AGACTGGTCC
GGCGCCAGGC TGTGGAACCT GAGCGCGCCC GCCCTGGCCG ACATCCTGGA CCTGCTCGCC
CGCGCCGCCG GCCACGATCC CGGCACTCCG GGCGGCCAGG CCGAACCAGA CGACTGGCCG
GAGACCCCCC CGGCCCACGA CGCCGCAGAG CGCGCGGCGG CCGACGAGAA CGACGGCGGC
GAGGCGACGA TCACCGTCCT GCCCGTGCGC CCCATGCCGG ACCGGCAAGC CGACCCGGTC
GGCGACGACC ACGAGGAGCG CACGCTCACC GACTCCCCTG TGGCAGGTGC CAACACCGCA
TCCCTGGCCC CGGTGACCAT GCTGCCGGTG CGGCCCGCCC CGGACAGGGC GCTCGCCGAC
GCGGCGCGAA CCAGCGCCGA CGTGCGCGCC GAGGCGGCGC TGACGGCATG GAACGAAAAC
CCGATACGGA TCAACGTGCT AGGCGGACTG AACATCACCG CAGGCGGGCA GTCAGTGTCC
GGGCTTCGCA CCTCGGCCCG CGTGCTGGCC GCGCTGCTAG CAGTCAAGGG GTCCGCGGGC
GCCAGCTCCG AGCAGATCGA CGCGATGTGC TGGCCCGACG CCAACCCCCA GGAGATGGAC
CGGATCGCCA AGTGGCGTGC CGACGGCCTC AACTCCCTGC GTAAGCGCCT GGCCGCGGCC
ATGGGCCAGC GCAGCCCCCG GCTGGTCCTG CTCGACCGGG CCACCGGCCG CTACCGGCTC
AATCCCGAGC TGGTTGCCAC CGACCTGGGC ACGATCGCCG AGCTGACCGC CGCCGCCCGC
AGCGCCGGTG ACACCGAGCA GCGCCTGGCG CTGCTGGCCG CCGCCGAACC GCTGTGCCGC
GGAGCGCTGC TGGACGGAGA ACTCGGCGAC AATTTTGACT GGAGCGCGGA CTTCATCGCG
ACCGTCGCCG ACGAGCAGGT CGCCGTCCTG GCACGCCTCG CGACGCTGGC CGCTGATTCC
CGGCCGGACC AGGCCCTGGC GGCGCTGGAG AAGGCCGCCG CGTTCACCGA GGACAACGAG
ACGCTGTACC AGCAGATGTT CGACATCCTC GCTGAGGCCG GACGGCACAG CGAGATCCCC
GGCAAGCTGC GAACCCTCGA GGCGTACGCC GACTCCCTCG GGGCCGGCGT CTCGACAGCG
ACCCGCGAAG CAGCGGCGCG CGCGATGAAG CGCCAGCCGC AGCAAGGGGT CCGCCAAGGG
CACTGA
 
Protein sequence
MAILQALGAT ARGITRVVKA LLAAAITIGV LAGIPAALLH YAGDPIPHSV PTMDAVKHTL 
TNPMTPQMLL KALSVVGWYL WAILAVSFLV ELVYAARRVN APHIPTLGPT QALAAALIAA
IGITTLLRAA PAHAAETFSA SAPTGGRVAA TAPALAGTGS LSTAHLAVGN ASSAAPRESV
HTVKPGESLY SIAKEDLGSG DDWPALYKMN AGVVQADGDQ LTNPDLIRPD WKIRITPPSA
DQAPATTSTT TAPPTKAPAP SAPKASAPAT SGPSALPSPA ASHTAAPSPV THATPTNDDH
RQGDRAAKRH GGVAVSLPDG GAIGITLAVA LGSALVLARR WRTRRADPRL PIAEPPLPGA
LLAARRAQRS LAAAQHSLAA SPEDAVHDEE HDDLFDAKGI EDLDEDAFGA DQNLIGDDAD
SWDDAEELDE FGAPAGPLPE PTVTRFAEPL PPGSIGAAER DGVELPLTPT GTGLGLVGPG
AAGAARAIAA SVLSAGSPER TADLARLIIP AADLAALLGV DESELPAITR GLPELFATKD
LATAIGEAEV HALLRTRLLE EYEQPDLDAL AAAHPDVEDC PPLVLMASPA RALSAQIAAL
MNTSACLRIT TVLLGAHPDG PTAFVEADGT ATGPAVRDWS GARLWNLSAP ALADILDLLA
RAAGHDPGTP GGQAEPDDWP ETPPAHDAAE RAAADENDGG EATITVLPVR PMPDRQADPV
GDDHEERTLT DSPVAGANTA SLAPVTMLPV RPAPDRALAD AARTSADVRA EAALTAWNEN
PIRINVLGGL NITAGGQSVS GLRTSARVLA ALLAVKGSAG ASSEQIDAMC WPDANPQEMD
RIAKWRADGL NSLRKRLAAA MGQRSPRLVL LDRATGRYRL NPELVATDLG TIAELTAAAR
SAGDTEQRLA LLAAAEPLCR GALLDGELGD NFDWSADFIA TVADEQVAVL ARLATLAADS
RPDQALAALE KAAAFTEDNE TLYQQMFDIL AEAGRHSEIP GKLRTLEAYA DSLGAGVSTA
TREAAARAMK RQPQQGVRQG H