Gene Caci_5051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5051 
Symbol 
ID8336405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5797498 
End bp5800395 
Gene Length2898 bp 
Protein Length965 aa 
Translation table11 
GC content70% 
IMG OID644958150 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003115752 
Protein GI256394188 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAATTC GCTTGCTCGG TCCGGTCGAG GCGTGGGGCG ACCAGGACCG GCTGGACATC 
GGCTCGGCGA AATCGTGTCT GGTGCTGGCG GCCCTGACCA TCGCGCCGGG GCACATCGTG
CCCTGGGACG TCCTGGTCGA CCGGGTCTGG GGCGAGCAGC TTCCCGGCGA TCCGAAGGCC
TCGCTGTACG CGTACGTCGC ACGGCTGCGG CGAACGCTGG ACCCGGCAGG CGTCCACATC
CTGAGTCGCC CCGGCGGGTA CCTGTGCGAC GTGCCGCCGG AATCGGTCGA TCTGGCGCGG
TTCCAACAGC GGGTCGCAGA GCTGCGAAGC ATCGAGGCGG CCGATGCCGG CGGGCCCGAC
ACCGCCGATC GGCTCACCGA GGCACTCGCC TGGTGGCAGG GCACGCCGCT GGCGAACCTG
ACCGGCGAAT GGGTCACCCG GACCCGCCGG ACGCTGAACG AGGAACGCCT TGCCGCGTTG
CTCCTGCTCG CCGACGTCCA GGCCCGACAC GGCCGGCTCG CAGACCTCGC CGCCGACCTG
CTGGCCGCAT CCGCCGAGTA TCCGCTGTCC GAACCGCTTG CCGGGTACGT CATCCGCGCC
CTGGCAGCGG CCGGCAGGCG CGCTGAAGCG CTCGACTACT ACGCCGACGT TCGCAGCCAT
CTGGTCGACG AGCTCGGCGA GGAGCCCGGC GCGGCGTTGC AGCAGTTGCA CGTCCGGCTG
CTCCGCCGCG ACCCGAGCCT GGCCGACGAG GCACCACCGG CCGCGGCCGC ATCGCTCGTG
CCGCGTCAGC TGCCGTCAAT CGCACGCCAC TTCGTCGGCC GGCGCGCCGA GCTGAAGGCG
CTCGACGGCG TGCTCACGGG CAGCCAAGCC GCCTCGGCGG TCCTCATCTC GGCGATCTCC
GGCACGGCCG GCATCGGCAA GACGACGACC GTCGTGTATT GGGCGCACCA CGCGGCGCGG
CAGTTCCCGG ACGGCCAGCT CTACGTGAAC CTGCGTGGCT TCGACCCGAC CGGACCGCCG
ATGAAGCCGG AGGAGGCGAT CCGCGGCTTC CTCGACGTCT TCGCCGTCCC GAAAGAGCGG
ATTCCGCACG GCCTGGACGC CCAAGCCGCG CTGTATCGCA GCCTGCTCGC CGGGCGCCGG
ATGCTGGTGG TGCTCGACAA CGCCCGCGAC GCCGACCACG TCCGACCGCT GCTCCCCGGC
TCGCCGGGCT GCCTGGTCCT GGTCACCAGC CGCAGCCGGC TCACCGGGCT GGTCGTCGGC
CACGGCGCCA CGCCGATCAC GCTGGGTCTG CTCGACGACG CGGAGGCCGA GCACCTGCTG
AGCCGCTACC TCGGAGCCGA ACGCGTCGCC GCCGAACCGG ACGCGGTACG TGTCCTGATT
CAGCGGTGTG CCCGCCTGCC GCTGGCGTTG GCGGTCGCCG CCGCTCGGGC GCTGATGGAT
CCCGCGATGC CGCTGGGCGC GCTCGCCGCC GAGTTGGCCG CCGCCCCCGG ACAGCTCGAC
GCGCTGGACA CCGGCGATCC CTCGACGACA GCGCGAGCCG TGTTCTCGTG GTCGTATCTC
GCGCAACGGC CTGAGGCGCA ACGACTTTTC CGGCTATTGG GACTGCACCC CGGACCCGAC
ATCTCGGTTC CGGCAGCGGC AAGTCTCACC GGACTGAGCA CCGAAGAGGC AGCCGCGTTG
CTCAGTGAGC TGACGCGAGC CCACCTGCTC ACCGAGCACG CATCGGGTCG ATACAGCAGC
CACGACCTTC TCCGCTCCTA CGCCGCGGAG CTTGTGCAGA CAGAGAGCTC CAATGCCGAA
CGCGACACCG CGTTCCGTCG GATGCTGCAT CACTACCTGC ACAGCTCATA CCTCGCCGGC
CGGCTGCTGG ACCCGCATCG CAAGCCGATC ACCCCGGCGG CTTTGGTCGA CGGAGTCATC
CCCGAGTCCT TCGCCGACCA AATGCAGCAG GCGCTGCGCT GGTTCGAGGC TGAACGCGAG
GTGCTGCTCG CGGTGATCCG GCGCGCGGCA GCCGCTGAGC CGGCAGCCGA CAAGACGCTG
GCCGACGAGC CGCACGCCGC CGACGTCGAC ACCCTCACCT GGGAACTCGC CTGGACGGTC
ACTGACTACC TCGACAGGCG CGGGCACTGG CAAGACTGGC TCGCCACCCA GCAAGTCGCG
ATGCAGGCAG CGCAGCGGCT TGGCGACCAA GCCAAACAGG CGCACTCCCA CCGTCTTCTT
GCCAACGCGT ACATCGGGCT CGTCCACTAC GAGGCAGCCG CCGATCATCT GAGCCACGCA
CTCGACTACC ACGACCGCCT CGGCGATCTC GAGGGCACCG CCAACTGCCG GCGGTCGCTG
TGCCGCGTCC GCGAACTCCA AGGCAGGTAT CCCGAAGCAC TCGCCCACGC CGAGGAATCC
CTGCGCCTCT TCCGCGCCAC CGACAACACC ATCGGCCAAG CCCGCGCCCT GAACGCGGTC
GGCTGGCTGC ACATCCTGCT CGATGATCCC CAGCCCGCGC TCGAGTACTG CCAAAGCGCT
CTGGCCTTGT TCCAGGAACT CGGCAGCACC TACGGGGAGG CGGTGACCTG GGACAGCGTC
GGCTCAGCGC ACCACCGGCT CGGACAGACG GACCAAGCCA TCGCCTGCTT CCGGCGGTCC
ATCGACCTGC TCCGGACCGT CGGCGACCGC CACACCGAAG CCGAGACCCT CACCAATCTC
GGCGACGCCC AGCACGACAT CGGCCAGGAC GAGGCAGCCC GCACCACCTG GCAGCAGGCG
CTGGAGATCT GCGAGCACCT CGATCATCCC GACGCCGAAA AGGTGCGGAC CAGGCTTCAC
GCCTTGCGGC CGACGCCTCC CCAAACCCCT ACCTCAGCAG GGCTTCCGAA GGGGTTGGGA
TCGGGCCGCC AGTCCTGA
 
Protein sequence
MRIRLLGPVE AWGDQDRLDI GSAKSCLVLA ALTIAPGHIV PWDVLVDRVW GEQLPGDPKA 
SLYAYVARLR RTLDPAGVHI LSRPGGYLCD VPPESVDLAR FQQRVAELRS IEAADAGGPD
TADRLTEALA WWQGTPLANL TGEWVTRTRR TLNEERLAAL LLLADVQARH GRLADLAADL
LAASAEYPLS EPLAGYVIRA LAAAGRRAEA LDYYADVRSH LVDELGEEPG AALQQLHVRL
LRRDPSLADE APPAAAASLV PRQLPSIARH FVGRRAELKA LDGVLTGSQA ASAVLISAIS
GTAGIGKTTT VVYWAHHAAR QFPDGQLYVN LRGFDPTGPP MKPEEAIRGF LDVFAVPKER
IPHGLDAQAA LYRSLLAGRR MLVVLDNARD ADHVRPLLPG SPGCLVLVTS RSRLTGLVVG
HGATPITLGL LDDAEAEHLL SRYLGAERVA AEPDAVRVLI QRCARLPLAL AVAAARALMD
PAMPLGALAA ELAAAPGQLD ALDTGDPSTT ARAVFSWSYL AQRPEAQRLF RLLGLHPGPD
ISVPAAASLT GLSTEEAAAL LSELTRAHLL TEHASGRYSS HDLLRSYAAE LVQTESSNAE
RDTAFRRMLH HYLHSSYLAG RLLDPHRKPI TPAALVDGVI PESFADQMQQ ALRWFEAERE
VLLAVIRRAA AAEPAADKTL ADEPHAADVD TLTWELAWTV TDYLDRRGHW QDWLATQQVA
MQAAQRLGDQ AKQAHSHRLL ANAYIGLVHY EAAADHLSHA LDYHDRLGDL EGTANCRRSL
CRVRELQGRY PEALAHAEES LRLFRATDNT IGQARALNAV GWLHILLDDP QPALEYCQSA
LALFQELGST YGEAVTWDSV GSAHHRLGQT DQAIACFRRS IDLLRTVGDR HTEAETLTNL
GDAQHDIGQD EAARTTWQQA LEICEHLDHP DAEKVRTRLH ALRPTPPQTP TSAGLPKGLG
SGRQS