Gene Caci_3688 details


Gene Information

Locus tag: Caci_3688
Symbol:
ID: 8335041
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4130002
End bp: 4131048
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 70%
IMG OID: 644956828
Product: transcriptional regulator, SARP family
Protein accession: YP_003114431
Protein GI: 256392867
COG category: [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family
TIGRFAM ID:
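
The start, end, and length fields above agree with one another if the coordinates are read as 1-based and inclusive. That reading is an assumption here, not something the record states. A minimal Python sketch of the arithmetic, with hypothetical function names chosen for illustration:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


length = gene_length(4130002, 4131048)
print(length)                  # 1047
print(protein_length(length))  # 348
```

Both values match the Gene Length and Protein Length fields of this record.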


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.240093
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGATTT CCTTCGACCC CGATGATGTC GGGTCGGCGC CGTCTTCGCG GCGTGACGGG 
GGAGATGGTG CCTTTGACGG TCAATCGACG TTCGTCCTTC TCGGCCCGTT GAGTATCGCG
GCCCAGGGGA CCATCGCCCC GCTCCAGCCC TCCCGGCCGG CGACGCTGCT CGCCACGCTG
CTGCTGCACC CCAACTCGGT GGTGTCGATC GGCGCGCTGG TGCGGGCGGT GTGGGACGAG
GAACCGCCGG TCAGCGCCAA GGCGGCACTG CACACCTGTG TCCAGAGGCT GCGCAGGCTG
TTCGCCAAGT ACGGAGTGCC CGGCGGCGAG ATCGAGGCGG TGTCGGGCGG GTACCGCATC
GCGGCTCAGG CCGAGACGCT GGATCTGATG CGGTTCCGGG GATTGGCCGC CCGGGCCCAC
GCCGCGGCTG ATCCGCAGGC TGAGTTGGCG CTGCTGCGCG AGGCGCTCGC GCTGTGGCGC
GGTCCGGCCC TGAGCAACGT CCGCTCGCAG GTGCTGCACC GGGAGGAGGT TCCGGCGCTC
GACGAGGAGC GGCTGAGCGT CGTGGAGCGC GTCTTCGATC TCGAGATCGC GCTGGACCGG
CGGCGTGAAG TACTTCCGGA GCTGTTCACC GCGACGCGGG CGCATCCCAC GCACGAGCAC
TTCTGGGAGC AGCTGATCGA GTCGCTCTAC CGCACGGGGC GCCGGGCTGA GGCGCTGGGG
GAGTACCGCC GGATCAAGCG CTATCTGCGC GAGCAGCTCG GCGTCGATCC CGGCGCGGCT
CTGCAGCAGC TGGAGCTGAT GGTGTTGCGC GGCAACGGAA GTGTCGTGGA GAGGGCCGCT
CGGCCTGAGA CCGGCGCGGT TCCGCTCCGG CTGCTCACCG AGGCGCAGAT CCTCGACCGG
CTCCAAAGCG CGGGTCTGGT GCGCAAACAG GCGCGCGGCT ATCAGATGCA CGAGCTCTTA
TACGTGTTGA CCAGGGATGC AGCCGTTGTG GACCACGGCG CGCCGGAGCC CGGCGCCCTC
CTGAGCGGAA AGGACGACGT GGATTAA
 
Protein sequence
MVISFDPDDV GSAPSSRRDG GDGAFDGQST FVLLGPLSIA AQGTIAPLQP SRPATLLATL 
LLHPNSVVSI GALVRAVWDE EPPVSAKAAL HTCVQRLRRL FAKYGVPGGE IEAVSGGYRI
AAQAETLDLM RFRGLAARAH AAADPQAELA LLREALALWR GPALSNVRSQ VLHREEVPAL
DEERLSVVER VFDLEIALDR RREVLPELFT ATRAHPTHEH FWEQLIESLY RTGRRAEALG
EYRRIKRYLR EQLGVDPGAA LQQLELMVLR GNGSVVERAA RPETGAVPLR LLTEAQILDR
LQSAGLVRKQ ARGYQMHELL YVLTRDAAVV DHGAPEPGAL LSGKDDVD
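
The protein sequence above can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions, not the database's own method. It assumes the listed gene sequence is already the coding strand, and it relies on translation table 11 assigning the same amino acids to codons as the standard genetic code (the two tables differ in which start codons they permit). The name `translate` is hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used on this page) is ignored.
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence listed above.
print(translate("ATGGTGATTT CCTTCGACCC CGATGATGTC"))  # MVISFDPDDV
print(translate("CTGAGCGGAA AGGACGACGT GGATTAA"))     # LSGKDDVD
```

The two fragments reproduce the first ten and last eight residues of the listed protein sequence. Passing the full gene sequence to the same function would allow a check of the whole translation.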