Gene Sare_1796 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1796 
Symbol 
ID5708379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2068385 
End bp2070319 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content75% 
IMG OID641271298 
Producttranscriptional regulator, CdaR 
Protein accessionYP_001536673 
Protein GI159037420 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3835] Sugar diacid utilization regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.220013 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00020327 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACACCCC GCACAATGGC CGTCATGTCG TCGCCGGTGG AGTTCCTGGA ACTGCTCGCC 
CGCGAAGCCG CAGCGGTGGA GTTCGAGGGA CCGCTGGTGG CCGCCCGCGC CGCCGGACTG
CCCCCGGAGC GGCTGGCCGA GCTGGAGCAC GCCAGGACGG TCGCGTTGCG GGTCCGGGCA
CTGCTGGAAC GCCGGCGGCG CCGGGAAAGT GAGCTGTCCG GGCTGTACGA CACGGTCAGC
GACCTGGCCG GGCTGCGCGA TCTGGACGAC GTGCTGCGGG CGATCGTGCA CCGCGCCCGC
AACCTGCTGG GCGCCGACGT CGCGTACATG ACGCTCAACG ACGACCGGCA CGGCGACACC
TACATGCGGG TCACCGACGG CTCGGTGTCC GCCCGGTTCC AACGGCTGCG ACTGCCGATG
GGCGCCGGCC TCGGTGGCCT GGTCGCCCAG TCCGGCGCCC CGTACGTCAC GGCCAACTAT
CCGGAGGACG ACCGGTTCCA CCACACCCGG GAGATCGACG CCGGGGTCGG TGAGGAGGGC
CTGGTGGCCA TCCTCGGGGT GCCGCTGCGG CTCGGCTCGA TCTCGATCGG CGTGCTCTAC
GCGGCCAACC GCTCGGCGCG GCCGTTCGCC CGGGAGGAGG TGGCGCTGCT GGTGTCGCTG
GCCGCGCACG CCACGGTGGC GATCGACAAC GCCCGGCTGC TCGCCGAGAC CCGCTCGGCG
TTGGTGGAGC TGTCGGCGGC CAACACCACC ATCCAGGCGC ACAGCGCGTC GGTGGAGCGG
GCCGCCGCGG CGCACGACCG GATGACCGCG CTGGTGCTGC GCGGCGGCGG GGTGGAGGAG
GTCGCCGCCG CCGTGCCGGA GGTGCTCGGC GGCGCCCTGC TGGCCCTGGA CGCCGAGGGC
CGGCTCCTGG CCCGGGTCGG CGAGATCGAC GAACCGAAGC GGGCGGACAT CGCGGCGGCG
GTGGCCGCGT CCCGGACCGA GGGGCGCAGC GTGCGCCGAG GTCCGTTGTG GTATGCCGCC
GTGGTCGCCG GCACCGAGAA CCTGGGCGCG TTGGTGCTCC GCCCGGACGA CGAGCTGGTC
GACGCCGACC AGCGGATCCT GGAGCGGGCC GCCCTGGTCA CCGCACTGCT GCTGCTGTTT
CGGCGGACCG TCGCCGAGGC GGAGGGGCGG GTCCGCGGCG AACTGCTCGA CGATCTGATC
GCCCACCCGA GGCGCGACGC CGACGCGCTC CGCGACCGGG CCCGCCGACT TGACGTGGAC
CTCGACGCCG CGCATCTGCT GGTCTGCGTT GACGACGGTG CGATCGCCGC GACCGGTTCG
AGCCGGCAAC GGACACTCTC CTGGGCCGCC ACGTACGCCT CCACCCGGGG CGGGCTGGCC
GCGGCACGGA ACGGGCGGGT GGTGCTGATG CTGCCCGGCA CGGACGCGGG TGGCGCCGCC
CGCGCGGTGG CGCGGGACCT GTCGCGGGTG ACCGGCCAGC CGGTGACCGC CGGGGGCGGC
GGTCCCGCCA CCGGTCCGGC CTCGCTGGCA GCGACCTTCC ACGAGGCGGA GCGCTGCCTG
ACCGCGCTCG GCGCGCTGGG ACGAGCCGGG CAGGGCGCCG GGACCGACGA GTTGGGCTTC
GTCGGGTTGC TGCTGGGCTC GGTCGGTGCC GGCGGCGACC GCGACGTGTC CCGATTCCTG
ACCGACACCC TGGGGCCGGT GGTCGACTAC GACGCCCGCC GGGGCACCGC CCTGGTGCGC
ACGCTGGAGG CGTACTTCGG GGTGGGCGGC AGCCTGACCC GTGCGGCGGA ACGACTGCAC
GTGCACGTCA ACACGGTCAC CCAACGGCTG GAACGGGTGG GTCACCTCCT CGGCGCCGAC
TGGCAGCGAC CGGAGCGCGC CCTCGAGGTG CAGCTCGCCC TCCGGTTGCA CCGGCTCAGC
CGCCCGGCCA GCTGA
 
Protein sequence
MTPRTMAVMS SPVEFLELLA REAAAVEFEG PLVAARAAGL PPERLAELEH ARTVALRVRA 
LLERRRRRES ELSGLYDTVS DLAGLRDLDD VLRAIVHRAR NLLGADVAYM TLNDDRHGDT
YMRVTDGSVS ARFQRLRLPM GAGLGGLVAQ SGAPYVTANY PEDDRFHHTR EIDAGVGEEG
LVAILGVPLR LGSISIGVLY AANRSARPFA REEVALLVSL AAHATVAIDN ARLLAETRSA
LVELSAANTT IQAHSASVER AAAAHDRMTA LVLRGGGVEE VAAAVPEVLG GALLALDAEG
RLLARVGEID EPKRADIAAA VAASRTEGRS VRRGPLWYAA VVAGTENLGA LVLRPDDELV
DADQRILERA ALVTALLLLF RRTVAEAEGR VRGELLDDLI AHPRRDADAL RDRARRLDVD
LDAAHLLVCV DDGAIAATGS SRQRTLSWAA TYASTRGGLA AARNGRVVLM LPGTDAGGAA
RAVARDLSRV TGQPVTAGGG GPATGPASLA ATFHEAERCL TALGALGRAG QGAGTDELGF
VGLLLGSVGA GGDRDVSRFL TDTLGPVVDY DARRGTALVR TLEAYFGVGG SLTRAAERLH
VHVNTVTQRL ERVGHLLGAD WQRPERALEV QLALRLHRLS RPAS