Gene Sare_3231 details

Gene Information

Locus tag: Sare_3231
Symbol:
ID: 5705430
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3724107
End bp: 3725867
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 74%
IMG OID: 641272662
Product: PucR family transcriptional regulator
Protein accession: YP_001538029
Protein GI: 159038776
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
              [T] Signal transduction mechanisms
COG ID: [COG2508] Regulator of polyketide synthase expression
TIGRFAM ID:
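
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the database.

```python
# Values copied from the gene record above.
START_BP = 3724107
END_BP = 3725867
GENE_LENGTH_BP = 1761
PROTEIN_LENGTH_AA = 586


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS, minus one for the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the reported gene length (1761 bp) and protein length (586 aa) agree with the start and end coordinates.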


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGGCCG TGTTCCCCAC CGTTCGGGAA GTGCTCGCGC TGGAACCGGT GCGCCACGGC 
GTTCCTCGCC TCGTCGCCGG CGGCGAAGCG CTGGACCGGC GGGTCCGCTG GGTGCACGTG
GCCGAGGTAC CCGACATCGC CTCCCTGCTC GGCGGCGGGG AACTGGTGCT GACCACCGGG
ATCGGGCTAC CCGCCGACAA CGCCGGGCTG CGCGCGTTCA TCGCGGACCT GGCCGCCGTC
GGCGTTGCCG GCCTCGTCGT CGAGCTGGGC CGCCGGTACA TCGGCGGAGT GCCCCAGGCG
ATGGTGGCCG CCGCCGAACG ATGTGGGCTG CCCCTGGTCG AGCTACGCCG TGCCACCCCG
TTCGTGCGGG TCACCGAGGC GGTGCACGCC CTGATCGTGG ACGCTCAGCT CACCGAACTG
CGCGCCACCG AGGAAATCCA CCAGCGTTTC ACCGAGCTGT CGGTGGATGG AGCCGGACCG
GGGGAGGTGG TTCGGCAGGC GGCCGAACTG TCCGGCTGCG CGGTCGTGCT GGAGAACCTG
TCCCGGCAGG TGCTCGCCTA CGACGCCGCG GGAAGCAGCG CCGAACTGCT GCTCGACGGC
TGGGAGCAAC TCTCCCGGCG GATCCTGCCG GCCGGGCGGA CCGCGTACGA CGTCGACAGC
GGCTGGCTGG TGACCATGGT CGGCGCCCGG GGTCAGGACT GGGGGCGGCT GCTGGTGCGC
TGGCTCGGCG GCGGCGCGCT GACCCTCGGG GCACGGCCGG ACACCCCACC CACCCGACTC
ACCATCCTCG TGGAGCGGGC GGCATCGACC CTCGCGCTGG GCCGGTTGAT CCGCCGCGAC
GCCGAGGGCC TGGAACGGCA GTTGCACCGC ACCCTGCTGA CCGCCCTGCT CGACCACTCC
CGGCCGGTGG ACGAGGTGGC CCTGCGGGCC CGCGCCCTCG GCGTGCCGCT GGAGCGGCGG
CACCTGGTGG GTGTGGTGGT CCGGTACCGG GGCGACGATC CGGCGGAGGC CACGCCGGAC
GAGGGCACCC ACGTCCTGGG CGGCGGCCCG CAGTCCGGGC CGGCCCGGCT CCGGGACCTC
GCCGAGGCGG TTGGCCACGC CGTCCACGAG GCGCACCTGA CCGCGTTGAC CAGCGCGCTC
GACGACCAGG CGGTCGGTGC GCTGCTCGCG CTGTCCGACC CGGCAGGTGA GGAACGCGCC
CTCACGGCCT TTGCCGCGGC ACTGCACCGG GCCCGGCCCG ACATGTTCCC CGGACAGGTC
ACACCCCACC CGGACGGCGT CGCGCGGACC ACGGGTCGGC CCGACCCGGC TGGCACGGCC
GTGATCGGCG CCGGCTCGGG CGTGGGCAGC CTGCGGGAGG CACGTCGTTC CCTGGTCGAG
GCGCGGCAGG TCGCCGACGC CGCCCGCCGG GACCGGCGGG ACCTGCCGAT CTTCCGGCTG
CCGCACGTCG GGCTCGCCGG CCTGCTGCAC CTGCTGCGGG ACGAGCCGTC CCTGCAGACG
TTCGTCGAGC GGGAACTGGG CGCCCTCCTG TCCTACGACG CGCAGCACCC CCGGGAGCAG
CTGCTCGGCA CGCTGCGTGC ATACCTGGAC CAGGGCCGGA ACAAATCAGC CGGCGCTGCC
GCGGCGCACC TGTCCCGCCC GGCGTTCTAC GAGCGCCTGG CCCGGATCGG TCGAATCCTC
GACGCGGACC TCGACTCGGT CGACGCCTGC CTCAGTCTGC ACGTGGCCCT GCTGGCTCTG
GATGCCATTC GTACGCCGTA G
 
Protein sequence
MSAVFPTVRE VLALEPVRHG VPRLVAGGEA LDRRVRWVHV AEVPDIASLL GGGELVLTTG 
IGLPADNAGL RAFIADLAAV GVAGLVVELG RRYIGGVPQA MVAAAERCGL PLVELRRATP
FVRVTEAVHA LIVDAQLTEL RATEEIHQRF TELSVDGAGP GEVVRQAAEL SGCAVVLENL
SRQVLAYDAA GSSAELLLDG WEQLSRRILP AGRTAYDVDS GWLVTMVGAR GQDWGRLLVR
WLGGGALTLG ARPDTPPTRL TILVERAAST LALGRLIRRD AEGLERQLHR TLLTALLDHS
RPVDEVALRA RALGVPLERR HLVGVVVRYR GDDPAEATPD EGTHVLGGGP QSGPARLRDL
AEAVGHAVHE AHLTALTSAL DDQAVGALLA LSDPAGEERA LTAFAAALHR ARPDMFPGQV
TPHPDGVART TGRPDPAGTA VIGAGSGVGS LREARRSLVE ARQVADAARR DRRDLPIFRL
PHVGLAGLLH LLRDEPSLQT FVERELGALL SYDAQHPREQ LLGTLRAYLD QGRNKSAGAA
AAHLSRPAFY ERLARIGRIL DADLDSVDAC LSLHVALLAL DAIRTP
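
The GC content field in the record could be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in space-separated blocks of ten bases as shown above; `gc_percent` is a hypothetical helper, and how the database rounds its reported value is not known from this record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces, newlines) between blocks is ignored and the
    comparison is case-insensitive.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First block of ten bases from the gene sequence above:
    # 8 of its 10 bases are G or C.
    print(gc_percent("GTGTCGGCCG"))
```

Passing the full gene sequence to the same function gives a figure that can be compared against the reported 74%.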