Gene Sare_3619 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3619 
Symbol 
ID5708166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4175806 
End bp4177032 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content74% 
IMG OID641273044 
ProductPucR family transcriptional regulator 
Protein accessionYP_001538408 
Protein GI159039155 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism
[T] Signal transduction mechanisms 
COG ID[COG2508] Regulator of polyketide synthase expression 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000689812 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGAGCGAGC CGGGAACCGA ACTGGCGGCC ACGCTGCGCC GGATCGAGCG GGCGGCGGGG 
GCGCTCGCCA CGGCCAGCGT GGCGCGGATG GACGAGACGC TGCCCTGGTT CCGCGAACTC
CCGGCCGACC AGCGCTCCTG GGTCATGCTG GTGGCGCAGG CAGGCGCCCG TTCCCTGGTG
CAATGGCTCC GCGCCGGCGG CGGCACCGCC GACAGCACCC AGGAGGTTTC CGACGAGGTC
TTCGCCACGG CGCCGCAGGC CCTGGCACGG TCCATCAGCC TCCAGCAGAC GGTGGCCCTC
ATCAAGGTGA CCATCGACGT GGTGGAGGAG CAGGTCCCAC ACCTGGCCGC CCCGGGCGAG
GAGCCACAGT TGCGGGATGT GGTGCTGCGC TACTCCCGGG AGATCGCATT CGCCGCCGCC
CGGGTGTATG CGCGGGCCGC CGAGTCCCGC GGTTCCTGGG ACGCGCGGCT TCAGGCTCTC
CTGGTGGACG CGCTGCTGCG GGGTGACTCG CCGGACGTGT TGGCCAGCCG GGCGGCGGCA
CTGGGCTGGG CGGACGCGCC GCCGGTGGCG GTGGCGGTGG GGCGGTCCCC CGGCGGGGAG
GTGTCCGCCG TGTTGCACAC CGTCTACCGG CTGGCCCGGC GGATCGGCGC CGAGGTCATC
GGCGGGGTGC ACGGCGACCG CCTGGTCATC GTGCTCGGTG GCGTGGCCGA TCCGGTGGCC
GCCACCGGCA AGCTGCTCGA CGCCTTCGGC GCCGGCCCGG TCGTGGTGGG CCCGGCCGTG
CCGAGCCTGG ACGAGGCCAC CGACTCCGCC CGGGCCGCGC TCGCCGGGTT CCGTGCTGCC
CCGGCCTGGC CGGCCGCACC GCGGCCGGTC CCCGCAGCCG ACCTGCTACC GGAACGGGCG
CTCGCCGGGG ACGCCGAGGC GCGCCGCCGG CTGCGGCACG ACGTGTACGC CACGCTGGTC
CGCTCCGGCG GGGAACTACT GGAGACCCTG GACGCCTTCT TCTCCGCCAG CGGCACCCTG
GAGAGCGCGG CCCGGGCGCT GTTCGTACAC CCCAACACCG TGCGGTACCG GCTGCGACGG
GTAGCGGAGG TGACCGGGCT CTCCCCGCTC GCGGCCCGGG ACGCGTACGC GCTCCAGGTG
GCGCTCACCG TCGGCCGGCT CGACCCGGTG GTTACCCTCA CACCGAATCG GACAAAACCT
CATATATCTC GTGAGACAGG ACAATAA
 
Protein sequence
MSEPGTELAA TLRRIERAAG ALATASVARM DETLPWFREL PADQRSWVML VAQAGARSLV 
QWLRAGGGTA DSTQEVSDEV FATAPQALAR SISLQQTVAL IKVTIDVVEE QVPHLAAPGE
EPQLRDVVLR YSREIAFAAA RVYARAAESR GSWDARLQAL LVDALLRGDS PDVLASRAAA
LGWADAPPVA VAVGRSPGGE VSAVLHTVYR LARRIGAEVI GGVHGDRLVI VLGGVADPVA
ATGKLLDAFG AGPVVVGPAV PSLDEATDSA RAALAGFRAA PAWPAAPRPV PAADLLPERA
LAGDAEARRR LRHDVYATLV RSGGELLETL DAFFSASGTL ESAARALFVH PNTVRYRLRR
VAEVTGLSPL AARDAYALQV ALTVGRLDPV VTLTPNRTKP HISRETGQ