Gene Sare_0956 details

Gene Information

Locus tag: Sare_0956
Symbol:
ID: 5704492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1078024
End bp: 1079124
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 74%
IMG OID: 641270471
Product: RNA polymerase ECF-subfamily sigma factor
Protein accession: YP_001535859
Protein GI: 159036606
COG category: [K] Transcription
COG ID: [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
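
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Start bp and End bp fields of this record.
start_bp = 1078024
end_bp = 1079124

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1101 366
```

Under those assumptions the result agrees with the listed Gene Length (1101 bp) and Protein Length (366 aa).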


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.112676
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0742081
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTCGCCG CGACGGTGCG CGTCACCGGC GACCTCGACC TCGCCGAGGA GTGCGTGCAG 
GACGCGTACG TGATCGCGCT CGACGCCTGG CTGCACGACG GCGTCCCGGA CAACCCCGGC
GCTTGGCTGA CGACGACCGC ACGGCGGCGG GCCCTCGACG GCCGCCGCCG CGAGCGCACC
CTGCGCGCCA AGCTGCCGCT GCTGGTCGAA CCCGAGGAGT CCACGGTGGA GGACATCACC
GACGACCGGC TGCGCCTGCT CTTCACCTGC TGCCACCCGG CCCTCACCCG GGAGGCACAG
GTCGCGCTCA CGCTCCGGCT GGTCTGCGGG CTCACCACAG CCGAGATTGC TCATGCGTTT
TTGATCTCCG AGGCGACGAT GGCGGCCCGG CTCACCCGGG CCAAGAAGAG GATCGCCGCG
GCCCGGATCG CCTACCGCGC GCCGGCTCCC GAGGAGCTGC CGGACCGGCT GGACGCGGTA
CTGACCGTGG TGCACCTGCT CTACACCACC GGGCACACCG CCCCGGCCGG GGACCGGCTG
GTGCGGGTGG ACCTGGTGGA GAAGGTGTTC GACCTGGCCC GGATGCTGCG GATGCTCATG
CCCGATGAGC GGGAGGTACG CGGGCTGCTG GCCCTGCTGC TGCTCACCGA CGCCCGCCGG
GCGACCCGAA CGGCTACCGA TGGGCGGCTG CTTCTCCTCG CCGAGCAAGA CCGCGGCCGG
TGGGACCGGG CATTGATCGC CGAGGGCGCG GCGCTGGTCC CGGGCGCGCT GCGCGGCGGG
GCCGGCCGTT TCGCGCTGCA GGCGGCCATC GCGGCGCTGC ACGCCGAGGC ACCCACCTAC
GAAGACACCG ACTGGCGCCA GATCGTCGGC CTGTACGACG TGCTGCTGAC GGTCTGGACG
TCACCGGTGG TGGCCCTGAA CCGGGCGGTC GCTGTGTCCA TGGCGGACGG ACCGACCGCC
GCCCTGGCAA CCATCGAGGC GCTGGACGCC GACGGCCGGC TCGCCGGTTA CCGGTACCTG
CCGGCGACTC GGGCTGACCT GCTGCGGCGG CTGGGCCGGC ACACCGAGGC GGCGGCAGGT
ACCGGCAGGC GCTGGAGCTG A
 
Protein sequence
MLAATVRVTG DLDLAEECVQ DAYVIALDAW LHDGVPDNPG AWLTTTARRR ALDGRRRERT 
LRAKLPLLVE PEESTVEDIT DDRLRLLFTC CHPALTREAQ VALTLRLVCG LTTAEIAHAF
LISEATMAAR LTRAKKRIAA ARIAYRAPAP EELPDRLDAV LTVVHLLYTT GHTAPAGDRL
VRVDLVEKVF DLARMLRMLM PDEREVRGLL ALLLLTDARR ATRTATDGRL LLLAEQDRGR
WDRALIAEGA ALVPGALRGG AGRFALQAAI AALHAEAPTY EDTDWRQIVG LYDVLLTVWT
SPVVALNRAV AVSMADGPTA ALATIEALDA DGRLAGYRYL PATRADLLRR LGRHTEAAAG
TGRRWS
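
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It assumes translation table 11 (as listed in the record), whose amino-acid assignments match the standard code, and it treats GTG, ATG and TTG as initiator codons read as methionine; that start-codon set is a simplifying assumption and only a subset of the starts table 11 allows. The `translate` helper and the constant names are hypothetical and written for this example only.

```python
# Codon table in NCBI order (bases T, C, A, G). The amino-acid assignments
# of translation table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Assumption: a simplified set of initiator codons, each read as Met.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, is_start=True):
    """Translate a DNA string (spaces and newlines allowed) up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        # The first codon of a CDS is translated as Met even when it is GTG or TTG.
        if i == 0 and is_start and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First and last rows of the gene sequence shown above.
first_row = "GTGCTCGCCG CGACGGTGCG CGTCACCGGC GACCTCGACC TCGCCGAGGA GTGCGTGCAG"
last_row = "ACCGGCAGGC GCTGGAGCTG A"

print(translate(first_row))                 # MLAATVRVTGDLDLAEECVQ
print(translate(last_row, is_start=False))  # TGRRWS
```

The two printed fragments correspond to the first 20 and last 6 residues of the protein sequence listed above, with the leading GTG read as methionine.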