Gene Sare_2937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2937 
Symbol 
ID5705242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3328358 
End bp3329431 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content65% 
IMG OID641272386 
ProductRNA polymerase factor sigma-70 
Protein accessionYP_001537754 
Protein GI159038501 
COG category[K] Transcription 
COG ID[COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02960] RNA polymerase sigma-70 factor, TIGR02960 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0803223 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.127014 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCTACC TCAATCCGAG ACCGGAGCCG TCGCGTGTCG CACCGCTGGC CGAGCAGGCG 
CCGACGGACG AGCCGGCCTT CGTTCGCCAC ATAGAACGCC ATCGTCGCGA GCTTCAGGTG
CACTGCTACC GCATGCTCGG ATCGTTCGAG GACTCCGAGG ACCTGGTTCA GGAGACATTT
CTGCGTGCGT GGCGCGGCCG CCATGGCTTC AAGGGCACGT CGACGATGCG AGCCTGGCTC
TACCGGATCG CCACCAACGC ATGTCTGGAC TTTCTCAGCC GCCACGCCCG CTGGCCACTG
ACCCACTCGT GGAGTCACTC GGTCGGCGGG GCGCCCGCGC CCGACGACGT GCCCTGGCTA
CAGCCATATC CGGACCACCT GCTCGACCTT GCCGAGCCCA GCGGACGCGA ACCGGACGCC
GTGGTCGTAG CCAAGGAAAC GATCGAGCTG GCGTTCCTCG TGGCCGTTCA GCATCTGCCA
CCACGGCAGC GAGCCGTCCT GATCCTGCGC GACATCCTGG GTCAGTCCGC CAGCGACACG
GCCGGCGTCC TGGAGCTCAG CGTGCCAGCG GTGAACAGTG CACTGCAACG TGCCCGCATC
AGGTTGAGGG AACATCTTCC ACGACAACGA TCGCAGTGGA CACCAGAGGT GGGGCCGACG
CAGGCTGAGA AAGCTCTCGC GCGGCGCTAC ATCGCCGCGG TGGAAAATGC CGACGTGGAC
GCGATGGCTC AGCTGCTGCA CGACGACGTT CGCGCCACCA TGCCCATGCC GCAACCCTGC
GAGCCGTCGC AGTCCGGTCT GCCCAGCACC CGGTGGATCG GACGAGCCGC CTATCTCGCC
GGTTTCGGCA TGGGGCTCGA CCCCGCATCG CCTGCGTATT TTGGACAGTG GCTCTGTATT
CCCACCGAAG CCAATCGGCA ACCAGCGATT GCGTGCTACA CCCGCCGCGA GCAGCGAGAC
GTGTGGCAGG CACAAGTCAT CGACGTACTC CAGATTCGCG GTGACAAGAT TGTAGAGATC
ACCGCCTTCG GGCCTGAGAA CTTCACCAGG TTCGGTCTTC CACTCACCCG ATGA
 
Protein sequence
MSYLNPRPEP SRVAPLAEQA PTDEPAFVRH IERHRRELQV HCYRMLGSFE DSEDLVQETF 
LRAWRGRHGF KGTSTMRAWL YRIATNACLD FLSRHARWPL THSWSHSVGG APAPDDVPWL
QPYPDHLLDL AEPSGREPDA VVVAKETIEL AFLVAVQHLP PRQRAVLILR DILGQSASDT
AGVLELSVPA VNSALQRARI RLREHLPRQR SQWTPEVGPT QAEKALARRY IAAVENADVD
AMAQLLHDDV RATMPMPQPC EPSQSGLPST RWIGRAAYLA GFGMGLDPAS PAYFGQWLCI
PTEANRQPAI ACYTRREQRD VWQAQVIDVL QIRGDKIVEI TAFGPENFTR FGLPLTR