Gene Sare_4650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4650 
Symbol 
ID5705706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5268851 
End bp5270080 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content71% 
IMG OID641274050 
ProductRNA polymerase ECF-subfamily sigma factor 
Protein accessionYP_001539397 
Protein GI159040144 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.267181 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.578872 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGCGGG TCAGCGAGGC GGTCGCCACC GCCTACGCCG AGCACTGGGG TCGCATTGTC 
GCCCTGCTGA TCCGGCTTGC TGGGGACTGG GACCTGGCGG AGGAGTGCGC CCAGGACGCC
TTTGCCGAGG CGCTCACGCG CTGGCCGACG GAGGGGATAC CGAACCGCCC TGGGGGATGG
CTCACCACCA CGGCACGTAA TCGTGCGGTG GACCGGCTCC GTCGGTCCAC CGTGGAAGCC
AGAAAGCTGC GCGACGTGTC CCGGCTGGGG CAGCCGGCCC CGGTCGGCAA CTTTCCCGAC
GAACGCCTCG AGCTGATGTT CACCTGTGCG CACCCGGCGC TCACCAGCGA GGCGCAGGTC
GCGCTCATTC TGCGCTGTCT GGTCGGGCTC CGGACCGCGG AGATCGCTCG GGCGTTCCTG
GTGTCGGAAC ACACCATGGG GCAACGGCTG TTCCGCGCGA AGAACAAGAT CCGCCATGCC
GGAATCCCGT TTCGGGTGCC GCCGGCCCAG CTGCTGCCGG AGCGACTGTC GGCCGTACTC
GCGGTGCTCT ACCTGCTGTT CAACGAGGGC TACGCGGCGA CCGCCGGTAC GAACCTCGTC
AAGGCCAGCC TCTCCGGAGA GGCGATCCGG CTGGCCCGGC TCCTCACCAC CCTGATGCCG
GCTGAGCCCG AAGCGCGCGG ACTACTTGCG CTCATGCTGC TGCACGACGC CCGCCGTGCG
TCTCGCGTAG ATGAGCACGG TGACCTCGTC ACCCTCGCCG ACCAGGACCG TTCGGCCTGG
AACCACACCC AGATCGCCGA AGCGGTCGCA CTGCTGGAAC AGGCGCTGGC CCAGCGCCGC
CCCGGCGCCT ACCAGGTGCA GGCGGCGATC GCCGCGGTCC ACGCCGAGGC GTCCGAGGCG
GCGACGACGG ACTGGCCGCA GATCGTCGGC CTGTACGCGC AACTCATCCG CCTGGCACCC
AGCCCGGTCG TCGAGCTCAA CCGGGCGGTG GCCGTGGCGA TGACCGACGG GCCCGAAGCC
GGACTGGCGT TGGTGGATCG CCTGGCCGCC GCCGGTACGC TCAACGACTA CTACCTGCTG
CCGGCGACCC GGGCCGACCT GCTGCGCCGC CTGGGAAAGC ACTCCGAGGC GACGGTCGCC
TACCGCCGGG CACTCGATCT GTGCGCTACC GACGCCGAGC GCCGGTACCT GTGCCGGCGC
CTGCGCGAGG TGTCGGCACG CCTCTCGTAG
 
Protein sequence
MGRVSEAVAT AYAEHWGRIV ALLIRLAGDW DLAEECAQDA FAEALTRWPT EGIPNRPGGW 
LTTTARNRAV DRLRRSTVEA RKLRDVSRLG QPAPVGNFPD ERLELMFTCA HPALTSEAQV
ALILRCLVGL RTAEIARAFL VSEHTMGQRL FRAKNKIRHA GIPFRVPPAQ LLPERLSAVL
AVLYLLFNEG YAATAGTNLV KASLSGEAIR LARLLTTLMP AEPEARGLLA LMLLHDARRA
SRVDEHGDLV TLADQDRSAW NHTQIAEAVA LLEQALAQRR PGAYQVQAAI AAVHAEASEA
ATTDWPQIVG LYAQLIRLAP SPVVELNRAV AVAMTDGPEA GLALVDRLAA AGTLNDYYLL
PATRADLLRR LGKHSEATVA YRRALDLCAT DAERRYLCRR LREVSARLS