Gene Sare_1424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1424 
Symbol 
ID5704813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1647859 
End bp1648845 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content68% 
IMG OID641270934 
ProductRNA polymerase sigma factor SigB 
Protein accessionYP_001536315 
Protein GI159037062 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0865315 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000203032 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATGGACC ACAGGATGCG CGCCACAAGC GGCACCGACA GCCTGACCGA TCTGGATGCC 
ACTGACGAGC GCGGTGTATC CACTGATCTG GTTCGGGCCT ACCTCTACGG CATCGGCAAG
ACGAAGCTGC TGACCGCCGC TCAGGAGGTG GAGCTGTCCC GCCGAATCGA GGCCGGGCTC
TTCGCCGAGG CGAAGTTGGC CGCCTGCACG CCGGTCTCCG CCACGCTCCG GGCCGACCTG
GAACTCGTCG CCGTCGAGGG GCGCGCCGCC AAGGACCACC TGTTGGAGGC GAACCTCCGC
CTGGTGGTCA GCATCGCCAA GCGGTACACC GGCCGTGGGA TGGCCTTCCT CGACCTGATC
CAGGAAGGCA ACCTCGGCCT GATCCGCGCG GTCGAGAAGT TCGACTACAC CAAGGGCTAC
AAGTTCTCCA CCTACGCCAC CTGGTGGATC CGCCAGGCCA TCACCCGCGC CATGGCCGAC
CAGTCCCGCA CCATCCGCAT TCCGGTACAC ATGGTCGAGC AGGTCAACCG GATGGTACGG
ACGCGGCGTG ACCTGTCGGT CTCGCTTGGT CGGGAGCCCA CGGTCACGGA GGTGGCCCGC
GCGTTGGACG TCCCGGAAGT CCAGATCATC GAGCTGATCT CGTACGACCG GGAGCCGGTG
AGCCTGGACC AGGCCGTCGG CGAGGACGGC GAGAGCCCAC TCGGCGACTT CGTCGCGGTG
GTGAACGCGA CGGCCGCGCC CGACAACACC GCCGAGCGAG GCGAGCTGCG TCAGGAGGTA
CGGGGTGTGC TCGCCACCCT GTCCCAGCGG GAACAGGCGG TGATCCGGCT CCGGTTCGGG
CTGGACGACG GGCGACAGCG CACCCTGGAC GAGGTCGGTC GGGAATTCGG CCTCTCCCGG
GAGCGGATCC GCCAGATCGA GAAGGGGACA CTGCGCAAGC TACGCGCCCC GGAGCGGGCG
CAGCGGCTGG CGGCGTACGC CTGCTGA
 
Protein sequence
MMDHRMRATS GTDSLTDLDA TDERGVSTDL VRAYLYGIGK TKLLTAAQEV ELSRRIEAGL 
FAEAKLAACT PVSATLRADL ELVAVEGRAA KDHLLEANLR LVVSIAKRYT GRGMAFLDLI
QEGNLGLIRA VEKFDYTKGY KFSTYATWWI RQAITRAMAD QSRTIRIPVH MVEQVNRMVR
TRRDLSVSLG REPTVTEVAR ALDVPEVQII ELISYDREPV SLDQAVGEDG ESPLGDFVAV
VNATAAPDNT AERGELRQEV RGVLATLSQR EQAVIRLRFG LDDGRQRTLD EVGREFGLSR
ERIRQIEKGT LRKLRAPERA QRLAAYAC