Gene Sare_2917 details

Gene Information

Locus tag: Sare_2917
Symbol:
ID: 5705037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3299461
End bp: 3300606
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 71%
IMG OID: 641272366
Product: RNA polymerase ECF-subfamily sigma factor
Protein accession: YP_001537734
Protein GI: 159038481
COG category: [K] Transcription
COG ID: [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
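
The gene and protein lengths in this record can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no amino acid. The function names are illustrative only and are not part of any database tool.

```python
# Values copied from the record; the coordinate convention is an assumption.
START_BP = 3299461
END_BP = 3300606


def gene_length(start, end):
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp):
    """Amino acids encoded by a CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")                  # 1146 bp
print(protein_length(length_bp), "aa")  # 381 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.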


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACGGGG CGCTGCTGCG GGACCTCGTG CCCGCGGTGA TCGGTGTCCT CGTCCGGCGT 
GGGGCCGACT TCACGTCGGC CGAGGATGCC GTGCAGGACG CCGTGGTCGA GGCCGTCCGG
GTGTGGCCGG ACGACCCGCC GCGGGATCCC AAGGGCTGGC TGGTCACGGT GGCCTGGCAC
AAGTTCCTCG ACACCGCCCG TGCAGATGCC TGCCGGCGGC GCCGCGAGGT ACGCCTTGAG
GGCGAGCCCA TGCCTGGGCC GGTCGCGGCG GTGGACGACA CGCTCCAGCT GTACTTTCTG
TGCGCTCACC CCTGCCTGAC GCCGGCATCG TCCGTTGCGC TCACGTTGCG TGCGGTCGGC
GGCCTGACCA CGCGGCAGAT CGCGCAGGCG TACCTCGTGC CGGAGCCGAC CATGGCTCAG
CGGATCAGTC GGGCGAAGCG CACCCTCTCG GGCGTCCGGT TCGATCAGCC CGGTGATGTC
GCCACGGTGC TGCGCGTGCT CTACCTGGTC TTCAACGAGG GCTACTCCGG CGACGTCGAC
CTCGCCGCCG AAGCGATCCG ACTTACCCGC CACCTAGCCT CCATGATCAA TCATGCGGAG
GTGGGCGGCC TGCTTGCACT CATGCTGTTG CACCACGCCC GGCGCCCGGC CCGCACCGGC
CCCGACGGCA GGCTTGTGCC CCTTGCCGAG CAGGATCGCG GCCGGTGGCG CAGCCATCTG
ATCGCCGAGG GTGTCCAGGT GCTCCAGGCA GCCCTCGCCC GAGACCGGCT GGGCGAGTTT
CAGGCCCAAG CCGCCATCGC CGCGCTCCAC GCCGACGCCC GGACGGTCGA CGAGACCGAC
TGGGTGCAGA TCGTCGAGTG GTATGACGAA CTGGTGCGAC TCACCGACAG CCCGGTGGCA
CGCCTCAACC GGGCCGTCGC GGTCGGCGAG GCAGACGGCC CGCGGACGGG CCTGGCCGCC
CTGGCGGAGC TCGATCCCGC CCTGCCCCGC CACGCCGCCG TCGCGGCTTA CCTGCACGAA
CGTGACGGTG ATCCAGTGGC CGCAGCGCGG CTCTACACCG AAGCTGCCCG ATCAGTGCCC
AGCCGCTCCG AACGCGACCA CCTCACACGG CAGGCCGCAC GACTCAACGC GCCGCGGCAA
ACCTGA
 
Protein sequence
MNGALLRDLV PAVIGVLVRR GADFTSAEDA VQDAVVEAVR VWPDDPPRDP KGWLVTVAWH 
KFLDTARADA CRRRREVRLE GEPMPGPVAA VDDTLQLYFL CAHPCLTPAS SVALTLRAVG
GLTTRQIAQA YLVPEPTMAQ RISRAKRTLS GVRFDQPGDV ATVLRVLYLV FNEGYSGDVD
LAAEAIRLTR HLASMINHAE VGGLLALMLL HHARRPARTG PDGRLVPLAE QDRGRWRSHL
IAEGVQVLQA ALARDRLGEF QAQAAIAALH ADARTVDETD WVQIVEWYDE LVRLTDSPVA
RLNRAVAVGE ADGPRTGLAA LAELDPALPR HAAVAAYLHE RDGDPVAAAR LYTEAARSVP
SRSERDHLTR QAARLNAPRQ T