Gene RoseRS_0191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_0191 
Symbol 
ID5207126 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp237364 
End bp238572 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content59% 
IMG OID640593821 
ProductRpoD family RNA polymerase sigma factor 
Protein accessionYP_001274577 
Protein GI148654372 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000166048 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00192271 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAATCAA TGCAAGAAGG TAGACACATG GCCACCGAGA TCGTGGAGCA GACACAACAG 
GCGTGGGCGC AAACCCTCGA GTATCTGCTG GAAATCGGGC GCACACGCGG GTTCCTTACC
TACAACGAAA TCCTTGAAGC GTTACCGCAA CCTGAGCACC ACATTGCTGA TGTTGATCAA
CTCTATGCTT CCCTTCAAGC AGAGGGCATT CGCGTCGTCG AAACCCCGCT CGACATCCAC
GACAACGGTT CGACCGGCGA CGATGAGTTG CTGGCGGATA TGCCCGACCT GACCGATGTG
GCGCTCGATG ATCCGGTCCG CATGTATTTG CAGGAGATCG GTCAGGTTCC ACTCCTGTCG
GCGGAACAGG AAGTCATGCT GGCAAAGGCG ATGGAAGCCG GTCACCGTGC GCGTCGCGCG
CTCGAACGCG AAGAGTACAG CTCCTGGCAG GAGCGCGTGA TGTACGAGCA GCAGGTCGCG
CAGGGGAATG AGGCGCGCCA GCACCTGATC CAGGCCAACC TGCGACTGGT CGTTTCGATT
GCCAAGAAGT ACACATCGTA TGGGCTGACG ATGATGGACC TGGTGCAGGA GGGCAATATC
GGTCTCATGC GCGCAGTCGA AAAGTTCGAC TATACCAAAG GGCACAAATT CTCCACGTAT
GCCACATGGT GGATCCGCCA GGCGATCACC CGCGCCATCG CCGATCAGAG CCGCACCATT
CGTCTGCCGG TGCATATGGG TGAGGCGATC AGCCAGGTGA AGCGTACCTC GCACAAACTC
CAGCAGACGA TGCAGCGCGA ACCTACGCCG GAAGAGATCG CCGACGCAAT GGGCATCAGT
TCGACGAAGG TACGCCGCAC GCTGGAGGCG TCGATGCACC CGCTCTCGCT CGAAATGCCG
GTCGGGCAGG AAGGTGAAGG GCGGATGGGC GACTTTATCG AAGACGACCG GATCTCGACG
CCGGCTGAGG CTGCTGCGGC TTCGATGTTG CGTGAGCAAC TTGAAGAGGT GTTGCAGAAG
CTGCCGGAGC GCGAGCGCAA GATTATTCAG TTGCGCTACG GCTTGAAAGA TGGGCGGTAT
CGCACGCTTG AAGAGGTCGG CATGGAGTTT GGCATTACGC GCGAACGCAT CCGCCAGATC
GAAGCGGTGG CGCTTCGAAA ATTGCGCCAT CCGCACCTCG GTAAGAAGTT GCGCGGCTAT
CTCGATTGA
 
Protein sequence
MKSMQEGRHM ATEIVEQTQQ AWAQTLEYLL EIGRTRGFLT YNEILEALPQ PEHHIADVDQ 
LYASLQAEGI RVVETPLDIH DNGSTGDDEL LADMPDLTDV ALDDPVRMYL QEIGQVPLLS
AEQEVMLAKA MEAGHRARRA LEREEYSSWQ ERVMYEQQVA QGNEARQHLI QANLRLVVSI
AKKYTSYGLT MMDLVQEGNI GLMRAVEKFD YTKGHKFSTY ATWWIRQAIT RAIADQSRTI
RLPVHMGEAI SQVKRTSHKL QQTMQREPTP EEIADAMGIS STKVRRTLEA SMHPLSLEMP
VGQEGEGRMG DFIEDDRIST PAEAAAASML REQLEEVLQK LPERERKIIQ LRYGLKDGRY
RTLEEVGMEF GITRERIRQI EAVALRKLRH PHLGKKLRGY LD