Gene RoseRS_0032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_0032 
Symbol 
ID5206965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp33506 
End bp34603 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content61% 
IMG OID640593666 
Productstage II sporulation E family protein 
Protein accessionYP_001274425 
Protein GI148654220 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAGTC AACGCAAACG ACCTTTTCAC CGCTTCACGC TGCTCAGTCA CGCAGCGACC 
GGTCGACTGC GCCGCAGGGA GCGACCTGCA CAGCCGCGGC TCACCCCGGA GGAAGCGCGT
CGTCAGGCGG AAATCGAGCA CGAACTGCTC CTGGCGCGTG ATATTCAACA AGGGTTACTG
CTCGAAGCGG TGCCACGCCT GCCAGGATGG GAAATTCACG CCATTTCGCT GCCAGCGCGC
GATCTGGGAG GCGACCTGTA CGATTTTTTG CCGCTCGGTG AGGAACGCCA CGGGATCATG
ATCGGCGATG TTTCAGGAAA AGGGTTGCCA GCAGCCCTCC GTATGGCCGT CGCGCGTACC
GTGTTTCGCT ATGCCGCCCG GCGCGGCGCA ACACCCGGTC CGACGCTTGC GGACGTTAAT
CGCGGGATCA TCGCCGACAT TCCACAGGGC ATGATCACCA TGCTGTATGC CGTGCTCGAT
CTGCGCCACG GTATTGTGCA GGTGGCGAAT GCCGGGCATC ATTATCCGCT GCTGCTCAAC
GGGCGCGTCA GCGAACTGGA ACTCTCAGGA TTGCCGCTTG GCGTCGATGA CGATGTTGAT
TACGAAGAGA TATGCGCCGA TATCGAACCG GGCGCCACGG TGATGATGTA CACCGATGGC
GTGGTCGAGG CGACAAATAG CAGGGGCGAA TACTTCGGGT ACGAGCGGTT GGAGCGACTG
TTGATCGAAA GCGCAACCCT GAAGCCGCGT GCCCTGGTTG CACGGTTGCT GCACGAACTG
CGCGCCTGGA GCGACGCCGG TCAGGATGAC GATATTACCG TTGTGGCGGT GCGACGACGG
TTCGAGCGAC TCGCCGATGA GTTGTACAGC ATTCTCCGCG ATGTCCTGGG TGATGATCGC
GCCGGGCAGG CCTGGGAGAC GTTGCCGCGC CCCGATGACC ACGAAGGCGC CGATGCCTGG
ACGGAAGCCT TGCCGGAGAT CGTCAAAGCG GTGCAGAGTC GTTTCGGGCG CGGTCTGGCG
CGCGAGTTGA ACGCGCAGAT CCGTCTGACG CTCGAAGAAT ACCGAATCAT GAAAAAATAT
GGACCAATGC GCTACTAA
 
Protein sequence
MRSQRKRPFH RFTLLSHAAT GRLRRRERPA QPRLTPEEAR RQAEIEHELL LARDIQQGLL 
LEAVPRLPGW EIHAISLPAR DLGGDLYDFL PLGEERHGIM IGDVSGKGLP AALRMAVART
VFRYAARRGA TPGPTLADVN RGIIADIPQG MITMLYAVLD LRHGIVQVAN AGHHYPLLLN
GRVSELELSG LPLGVDDDVD YEEICADIEP GATVMMYTDG VVEATNSRGE YFGYERLERL
LIESATLKPR ALVARLLHEL RAWSDAGQDD DITVVAVRRR FERLADELYS ILRDVLGDDR
AGQAWETLPR PDDHEGADAW TEALPEIVKA VQSRFGRGLA RELNAQIRLT LEEYRIMKKY
GPMRY