Gene RoseRS_3968 details


Gene Information

Locus tag: RoseRS_3968
Symbol:
ID: 5210952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4966079
End bp: 4967623
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 63%
IMG OID: 640597562
Product: RNA-binding S1 domain-containing protein
Protein accession: YP_001278268
Protein GI: 148658063
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0539] Ribosomal protein S1
TIGRFAM ID:
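
The coordinates and lengths in the table above can be cross-checked with a short Python sketch. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record states neither, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4966079
end_bp = 4967623
gene_length_bp = 1545
protein_length_aa = 514

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, so the protein
# is one residue shorter than the number of codons.
codons = gene_length_bp // 3
coding_codons = codons - 1

print(span_bp == gene_length_bp)           # True
print(gene_length_bp % 3 == 0)             # True
print(coding_codons == protein_length_aa)  # True
```

Under these assumptions the span (1545 bp), the codon count (515) and the protein length (514 aa) agree with one another.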


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00507698
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.122374
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGATC AGGAGCGCCC CGTCGAAGAG CGAAATGAAC TGCCGGGGGA TCTGCCGGCT 
GCTGCACCAG GCGTGGCAGT GGTTGAGGCA GAAGCGCCTC CTCAGCAGCA AGCAATAGAA
TCGCCATCGC CCCCGACCGC CGAAACGACA GGTGACGCCG GGGGGCGCAA AGCGGACGGA
GGTGTCGCGT CATCGCCCCC AACTGCCGAA ACAGCAGGTG ACGCCGGGGG GCATGCAGCG
GATGAAGGCG TCGCATCGTC TGACGGCAGC GAAGGGGAGG AGACCGATCA CGCTGACGCC
TCGCCGGTGA CGGACGACGC CGGGTCTGCA AGCGCGGCGG CGCCTCCCGC CGAAGAAGCG
AGTTATCAGG CGCCGGCTGA GGAAACTCCG GGGCGACCAC GACGGGTGAA AGACCTGGCG
CCGGGTATGG AACTGGAAGG ACGGGTGACG TCTATTGCAT TGTATGGCAT TTTTGTGGAT
ATTGGCGTCG GGCGAGACGG TCTGGTGCAT ATTTCAGAAA TGAGCGATAC GCGGATCGAG
TCACCCAGCG ACCTGGTCAA GATTGGCGAT ACGGTCAAAG TACGGGTCAA AAGTGTTGAA
CCCGACGGGC GCCGTATCAG TCTGACGATG CGCACAAAAG AGCGCAGCGC CGAACCGCGC
AGCGGCCGCG GCAAGAAGAA GCCGGAAGTC GATTACGAGA AGCTGGCTGC GCTGCGAGTC
GGTGACAATG TTGAAGGGAC GGTGACCGGG ATGGCGCCGT TTGGCGTGTT CGTCGATATC
GGCGTCGGCA AAGACGGTCT GGTGCATGTC TCGGAGCTGG CGGAAGGGCG TGTCGAAAAG
GCGGAAGATG TGGTTCAGGT CGGGCAGACC TACACCTTCA AGGTGCTGGA AGTCGATGCA
GCCGGCGCGC GGATCAGCCT GAGCCTGCGG CGGGCGCAGC GCGGGCAGAA ACTCCAGCAA
CTGGAGAAGG GGCAGATTCT GGAAGGGACA ATCAGCGGTC TGGCGCCGTT TGGTGCGTTC
GTCGATATTG GCGTCGGGCG CGACGGTCTG GTGCACATCT CTGAACTGTC GAACACGCAC
GTGGCGCGGG TGGAGGATGT CGTCAAGGTT GGCGACAGGG TGCAGGTGCG GGTGCTCGAC
GTCGATCCGC AGAGCAAGCG CATCAGCCTG AGTCTGCGGT TGGAGGATAC GCCGCGTGAG
TTGCCGCCCC GCGAGGAACG TCCCCGCGAG GAGCGTCCCC GCGAGGAACG TCCCCGTGAG
GAGCGTCCAC GCGGGGAGCG TCCACGTGGG GAAGGGCGTC CGCCGCGTGA AGAACGCCAG
TCGCGGCGCA CTGGGGAACG CCTGCCTGAA ACATACCGCT CGGCTGATAA TGAAGACGAT
TTCAGCGGGA ATGCAACCAT CGAGGATTTG ATGTCGAAGT TCGGCGGATC ACGGCGCAAC
GAACGCCGTC GTCGCCAGGA AGATGACGAT GATATGGACG ATCGTCACCT TCGCCGTCAG
CGCGATGCCA TCCGGCGCAC GTTGCAGCAA CTCGATGATG AGTAG
 
Protein sequence
MTDQERPVEE RNELPGDLPA AAPGVAVVEA EAPPQQQAIE SPSPPTAETT GDAGGRKADG 
GVASSPPTAE TAGDAGGHAA DEGVASSDGS EGEETDHADA SPVTDDAGSA SAAAPPAEEA
SYQAPAEETP GRPRRVKDLA PGMELEGRVT SIALYGIFVD IGVGRDGLVH ISEMSDTRIE
SPSDLVKIGD TVKVRVKSVE PDGRRISLTM RTKERSAEPR SGRGKKKPEV DYEKLAALRV
GDNVEGTVTG MAPFGVFVDI GVGKDGLVHV SELAEGRVEK AEDVVQVGQT YTFKVLEVDA
AGARISLSLR RAQRGQKLQQ LEKGQILEGT ISGLAPFGAF VDIGVGRDGL VHISELSNTH
VARVEDVVKV GDRVQVRVLD VDPQSKRISL SLRLEDTPRE LPPREERPRE ERPREERPRE
ERPRGERPRG EGRPPREERQ SRRTGERLPE TYRSADNEDD FSGNATIEDL MSKFGGSRRN
ERRRRQEDDD DMDDRHLRRQ RDAIRRTLQQ LDDE
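
The relationship between the two sequences above can be illustrated with a minimal Python sketch that strips the block spacing, computes GC content, and translates codons. It is not the database's own pipeline: the function names are hypothetical, and the codon table is built from the standard genetic code, whose amino-acid assignments are the same as those of NCBI translation table 11 (table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG). Only the first and last lines of the gene sequence are used as input.

```python
# Standard genetic code in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases, or 0.0 for an empty sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
# The last line starts at base 1501, so it is in frame.
first_line = "ATGACCGATC AGGAGCGCCC CGTCGAAGAG CGAAATGAAC TGCCGGGGGA TCTGCCGGCT"
last_line = "CGCGATGCCA TCCGGCGCAC GTTGCAGCAA CTCGATGATG AGTAG"

print(translate(first_line))  # MTDQERPVEERNELPGDLPA
print(translate(last_line))   # RDAIRRTLQQLDDE
print(gc_percent(first_line))
```

The two translated fragments match the start and the end of the protein sequence listed above. Passing the full gene sequence to the same functions would give the whole-gene values.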