Gene Rcas_2015 details


Gene Information

Locus tag: Rcas_2015
Symbol:
ID: 5539493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2582455
End bp: 2583690
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 58%
IMG OID: 640894150
Product: RNA-binding S1 domain-containing protein
Protein accession: YP_001432121
Protein GI: 156741992
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0539] Ribosomal protein S1
TIGRFAM ID: [TIGR00717] ribosomal protein S1
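
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2582455, 2583690)
print(length)                  # 1236
print(protein_length(length))  # 411
```

Both results agree with the Gene Length and Protein Length fields of this record.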


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0652883
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACAGC AAGAACAGGC TGCCACAAAC CATCAACAGA CGAACGCTGC CCAATCGTTC 
GGCGACGCTG CGAATGGCCA GTCAGCCGAT CAGCGTGACG ACCGGGAATT GATGGAGCAG
TTCCTCGCCA ACCCTGCTCA CGACTACCGC AATCTTCAAT ATGGTGATAC GGTCGATGGC
ATCATCATGC GCGTCGGTCG CGATGAGATT CTGGTCGACA TCGGCGCCAA AGCCGAGGGT
GTGGTGCCGG CCAGAGAGAT GCAGTCGCTC TCTGACGATG ATCGAGCGGC GTTGAAACCG
GGCGATCCGC TGCTGGTCTT TGTTGTTCAA TCCGAGGACA AAGAAGGTCG AGCGACGCTC
TCGATCGATC GGGCGCGTCA GGAGAAGAGT TGGCGTCGCT TACAGCAGTG TTATGAGACC
GGCGAGATTA TCGAAGCAAA AGTGATTAAC TACAACAAAG GTGGGCTGCT TGTCAATCTC
GATGGTGTGC GCGGATTTGT GCCTTCCTCG CAGGTCAGCG GCATCGGTCG CGGCTCCGAG
GCGCAGAAGC AGTCGGAGAT GGCGCGCATG GTGGGGCAGA CGCTGGCGTT GAAGGTGATC
GAGATCAATC GTAATCGCAA TCGGTTGATC CTCTCGGAAC GCCAGGCTGC AATGGATGTG
CGCGAAGGGC GCAAAGGTGA GTTGCTGTCG GCGCTGAAAG AAGGCGATGT TCGGGAGGGC
GTCGTCACAT CCGTTTGTGA CTTCGGCGCG TTTGTCGATA TAGGCGGCGC CGATGGGTTG
GTGCATCTTT CGGAACTTTC CTGGAGCCGG GTCAAGCATC CGAGCGAGAT TCTGAAGCCG
GGTGACAAAG TGCAGGTGTA TGTGCTCAGT ATCGATAATG AGCGTAAACG GATTGCGCTC
TCGCTGAAGC GTACCCAGCA CGAGCCGTGG GCCACGGTTG GCGAGCGGTA TCACATTGGC
CAGATGGTTG AGGGTGTCGT GACGCAACTG GCGCCGTTCG GCGCATTCGT GCGGATTGAG
GACGGGGTTG AAGGGCTGAT CCATGTGTCT GAAATGGGTG ATGGACGGGT CCAGCATCCG
CGCGATGTGT TGCAGGAAGG CGATGCGGTC CAGGCACGCA TCATCCGTAT CGATCCGGCG
CGGAAGCGCA TCGGTTTGAG CATGCGCCAG TCATCCGACG ATCAGATCGC GCATCAGTCA
TCCGACAAAG AAGAGGAGTC TGATGCTGAC GAGTGA
 
Protein sequence
MEQQEQAATN HQQTNAAQSF GDAANGQSAD QRDDRELMEQ FLANPAHDYR NLQYGDTVDG 
IIMRVGRDEI LVDIGAKAEG VVPAREMQSL SDDDRAALKP GDPLLVFVVQ SEDKEGRATL
SIDRARQEKS WRRLQQCYET GEIIEAKVIN YNKGGLLVNL DGVRGFVPSS QVSGIGRGSE
AQKQSEMARM VGQTLALKVI EINRNRNRLI LSERQAAMDV REGRKGELLS ALKEGDVREG
VVTSVCDFGA FVDIGGADGL VHLSELSWSR VKHPSEILKP GDKVQVYVLS IDNERKRIAL
SLKRTQHEPW ATVGERYHIG QMVEGVVTQL APFGAFVRIE DGVEGLIHVS EMGDGRVQHP
RDVLQEGDAV QARIIRIDPA RKRIGLSMRQ SSDDQIAHQS SDKEEESDAD E
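
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the pipeline that produced this record: the function names are illustrative, and the codon table is built from the standard amino-acid string in TCAG order. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it permits, which this sketch does not model.

```python
from itertools import product

# Amino acids for all 64 codons, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases into blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGGAACAGC AAGAACAGGC TGCCACAAAC"))  # MEQQEQAATN
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the protein sequence and the GC content field listed in this record.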