Gene Rcas_0934 details


Gene Information

Locus tag: Rcas_0934
Symbol:
ID: 5538400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1239605
End bp: 1240771
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 60%
IMG OID: 640893083
Product: RNA-binding S1 domain-containing protein
Protein accession: YP_001431066
Protein GI: 156740937
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0539] Ribosomal protein S1
TIGRFAM ID: [TIGR00717] ribosomal protein S1
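
The start, end, and length fields in this record are mutually consistent, which the short Python sketch below checks. It assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (Start bp / End bp).
start_bp = 1239605
end_bp = 1240771

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length and Protein Length.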


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000731277
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.167396
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGCAG TACCCGACAA TGCCGTCCAG GAAACAGAAG ATTTCGATTG GACGCAGATG 
CTCGACGACT ACGACTATGC GCGTCCGCAA CGCGGCGAGG TGCGCGAGGG CGTGGTCATG
AAGATTGAAG ACGGCGGTAT TCTGGTCTCG ATTGGCACCA AGCGCGAAGG GATCATTCCG
ATTGCCGACG TGCGCGCTAT CGGCGATGAG GTGTTGAACA ACCTGAAGGT GGGCGATCGG
ATTCAGGTGT ACGTTCAGGA CCCGGAGAAT CGCCAGGGCG ATCTGGTCTT GTCGCTGACG
ATGGTACAAG TTGCGCGCGA TTGGGAAGAA GCGGCTCGCC TGAGCGCTGA GGGCGGCATT
GTGCAGGGCC AGGTTATTGG CTACAACAAA GGTGGCTTGC TGGTACAGTT CAATCGCATT
CGTGGGTTTG TGCCGGCATC TCAGGTGGCG CAACTCCATG GACGCACTGC TGCCGAGGAA
CGGCAGCAGG CGTTGCAACG CATGGTTGGT CAGACCATCC CGCTGAAGGT GATCGAGGTG
GATCGTGATC GCAATCGGTT GGTGCTATCG GAGCGCAGCG CAACGCAGGA GTGGCGCAAG
GCGCAGAAGC AGCGTCTGCT TACGGAACTT CAGCCGGGCG ATATTTTGAC CGGGCGCGTC
AATCAACTGA CGAACTTCGG CGCATTCATC GATCTCGGCG GCGCCGATGG TCTGGCGCAT
ATCTCCGAAT TGTCGTGGCA GCGCGTCAAC CATCCCCGCG AGGTGCTGTC GCCAGGGCAG
GAAGTGAGGG TCATGGTCGT GGAGATCGAT GCCGAACGTG AACGCATTGG TCTCAGCCTG
CGCCGCCTTC AACCCAATCC ATGGGATACA ATCGATCAGC GCTACTCGCT TGGACAACTC
GTGAGCGGTC CGGTGACGAA CGTTGCACCG TTTGGCGCAT TTGTGCAGAT AGAAGAGGCG
GTCGAAGGTC TGATCCACGC CAGCGAACTC GACGCCGATC CGCAGGCGCA GCCGCGCGAT
CTGTTGCAGC CCGGTCAGAT CATCACGGCC CGAATTATCA GCCTCGATAA GCAGCGCCAG
CGTATGGGGC TTAGCCTGCG CCGCAACAGC GCCGATGAAC CGCCGCCGGA GGAGACGCCG
GTTGAGGCGC CGTCCACCGA TATGTAA
 
Protein sequence
MKAVPDNAVQ ETEDFDWTQM LDDYDYARPQ RGEVREGVVM KIEDGGILVS IGTKREGIIP 
IADVRAIGDE VLNNLKVGDR IQVYVQDPEN RQGDLVLSLT MVQVARDWEE AARLSAEGGI
VQGQVIGYNK GGLLVQFNRI RGFVPASQVA QLHGRTAAEE RQQALQRMVG QTIPLKVIEV
DRDRNRLVLS ERSATQEWRK AQKQRLLTEL QPGDILTGRV NQLTNFGAFI DLGGADGLAH
ISELSWQRVN HPREVLSPGQ EVRVMVVEID AERERIGLSL RRLQPNPWDT IDQRYSLGQL
VSGPVTNVAP FGAFVQIEEA VEGLIHASEL DADPQAQPRD LLQPGQIITA RIISLDKQRQ
RMGLSLRRNS ADEPPPEETP VEAPSTDM
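
The sequences above are printed in blocks of 10 characters. The Python sketch below shows one way to strip that spacing, compute a GC fraction, and translate the DNA. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, not from any library. The codon table is the standard one; translation table 11 assigns the same amino acids to codons and differs mainly in the start codons it permits, which does not matter here because this gene starts with ATG. Only the first line of the gene sequence is used as input.

```python
# Standard codon table built from the NCBI "TCAG" ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAGGCAG TACCCGACAA TGCCGTCCAG GAAACAGAAG ATTTCGATTG GACGCAGATG"

print(len(clean(first_line)))
print(translate(first_line))
print(gc_fraction(first_line))
```

Passing the full gene sequence to the same functions gives the values to compare against the Gene Length, Protein sequence, and GC content fields.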