Gene Rcas_1422 details

Gene Information

Locus tag: Rcas_1422
Symbol:
ID: 5538895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1817226
End bp: 1818737
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 65%
IMG OID: 640893559
Product: RNA-binding S1 domain-containing protein
Protein accession: YP_001431535
Protein GI: 156741406
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0539] Ribosomal protein S1
TIGRFAM ID:
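
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a short sketch. It assumes inclusive, 1-based coordinates and that the gene length includes the stop codon while the protein length does not; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1817226, 1818737)
print(length)                  # 1512
print(protein_length(length))  # 503
```

Both values agree with the Gene Length and Protein Length fields of this record.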


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.324209
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGATC AGGAGCGTCC CGTCGAAGAG CGGAATGAAC TGCTGGGCGA TTCGGCGCCG 
ACTGCGGCAG GCGTTGCGGT GGCGGACGCG GAAACTCAGC AGCAGTCGAT AGAGTCTCCC
CCTGTTGGGG AAACGCCGGA TGGCGCCGGG GCGGCAGCGG ATGTGAGCGA CGTGGCGGTG
TCGTCTGATG GCGGCGATAG CCGCGAATCC GTCCGCGCCG CCGATGAGTC GCCGGCTGCG
GTGGCGGAAA GCGCCTCGGA AGCGCCGGTT CCTGCTGCTG AAACGCCGCC GACCGGGAGC
GCCGAAGTCG CGCCGGAAGC GGCGGCGCCT GCCGGTACTG CCGAAGCCGC AAGTTATCAG
GCGCCGGCTG AAGCGCCAAC CGGACGCCCG CGACGGGTGA AGGACCTGGC GCCCGGTATG
GAACTGGAAG GACGGGTCAC CTCGATTGCG CTCTACGGCA TCTTCGTTGA TATTGGCGTC
GGGCGCGACG GTCTGGTGCA TATTTCGGAG ATGAGCGACA CCCGTATCGA ATCGCCGAGT
GATCTGGTCA AGATTGGCGA TACGGTGAAG GTGCGGGTAA AGAGCGTCGA ACCCGATGGT
CGCCGGATCA GCCTGACGAT GCGCATGAAG GAGCGGGGCG CGGAACCGCG CAGTGGTCGC
GGCAAAAAGA AGCCCGAGGT GGATTACGAT AAACTTGCTG CGCTGCGCGT CGGCGATAAT
GTCGAGGGGA CGGTGACCGG GCTGGCGCCG TTTGGCGTGT TCGTCGATAT TGGCGTCGGC
AAGGATGGGC TGGTGCATGT GTCGGAACTG GCGGAAGGGC GCGTCGAAAA GGCTGAGGAT
GTCGTGCAGG TTGGTCAGAC CTATACCTTC AAGGTGCTGG AAGTCGATGC CGAGGGCGCT
CGCATCAGCC TGAGTCTGCG CCGGGCGCAG CGTGGTCAAA AGTTGCAGCA ACTGGAGAAG
GGGCAGATTC TCGAAGGCAC GATCAGCGGT CTGGCGCCGT TTGGCGCGTT CGTCGATATT
GGCGTCGGGC GCGACGGGCT GGTGCATATT TCTGAGTTGT CGAACGCGCG TGTGGCGCGC
GTCGAAGATG CGGTCAAGGT TGGCGATAAG GTGCAGGTGC GGGTGCTCGA TGTCGATCCG
CAGAGCAAAC GGATCAGCCT GAGCCTGCGG CTGGAGGATA CGCCGCGCGA GCCGCCGCCG
CGTGAGGAAC GACCGCGTGA GGAACGACCG CGGGAAGAGC GACCGCGCAT GGAGCGCGCA
GTGCGCAGCG AAGGGCGCCC GCCGCGTGAA GAGCGCCCGC CGCGGCGTGA GCGTGTCAGC
GATGCCTACT CCCCGGAGGA GGATGATTTT GGTGGGAATG CCACCCTCGA CGATTTGATG
TCGAAGTTCG GCGGACCGCG CCGCAGTGAG CGTCGCCGCC GCCAGGACGA TGACGATGAT
GTGGAAGACC GGAGTCTCCG CCGCCAGCGC GACGCTATTC GTCGCACGCT CCAACAACTC
GACGACGATT GA
 
Protein sequence
MTDQERPVEE RNELLGDSAP TAAGVAVADA ETQQQSIESP PVGETPDGAG AAADVSDVAV 
SSDGGDSRES VRAADESPAA VAESASEAPV PAAETPPTGS AEVAPEAAAP AGTAEAASYQ
APAEAPTGRP RRVKDLAPGM ELEGRVTSIA LYGIFVDIGV GRDGLVHISE MSDTRIESPS
DLVKIGDTVK VRVKSVEPDG RRISLTMRMK ERGAEPRSGR GKKKPEVDYD KLAALRVGDN
VEGTVTGLAP FGVFVDIGVG KDGLVHVSEL AEGRVEKAED VVQVGQTYTF KVLEVDAEGA
RISLSLRRAQ RGQKLQQLEK GQILEGTISG LAPFGAFVDI GVGRDGLVHI SELSNARVAR
VEDAVKVGDK VQVRVLDVDP QSKRISLSLR LEDTPREPPP REERPREERP REERPRMERA
VRSEGRPPRE ERPPRRERVS DAYSPEEDDF GGNATLDDLM SKFGGPRRSE RRRRQDDDDD
VEDRSLRRQR DAIRRTLQQL DDD
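
As a spot check that the protein sequence corresponds to the gene sequence, the first and last codons can be translated with a minimal sketch. It uses the standard codon assignments, which translation table 11 shares for elongation; the table's alternative start codons are not handled, which is harmless here because the gene begins with ATG. The helper name `translate` is illustrative only.

```python
# Codon table built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGACCGATC AGGAGCGTCC CGTCGAAGAG"))  # MTDQERPVEE
print(translate("GACGACGATT GA"))                     # DDD
```

The two results match the first ten and last three residues of the protein sequence.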