Gene Clim_0349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0349 
SymbolrpsA 
ID6354343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp385593 
End bp387353 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content49% 
IMG OID642667979 
Product30S ribosomal protein S1 
Protein accessionYP_001942421 
Protein GI189345892 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000135822 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGAAA CACTATCGCT GGAGAAGAAA GTCCAGGAAA GGCCCGCAAG GAAAAAAGTC 
AAAGTTTTCG CTCACTACGA TTCCGCGGCA CTTGCCGAGA TGGAAAAGCT CTATACGAGT
ACGCTGAGCG AAATCAGGGA AGACGAGATC GTCAAGGGCC GTATCGTCTC CATTTCCAAC
AAGGACGTCA CCATTGACGT CGGTTACAAG TCAGAGGGTA TTGTCTCACT GCTTGAATTC
CGTGACGAAG AAGAGGGAGA AGTCAAGGTT GGCGATGAAG TAGAGGTTTA TCTCGAAAAC
ATCGAAGACA AAATGGGGCA GCTCATTCTT TCCAAGAAGA AAGCTGACGT TCTGAGAATC
TGGGACAAGA TCTACGATTC AATTGAAAAC GACACGATCA TCAACGGCAA GATCATCAAC
CGCGTCAAGG GCGGTATGAC TGTCTCCCTG TCCGGTGTTG AAGCCTTCCT TCCCGGTTCG
CAGATCGATG TCAAGCCTGT TCGTGATTTC GATGCCCTGG TCGGTCAGAC TATGGATTTC
AGGGTTGTCA AAATCAATCC CGTTACTCAG AATATTGTTG TCAGTCACAA GGTCATCCTC
GAAGAGGAGT ATGCAGCACG CCGTGAAGAG ATGCTTGCCA ATATCAAGGT TGGTATGGTG
CTCGAAGGTA CGGTCAAAAA TATCACCGAC TTCGGTATTT TTGTCGATCT TGGCGGTCTC
GACGGTCTGG TGCATATCAC CGATATTACC TGGGGCAGAA TCAACCATCC GTCGGAAGTC
GTCGAACTTG ATCAGCCGAT CAAGGTTGTT GTTGTCGGCT TCGATGAGAA CACCAAGCGT
GTCTCTCTCG GCATGAAGCA GCTTGAGTCT CATCCGTGGG AAAACATCGA ACTTAAATAT
CCTGTCGGAT CCAAAGCGAA CGGCCGTGTG GTTTCCATTA CCGATTACGG CGCATTTGTC
GAGATCGAGA AAGGTATTGA GGGACTTGTC CACATTTCCG AAATGAGCTG GACGCAGCAC
ATCAAACATC CGGGTCAGTT CGTTACTCTC GGTCAGGAGG TTGAGTGTGT GATTCTCAAT
ATCGATAAAG AGCACACCAA GCTTTCGCTC TCCATGAAAC GGGTGAACGA AGACCCCTGG
ATCGCGCTTT CAGAAAAATA TATCGAGAAT TCATTGCATA AAGGCACGGT CAGCAACATC
ACCGATTTTG GTGTATTTGT TGAGCTTGAA GCCGGAGTTG ACGGTCTGGT GCACATCTCC
GATCTGTCAT GGACGAAGAA AATCCGCCAT CCGAGCGAAC TGGTCAAGAA AAACCAGGAA
CTGGAAGTCA AGGTGCTGAA ATTTGACGTC AATGCTCGCC GTATCGCTCT CGGTCACAAG
CAGATCAATC CTGATCCGTG GGATGAGTTC GAGCAGAAGT ATGCCGTAGG CGCCGAAACT
CCGGGAAATA TCTCACAGAT CATCGAGAAG GGTGTCATTG TCATTCTGCC CGGCGATGTT
GACGGTTTTG TGCCGGTATC GCATCTGCTT CAGGGCGGCG TGAAGGATAT TCACTCCTCG
TTCGCTGTGG ATAATGAACT TCCGCTTCGC GTGATCGAGT TCGACAAAGA GAACAAAAGG
ATCATTCTTT CGGCTCTCGA ATATTTCAAG GACAAGAGCA AGGAGGAGAT CGAAGCATAT
CTTCAGGCTC ATCCGAACGA AAAGAAAGAG ATCGAGGATG CTACCGCAGA GCTGGAGCCT
CAGCCGAAAG GCAGCAAGTA A
 
Protein sequence
MPETLSLEKK VQERPARKKV KVFAHYDSAA LAEMEKLYTS TLSEIREDEI VKGRIVSISN 
KDVTIDVGYK SEGIVSLLEF RDEEEGEVKV GDEVEVYLEN IEDKMGQLIL SKKKADVLRI
WDKIYDSIEN DTIINGKIIN RVKGGMTVSL SGVEAFLPGS QIDVKPVRDF DALVGQTMDF
RVVKINPVTQ NIVVSHKVIL EEEYAARREE MLANIKVGMV LEGTVKNITD FGIFVDLGGL
DGLVHITDIT WGRINHPSEV VELDQPIKVV VVGFDENTKR VSLGMKQLES HPWENIELKY
PVGSKANGRV VSITDYGAFV EIEKGIEGLV HISEMSWTQH IKHPGQFVTL GQEVECVILN
IDKEHTKLSL SMKRVNEDPW IALSEKYIEN SLHKGTVSNI TDFGVFVELE AGVDGLVHIS
DLSWTKKIRH PSELVKKNQE LEVKVLKFDV NARRIALGHK QINPDPWDEF EQKYAVGAET
PGNISQIIEK GVIVILPGDV DGFVPVSHLL QGGVKDIHSS FAVDNELPLR VIEFDKENKR
IILSALEYFK DKSKEEIEAY LQAHPNEKKE IEDATAELEP QPKGSK