Gene Smed_3458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3458 
SymbolrpsA 
ID5324345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3665828 
End bp3667534 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content61% 
IMG OID640792409 
Product30S ribosomal protein S1 
Protein accessionYP_001329111 
Protein GI150398644 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCAA CCAACCCCAC CCGTGATGAT TTCGCAGCGC TTCTGGAAGA GTCCTTCGCC 
AAGACGGACC TCGCCGAAGG CTACGTCGCC AAGGGCATCG TCACGGCGAT CGAAAAGGAC
GTCGCAATCG TCGACGTCGG CCTGAAGGTC GAAGGCCGCG TACCGCTGAA GGAATTCGGC
GCCAAGGCCA AGGACGGCAC GCTCAAGGTC GGCGACGAAG TCGAAGTCTA TGTCGAGCGC
ATTGAAAACG CTCTCGGCGA AGCTGTTCTG TCGCGCGAGA AGGCTCGCCG CGAAGAAAGC
TGGCAGCGCC TGGAAGTGAA GTTCGAAGCC GGCGAACGCG TCGAAGGCAT CATCTTCAAC
CAGGTCAAGG GCGGCTTCAC CGTCGATCTC GACGGCGCCG TAGCCTTCCT GCCGCGTTCG
CAGGTCGACA TCCGTCCGAT CCGCGACGTG ACCCCGCTGA TGCACAACCC GCAGCCCTTC
GAAATCCTGA AGATGGACAA GCGCCGCGGC AACATCGTCG TTTCGCGCCG CACGGTTCTC
GAAGAGTCGC GCGCCGAGCA GCGTTCTGAA ATCGTTCAGA ATCTCGAGGA GGGCCAGGTT
GTCGAGGGTG TCGTCAAGAA CATCACCGAT TACGGTGCGT TCGTCGACCT CGGCGGCATC
GACGGTCTGC TGCACGTCAC CGACATGGCA TGGCGTCGCG TCAATCATCC GTCGGAAATC
CTGAACATCG GCCAGCAGGT CAAGGTTCAG ATCATCCGCA TCAACCAGGA AACCCACCGC
ATCTCGCTCG GCATGAAGCA GCTCGAGTCC GATCCGTGGG ACGGCATCGG TGCGAAGTAT
CCGGTCGGCA AGAAGATCTC GGGCACGGTT ACGAACATCA CCGACTACGG TGCTTTCGTC
GAGCTGGAGC CGGGCATCGA AGGCCTGATC CACATTTCCG AGATGTCCTG GACGAAGAAG
AACGTTCACC CCGGCAAGAT CCTGTCCACG AGCCAGGAAG TCGACGTGGT CGTTCTCGAA
GTCGATCCGA CCAAGCGCCG CATTTCGCTC GGCCTCAAGC AGACGCTCGA GAACCCATGG
CAGGCATTCG CGCATAGCCA TCCGGCTGGC ACGGAAGTCG AAGGCGAAGT CAAGAACAAG
ACTGAATTCG GTCTGTTCAT CGGCCTCGAT GGCGATGTCG ACGGCATGGT TCACCTCTCC
GATCTCGACT GGAACCGTCC GGGCGAGCAG GTCATCGAAG AATTCAACAA GGGCGACGTC
GTCCGTGCTG TGGTTCTCGA TGTGGACGTC GACAAGGAGC GTATCTCGCT CGGCATCAAG
CAGCTCGGCC GTGATGCGGT CGGTGAAGCT GCTGCTTCCG GCGACCTGCG CAAGAATGCC
GTCGTTTCGG CCGAAGTCAT CGGCGTCAAC GATGGCGGCA TCGAGGTGCG GCTCGTCAAT
CACGAGGACG TCACCGCCTT CATCCGCCGC GCTGATCTCT CGCGCGACCG CGACGAACAG
CGTCCGGAGC GCTTCTCGGT CGGCCAGACC GTCGACGCGC GCGTCACCAA CTTCTCCAAG
AAGGACCGCA AGATCCAGCT GTCGATCAAG GCTCTGGAAA TCGCGGAAGA GAAGGAAGCC
GTCGCTCAGT TCGGCTCTTC CGACTCGGGT GCTTCGCTTG GCGACATTCT GGGCGCTGCG
CTCAAGAACC GCCAGAACAA CGAGTAA
 
Protein sequence
MSATNPTRDD FAALLEESFA KTDLAEGYVA KGIVTAIEKD VAIVDVGLKV EGRVPLKEFG 
AKAKDGTLKV GDEVEVYVER IENALGEAVL SREKARREES WQRLEVKFEA GERVEGIIFN
QVKGGFTVDL DGAVAFLPRS QVDIRPIRDV TPLMHNPQPF EILKMDKRRG NIVVSRRTVL
EESRAEQRSE IVQNLEEGQV VEGVVKNITD YGAFVDLGGI DGLLHVTDMA WRRVNHPSEI
LNIGQQVKVQ IIRINQETHR ISLGMKQLES DPWDGIGAKY PVGKKISGTV TNITDYGAFV
ELEPGIEGLI HISEMSWTKK NVHPGKILST SQEVDVVVLE VDPTKRRISL GLKQTLENPW
QAFAHSHPAG TEVEGEVKNK TEFGLFIGLD GDVDGMVHLS DLDWNRPGEQ VIEEFNKGDV
VRAVVLDVDV DKERISLGIK QLGRDAVGEA AASGDLRKNA VVSAEVIGVN DGGIEVRLVN
HEDVTAFIRR ADLSRDRDEQ RPERFSVGQT VDARVTNFSK KDRKIQLSIK ALEIAEEKEA
VAQFGSSDSG ASLGDILGAA LKNRQNNE