Gene SNSL254_A1014 details


Gene Information

Locus tag: SNSL254_A1014
Symbol: rpsA
ID: 6486393
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 1026270
End bp: 1027943
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 51%
IMG OID: 642736420
Product: 30S ribosomal protein S1
Protein accession: YP_002040179
Protein GI: 194444412
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0539] Ribosomal protein S1
TIGRFAM ID: [TIGR00717] ribosomal protein S1
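
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1026270
end_bp = 1027943

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1674 0 557
```

Both derived values agree with the Gene Length (1674 bp) and Protein Length (557 aa) fields.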


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.535746
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGAAT CTTTTGCTCA ACTCTTTGAA GAATCCTTAA AAGAAATCGA AACCCGCCCG 
GGTTCCATCG TTCGTGGTGT TGTTGTTGCT ATCGACAAAG ACGTAGTACT GGTTGACGCC
GGTCTGAAAT CTGAGTCTGC CATTCCGGCT GAGCAGTTCA AAAACGCCCA GGGCGAACTG
GAAATCCAGG TTGGTGACGA AGTTGACGTT GCTCTGGATG CAGTAGAAGA CGGCTTCGGT
GAAACTCTGC TCTCTCGTGA GAAAGCTAAA CGTCACGAAG CATGGATCAC GCTGGAAAAA
GCTTACGAAG ATGCTGAAAC TGTTACCGGT GTTATCAACG GCAAAGTTAA GGGTGGCTTC
ACTGTTGAGC TGAACGGTAT TCGCGCGTTC CTGCCAGGTT CTCTGGTAGA CGTTCGTCCG
GTGCGTGACA CTCTGCACCT GGAAGGCAAA GAGCTTGAAT TCAAAGTAAT CAAGCTGGAC
CAGAAACGTA ACAACGTTGT GGTTTCTCGT CGTGCCGTTA TCGAATCCGA AAACAGCGCA
GAACGCGATC AGCTGCTGGA AAACCTGCAG GAAGGCATGG AAGTTAAAGG TATCGTTAAG
AACCTCACTG ACTACGGTGC ATTCGTTGAT CTGGGCGGCG TTGACGGCCT GCTGCACATC
ACTGATATGG CCTGGAAACG CGTTAAGCAT CCGAGCGAAA TCGTGAACGT TGGCGACGAA
ATCAACGTGA AAGTGCTGAA ATTCGACCGC GAGCGTACCC GTGTATCCCT GGGTCTGAAA
CAGCTGGGCG AAGATCCGTG GGTTGCTATC GCTAAACGTT ATCCGGAAGG TACCAAACTG
ACCGGTCGCG TGACCAACCT GACCGACTAC GGCTGCTTCG TTGAAATCGA AGAAGGCGTT
GAAGGCCTGG TTCACGTTTC CGAAATGGAC TGGACCAACA AAAACATCCA CCCGTCCAAA
GTGGTTAACG TTGGCGACGT AGTGGAAGTG ATGGTTCTGG ATATCGACGA AGAACGTCGT
CGTATCTCCT TAGGGTTGAA GCAGTGCAAA TCTAACCCGT GGCAGCAGTT CGCAGAAACC
CACAACAAGG GCGACCGCGT TGAAGGTAAA ATCAAGTCTA TCACTGACTT CGGTATCTTC
ATCGGCCTGG ACGGCGGCAT CGACGGCCTG GTTCACCTGT CTGACATCTC CTGGAACGTT
GCAGGCGAAG AAGCAGTTCG TGAATACAAA AAAGGCGACG AAATCGCTGC AGTTGTTCTG
CAGGTTGACG CAGAACGTGA ACGTATCTCC TTGGGCGTTA AACAGCTCGC AGAAGATCCG
TTCAACAACT GGGTTGCTCT GAACAAGAAA GGCGCTATCG TAACCGGTAA AGTCACTGCA
GTTGACGCGA AAGGCGCAAC CGTAGAACTG GCTGACGGCG TTGAAGGTTA CCTGCGTGCT
TCTGAAGCAT CCCGTGACCG CGTTGAAGAT GCGACTCTGG TTCTGAGCGT TGGCGACGAC
GTTGAAGCTA AATTCACCGG CGTTGATCGT AAAAACCGCG CAATCAGCCT GTCTGTTCGT
GCGAAAGACG AAGCTGACGA GAAAGATGCC ATCGCAACTG TTAACAAACA GGAAGATGCA
AACTTCTCTA ACAACGCAAT GGCTGAAGCA TTCAAAGCAG CTAAAGGCGA GTAA
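
The GC content field can be recomputed from the gene sequence as printed. This is a rough sketch, assuming the sequence is pasted as a string with the spaces and line breaks of the 10-base blocks left in. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (the spaces and newlines of the block layout) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: the first line of the gene sequence only, as an illustration.
first_line = "ATGACTGAAT CTTTTGCTCA ACTCTTTGAA GAATCCTTAA AAGAAATCGA AACCCGCCCG"
print(round(gc_percent(first_line), 1))
```

Running the function over the full 1674 bp sequence gives a value to compare with the GC content field. A single line of the sequence is not expected to match the whole-gene figure.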
 
Protein sequence
MTESFAQLFE ESLKEIETRP GSIVRGVVVA IDKDVVLVDA GLKSESAIPA EQFKNAQGEL 
EIQVGDEVDV ALDAVEDGFG ETLLSREKAK RHEAWITLEK AYEDAETVTG VINGKVKGGF
TVELNGIRAF LPGSLVDVRP VRDTLHLEGK ELEFKVIKLD QKRNNVVVSR RAVIESENSA
ERDQLLENLQ EGMEVKGIVK NLTDYGAFVD LGGVDGLLHI TDMAWKRVKH PSEIVNVGDE
INVKVLKFDR ERTRVSLGLK QLGEDPWVAI AKRYPEGTKL TGRVTNLTDY GCFVEIEEGV
EGLVHVSEMD WTNKNIHPSK VVNVGDVVEV MVLDIDEERR RISLGLKQCK SNPWQQFAET
HNKGDRVEGK IKSITDFGIF IGLDGGIDGL VHLSDISWNV AGEEAVREYK KGDEIAAVVL
QVDAERERIS LGVKQLAEDP FNNWVALNKK GAIVTGKVTA VDAKGATVEL ADGVEGYLRA
SEASRDRVED ATLVLSVGDD VEAKFTGVDR KNRAISLSVR AKDEADEKDA IATVNKQEDA
NFSNNAMAEA FKAAKGE
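
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns the same amino acids to internal codons as the standard genetic code and differs only in which start codons are allowed, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; codons with unexpected characters become 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Usage: the first three 10-base blocks of the gene sequence.
print(translate("ATGACTGAAT CTTTTGCTCA ACTCTTTGAA"))  # MTESFAQLFE
```

The output matches the first ten residues of the protein sequence. Translating the full gene sequence the same way should give all 557 residues, ending at the terminal TAA stop codon.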