Gene SbBS512_E2417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2417 
SymbolrpsA 
ID6269041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2212792 
End bp2214465 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content51% 
IMG OID641726412 
Product30S ribosomal protein S1 
Protein accessionYP_001880894 
Protein GI187733119 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000560944 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAAT CTTTTGCTCA ACTCTTTGAA GAGTCCTTAA AAGAAATCGA AACCCGCCCG 
GGTTCTATCG TTCGTGGCGT TGTTGTTGCT ATCGACAAAG ACGTAGTACT GGTTGACGCT
GGTCTGAAAT CTGAGTCCGC CATCCCGGCT GAGCAGTTCA AAAACGCCCA GGGCGAGCTG
GAAATCCAGG TAGGTGACGA AGTTGACGTT GCTCTGGACG CAGTAGAAGA CGGCTTCGGT
GAAACTCTGC TGTCCCGTGA GAAAGCTAAA CGTCACGAAG CCTGGATCAC GCTGGAAAAA
GCTTACGAAG ATGCTGAAAC TGTTACCGGT GTTATCAACG GCAAAGTTAA GGGCGGCTTC
ACTGTTGAGC TGAACGGTAT TCGTGCGTTC CTGCCAGGTT CTCTGGTAGA CGTTCGTCCG
GTGCGTGACA CTCTGCACCT GGAAGGCAAA GAGCTTGAAT TTAAAGTAAT CAAGCTGGAT
CAGAAGCGCA ACAACGTTGT TGTTTCTCGT CGTGCCGTTA TCGAATCCGA AAACAGCGCA
GAGCGCGATC AGCTGCTGGA AAACCTGCAG GAAGGCATGG AAGTTAAAGG TATCGTTAAG
AACCTCACTG ACTACGGTGC ATTCGTTGAT CTGGGCGGCG TTGACGGCCT GCTGCACATC
ACTGACATGG CCTGGAAACG CGTTAAGCAT CCGAGCGAAA TCGTCAACGT GGGCGACGAA
ATCACTGTTA AAGTGCTGAA GTTCGACCGC GAACGTACCC GTGTATCCCT GGGCCTGAAA
CAGTTGGGCG AAGATCCGTG GGTAGCTATC GCTAAACGTT ATCCGGAAGG TACCAAACTG
ACTGGTCGCG TGACCAACCT GACCGACTAC GGCTGCTTCG TTGAAATCGA AGAAGGCGTT
GAAGGCCTGG TACACGTTTC CGAAATGGAT TGGACCAACA AAAACATCCA CCCGTCCAAA
GTTGTTAACG TTGGCGATGT AGTGGAAGTT ATGGTTCTGG ATATCGACGA AGAACGTCGT
CGTATCTCCC TGGGTCTGAA ACAGTGCAAA GCTAACCCGT GGCAGCAGTT CGCGGAAACC
CACAACAAGG GCGACCGTGT TGAAGGTAAA ATCAAGTCTA TCACTGACTT CGGTATCTTC
ATCGGCCTGG ACGGCGGCAT CGACGGCTTG GTTCACCTGT CTGACATCTC CTGGAACGTT
GCAGGCGAAG AAGCAGTTCG TGAATACAAA AAAGGCGACG AAATCGCTGC AGTTGTTCTG
CAGGTTGACG CAGAACGTGA ACGTATCTCC CTGGGCGTTA AACAGCTCGC AGAAGATCCG
TTCAACAACT GGGTTGCTCT GAACAAGAAA GGCGCTATCG TAACCGGTAA AGTAACTGCA
GTTGACGCTA AAGGCGCAAC CGTAGAACTG GCTGACGGCG TTGAAGGTTA CCTGCGTGCT
TCTGAAGCAT CCCGTGACCG CGTTGAAGAC GCTACCCTGG TTCTGAGCGT TGGCGACGAA
GTTGAAGCTA AATTCACCGG CGTTGATCGT AAAAACCGCG CAATCAGCCT GTCTGTTCGT
GCGAAAGACG AAGCTGACGA GAAAGATGCA ATCGCAACTG TTAACAAACA GGAAGATGCA
AACTTCTCCA ACAACGCAAT GGCTGAAGCT TTCAAAGCAG CTAAAGGCGA GTAA
 
Protein sequence
MTESFAQLFE ESLKEIETRP GSIVRGVVVA IDKDVVLVDA GLKSESAIPA EQFKNAQGEL 
EIQVGDEVDV ALDAVEDGFG ETLLSREKAK RHEAWITLEK AYEDAETVTG VINGKVKGGF
TVELNGIRAF LPGSLVDVRP VRDTLHLEGK ELEFKVIKLD QKRNNVVVSR RAVIESENSA
ERDQLLENLQ EGMEVKGIVK NLTDYGAFVD LGGVDGLLHI TDMAWKRVKH PSEIVNVGDE
ITVKVLKFDR ERTRVSLGLK QLGEDPWVAI AKRYPEGTKL TGRVTNLTDY GCFVEIEEGV
EGLVHVSEMD WTNKNIHPSK VVNVGDVVEV MVLDIDEERR RISLGLKQCK ANPWQQFAET
HNKGDRVEGK IKSITDFGIF IGLDGGIDGL VHLSDISWNV AGEEAVREYK KGDEIAAVVL
QVDAERERIS LGVKQLAEDP FNNWVALNKK GAIVTGKVTA VDAKGATVEL ADGVEGYLRA
SEASRDRVED ATLVLSVGDE VEAKFTGVDR KNRAISLSVR AKDEADEKDA IATVNKQEDA
NFSNNAMAEA FKAAKGE