Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2417 |
Symbol | rpsA |
ID | 6269041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2212792 |
End bp | 2214465 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641726412 |
Product | 30S ribosomal protein S1 |
Protein accession | YP_001880894 |
Protein GI | 187733119 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0539] Ribosomal protein S1 |
TIGRFAM ID | [TIGR00717] ribosomal protein S1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000000560944 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAAT CTTTTGCTCA ACTCTTTGAA GAGTCCTTAA AAGAAATCGA AACCCGCCCG GGTTCTATCG TTCGTGGCGT TGTTGTTGCT ATCGACAAAG ACGTAGTACT GGTTGACGCT GGTCTGAAAT CTGAGTCCGC CATCCCGGCT GAGCAGTTCA AAAACGCCCA GGGCGAGCTG GAAATCCAGG TAGGTGACGA AGTTGACGTT GCTCTGGACG CAGTAGAAGA CGGCTTCGGT GAAACTCTGC TGTCCCGTGA GAAAGCTAAA CGTCACGAAG CCTGGATCAC GCTGGAAAAA GCTTACGAAG ATGCTGAAAC TGTTACCGGT GTTATCAACG GCAAAGTTAA GGGCGGCTTC ACTGTTGAGC TGAACGGTAT TCGTGCGTTC CTGCCAGGTT CTCTGGTAGA CGTTCGTCCG GTGCGTGACA CTCTGCACCT GGAAGGCAAA GAGCTTGAAT TTAAAGTAAT CAAGCTGGAT CAGAAGCGCA ACAACGTTGT TGTTTCTCGT CGTGCCGTTA TCGAATCCGA AAACAGCGCA GAGCGCGATC AGCTGCTGGA AAACCTGCAG GAAGGCATGG AAGTTAAAGG TATCGTTAAG AACCTCACTG ACTACGGTGC ATTCGTTGAT CTGGGCGGCG TTGACGGCCT GCTGCACATC ACTGACATGG CCTGGAAACG CGTTAAGCAT CCGAGCGAAA TCGTCAACGT GGGCGACGAA ATCACTGTTA AAGTGCTGAA GTTCGACCGC GAACGTACCC GTGTATCCCT GGGCCTGAAA CAGTTGGGCG AAGATCCGTG GGTAGCTATC GCTAAACGTT ATCCGGAAGG TACCAAACTG ACTGGTCGCG TGACCAACCT GACCGACTAC GGCTGCTTCG TTGAAATCGA AGAAGGCGTT GAAGGCCTGG TACACGTTTC CGAAATGGAT TGGACCAACA AAAACATCCA CCCGTCCAAA GTTGTTAACG TTGGCGATGT AGTGGAAGTT ATGGTTCTGG ATATCGACGA AGAACGTCGT CGTATCTCCC TGGGTCTGAA ACAGTGCAAA GCTAACCCGT GGCAGCAGTT CGCGGAAACC CACAACAAGG GCGACCGTGT TGAAGGTAAA ATCAAGTCTA TCACTGACTT CGGTATCTTC ATCGGCCTGG ACGGCGGCAT CGACGGCTTG GTTCACCTGT CTGACATCTC CTGGAACGTT GCAGGCGAAG AAGCAGTTCG TGAATACAAA AAAGGCGACG AAATCGCTGC AGTTGTTCTG CAGGTTGACG CAGAACGTGA ACGTATCTCC CTGGGCGTTA AACAGCTCGC AGAAGATCCG TTCAACAACT GGGTTGCTCT GAACAAGAAA GGCGCTATCG TAACCGGTAA AGTAACTGCA GTTGACGCTA AAGGCGCAAC CGTAGAACTG GCTGACGGCG TTGAAGGTTA CCTGCGTGCT TCTGAAGCAT CCCGTGACCG CGTTGAAGAC GCTACCCTGG TTCTGAGCGT TGGCGACGAA GTTGAAGCTA AATTCACCGG CGTTGATCGT AAAAACCGCG CAATCAGCCT GTCTGTTCGT GCGAAAGACG AAGCTGACGA GAAAGATGCA ATCGCAACTG TTAACAAACA GGAAGATGCA AACTTCTCCA ACAACGCAAT GGCTGAAGCT TTCAAAGCAG CTAAAGGCGA GTAA
|
Protein sequence | MTESFAQLFE ESLKEIETRP GSIVRGVVVA IDKDVVLVDA GLKSESAIPA EQFKNAQGEL EIQVGDEVDV ALDAVEDGFG ETLLSREKAK RHEAWITLEK AYEDAETVTG VINGKVKGGF TVELNGIRAF LPGSLVDVRP VRDTLHLEGK ELEFKVIKLD QKRNNVVVSR RAVIESENSA ERDQLLENLQ EGMEVKGIVK NLTDYGAFVD LGGVDGLLHI TDMAWKRVKH PSEIVNVGDE ITVKVLKFDR ERTRVSLGLK QLGEDPWVAI AKRYPEGTKL TGRVTNLTDY GCFVEIEEGV EGLVHVSEMD WTNKNIHPSK VVNVGDVVEV MVLDIDEERR RISLGLKQCK ANPWQQFAET HNKGDRVEGK IKSITDFGIF IGLDGGIDGL VHLSDISWNV AGEEAVREYK KGDEIAAVVL QVDAERERIS LGVKQLAEDP FNNWVALNKK GAIVTGKVTA VDAKGATVEL ADGVEGYLRA SEASRDRVED ATLVLSVGDE VEAKFTGVDR KNRAISLSVR AKDEADEKDA IATVNKQEDA NFSNNAMAEA FKAAKGE
|
| |