Gene Information |
Locus tag | Noc_0180 |
Symbol | rpsA |
ID | 3706213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 199616 |
End bp | 201301 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637736697 |
Product | 30S ribosomal protein S1 |
Protein accession | YP_342243 |
Protein GI | 77163718 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0539] Ribosomal protein S1 |
TIGRFAM ID | [TIGR00717] ribosomal protein S1 |
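The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 199616
end_bp = 201301

# Assumption: coordinates are 1-based and inclusive, so the span includes both ends.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length:    {gene_length} bp")
print(f"Codons:         {codon_count}")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1686 bp) and Protein Length (561 aa) fields of the record.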
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000762655 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGAAA GCTTTGCAGA GTTGTTTGAG GAAAGCCAGG CGGATATGCC GATGGCGCCA GGTTCTATTG TAACGGGCAC CGTCATGGAA ATTCGTTCCG ACGTTGTGAT TGTCAACGCC GGATTTAAGT CCGAGGGAGT GATCTCTGCG GAACAGTTCC GTGATGAAAA GGGAAATTTG AATATCTCTG AGGGTGATTT AGTCGAAGTC GCTCTGGAAG CCATTGAGGA TGGTTTTGGC GAGACTCGTC TTTCCCGCCA GAAGGCCAAG GAAGCTCGGG TTTGGAATGA TCTTGAAGGA GCCTTTGAGG CAGGCGAAAC CATTACCGGC GTGTTGACTG GCAAGGTCAA AGGCGGTTTT ACAGTGGATT TAAATGGGGT TCGTGCGTTT CTACCGGGCT CATTGGTAGA TGCCCGCCCC GTGCGAGACA CTGCCTATCT TGAGAGTAAA GAACTTGAAT TTAAACTCAT CAAGCTGGAT CGTCGGCGCA ACAATATAGT GGTTTCACGC CGTGCAGTTA TGGAAGCCGA GTACAGTGCA GAGCGAGAGG CGCTCTTATC TAGTTTAGAG GAAGGAAAGA CGGTGAAGGG CGTCGTCAAA AATCTTACTG ATTATGGCGC CTTCGTGGAC TTGGGAGGGT TAGATGGATT GCTCCACATC ACGGATATGT CGTGGAAACG AATCAAGCAT CCTTCCGAAG TAGTGAATAT TGGCGATGAC ATTACCGTTC AGGTGCTCAA ATTCGATAGA GAGCGCCAGC GCGTCTCTCT CGGGCTGAAG CAAATGGGAG AAGATCCCTG GAAAGATTTG GCGCGGCGTT ATCTTGACGG TACTCGCCTC TTTGGTAAGG TAACGAATGT TACCGACTAT GGTTGTTTTG TGGAGATTGA AGAAGGGGTA GAGGGTTTGG TTCATATGTC AGAGATGGAC TGGACCAATA AAAATATCCA CCCCTCGAAG ATGGTTCAAG TTGGTGAGGA AGTTGAGGTG ATGGTGCTTG ATATTGATGA GGAGCGCCGT CGTATTTCCC TAGGAATGAA GCAATGCCTA CCTAATCCAT GGGAAGAGTT TGCCCATAGA TATAATAAGG ATGATCGCGT AGCTGGCGAG ATTAAATCCA TTACCGACTT TGGCATTTTT GTCGGCTTAG AGGGGGGCAT TGATGGCCTG GTTCATTTAT CGGATATTTC CTGGTCGGCT TCGGGCGAAG AGATAATCCG TGATTATAAG AAAGGCGATC AGGTTGAGGC GGTTGTTCTG GCCATTGATC CAGAGCGAGA GCGCATTTCT CTAGGAGTCA AGCAACTTGA GGACGATCCT TTCTCCAGCT ATATTGCGAC CTGCCCTAAA GGCAGCATAG TTAAGGGAAT CGTCAAGGTG GTTGATACTA GAGGCGCGGT CATTGAGTTG GCTGAGGGTG TGGAAGGACA TCTGCGCGCC TCAGAAATTG CTAGAGAGCG TATAGATGAT GCCCGTACTT CCCTTAATGT GGGGGACTCA ATAGAGGCCA AGTTCACTGG CATCGATCGT AAGAATCGTG TAATTACCCT CTCCGTTCGT GCTAAAGATG TAGAAGAAGA GGCGGAAGCT ATTAAGGAGT ACTCCGGAAC GGGTGCGGAA GCTGCTTCGA CTACACTGGG CGATATCCTC AAGGAGCAAA TGGAAAGGCA AGAGGAAGAA GGCTAA
|
Protein sequence | MGESFAELFE ESQADMPMAP GSIVTGTVME IRSDVVIVNA GFKSEGVISA EQFRDEKGNL NISEGDLVEV ALEAIEDGFG ETRLSRQKAK EARVWNDLEG AFEAGETITG VLTGKVKGGF TVDLNGVRAF LPGSLVDARP VRDTAYLESK ELEFKLIKLD RRRNNIVVSR RAVMEAEYSA EREALLSSLE EGKTVKGVVK NLTDYGAFVD LGGLDGLLHI TDMSWKRIKH PSEVVNIGDD ITVQVLKFDR ERQRVSLGLK QMGEDPWKDL ARRYLDGTRL FGKVTNVTDY GCFVEIEEGV EGLVHMSEMD WTNKNIHPSK MVQVGEEVEV MVLDIDEERR RISLGMKQCL PNPWEEFAHR YNKDDRVAGE IKSITDFGIF VGLEGGIDGL VHLSDISWSA SGEEIIRDYK KGDQVEAVVL AIDPERERIS LGVKQLEDDP FSSYIATCPK GSIVKGIVKV VDTRGAVIEL AEGVEGHLRA SEIARERIDD ARTSLNVGDS IEAKFTGIDR KNRVITLSVR AKDVEEEAEA IKEYSGTGAE AASTTLGDIL KEQMERQEEE G
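The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in the set of permitted start codons, so a standard codon table is sufficient here because this gene starts with ATG. The helper `translate` and the two short excerpts are illustrative; they are taken from the start and end of the gene sequence above rather than the full 1686 bp.

```python
from itertools import product

# Standard codon assignments in NCBI order (first, second, third base over T, C, A, G).
# Table 11 shares these assignments; it differs only in allowed start codons,
# which this sketch does not handle.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence (codons 1 to 10).
first_codons = "ATGGGCGAAA GCTTTGCAGA GTTGTTTGAG"
# Last 12 bases of the gene sequence (codons 559 to 562, ending in the TAA stop).
last_codons = "GAAGAA GGCTAA"

print(translate(first_codons))
print(translate(last_codons))
```

The first excerpt should reproduce the opening residues of the protein sequence and the second its final three residues, with the terminal TAA read as the stop codon.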