Gene Noc_0180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0180 
SymbolrpsA 
ID3706213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp199616 
End bp201301 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content49% 
IMG OID637736697 
Product30S ribosomal protein S1 
Protein accessionYP_342243 
Protein GI77163718 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000762655 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGAAA GCTTTGCAGA GTTGTTTGAG GAAAGCCAGG CGGATATGCC GATGGCGCCA 
GGTTCTATTG TAACGGGCAC CGTCATGGAA ATTCGTTCCG ACGTTGTGAT TGTCAACGCC
GGATTTAAGT CCGAGGGAGT GATCTCTGCG GAACAGTTCC GTGATGAAAA GGGAAATTTG
AATATCTCTG AGGGTGATTT AGTCGAAGTC GCTCTGGAAG CCATTGAGGA TGGTTTTGGC
GAGACTCGTC TTTCCCGCCA GAAGGCCAAG GAAGCTCGGG TTTGGAATGA TCTTGAAGGA
GCCTTTGAGG CAGGCGAAAC CATTACCGGC GTGTTGACTG GCAAGGTCAA AGGCGGTTTT
ACAGTGGATT TAAATGGGGT TCGTGCGTTT CTACCGGGCT CATTGGTAGA TGCCCGCCCC
GTGCGAGACA CTGCCTATCT TGAGAGTAAA GAACTTGAAT TTAAACTCAT CAAGCTGGAT
CGTCGGCGCA ACAATATAGT GGTTTCACGC CGTGCAGTTA TGGAAGCCGA GTACAGTGCA
GAGCGAGAGG CGCTCTTATC TAGTTTAGAG GAAGGAAAGA CGGTGAAGGG CGTCGTCAAA
AATCTTACTG ATTATGGCGC CTTCGTGGAC TTGGGAGGGT TAGATGGATT GCTCCACATC
ACGGATATGT CGTGGAAACG AATCAAGCAT CCTTCCGAAG TAGTGAATAT TGGCGATGAC
ATTACCGTTC AGGTGCTCAA ATTCGATAGA GAGCGCCAGC GCGTCTCTCT CGGGCTGAAG
CAAATGGGAG AAGATCCCTG GAAAGATTTG GCGCGGCGTT ATCTTGACGG TACTCGCCTC
TTTGGTAAGG TAACGAATGT TACCGACTAT GGTTGTTTTG TGGAGATTGA AGAAGGGGTA
GAGGGTTTGG TTCATATGTC AGAGATGGAC TGGACCAATA AAAATATCCA CCCCTCGAAG
ATGGTTCAAG TTGGTGAGGA AGTTGAGGTG ATGGTGCTTG ATATTGATGA GGAGCGCCGT
CGTATTTCCC TAGGAATGAA GCAATGCCTA CCTAATCCAT GGGAAGAGTT TGCCCATAGA
TATAATAAGG ATGATCGCGT AGCTGGCGAG ATTAAATCCA TTACCGACTT TGGCATTTTT
GTCGGCTTAG AGGGGGGCAT TGATGGCCTG GTTCATTTAT CGGATATTTC CTGGTCGGCT
TCGGGCGAAG AGATAATCCG TGATTATAAG AAAGGCGATC AGGTTGAGGC GGTTGTTCTG
GCCATTGATC CAGAGCGAGA GCGCATTTCT CTAGGAGTCA AGCAACTTGA GGACGATCCT
TTCTCCAGCT ATATTGCGAC CTGCCCTAAA GGCAGCATAG TTAAGGGAAT CGTCAAGGTG
GTTGATACTA GAGGCGCGGT CATTGAGTTG GCTGAGGGTG TGGAAGGACA TCTGCGCGCC
TCAGAAATTG CTAGAGAGCG TATAGATGAT GCCCGTACTT CCCTTAATGT GGGGGACTCA
ATAGAGGCCA AGTTCACTGG CATCGATCGT AAGAATCGTG TAATTACCCT CTCCGTTCGT
GCTAAAGATG TAGAAGAAGA GGCGGAAGCT ATTAAGGAGT ACTCCGGAAC GGGTGCGGAA
GCTGCTTCGA CTACACTGGG CGATATCCTC AAGGAGCAAA TGGAAAGGCA AGAGGAAGAA
GGCTAA
 
Protein sequence
MGESFAELFE ESQADMPMAP GSIVTGTVME IRSDVVIVNA GFKSEGVISA EQFRDEKGNL 
NISEGDLVEV ALEAIEDGFG ETRLSRQKAK EARVWNDLEG AFEAGETITG VLTGKVKGGF
TVDLNGVRAF LPGSLVDARP VRDTAYLESK ELEFKLIKLD RRRNNIVVSR RAVMEAEYSA
EREALLSSLE EGKTVKGVVK NLTDYGAFVD LGGLDGLLHI TDMSWKRIKH PSEVVNIGDD
ITVQVLKFDR ERQRVSLGLK QMGEDPWKDL ARRYLDGTRL FGKVTNVTDY GCFVEIEEGV
EGLVHMSEMD WTNKNIHPSK MVQVGEEVEV MVLDIDEERR RISLGMKQCL PNPWEEFAHR
YNKDDRVAGE IKSITDFGIF VGLEGGIDGL VHLSDISWSA SGEEIIRDYK KGDQVEAVVL
AIDPERERIS LGVKQLEDDP FSSYIATCPK GSIVKGIVKV VDTRGAVIEL AEGVEGHLRA
SEIARERIDD ARTSLNVGDS IEAKFTGIDR KNRVITLSVR AKDVEEEAEA IKEYSGTGAE
AASTTLGDIL KEQMERQEEE G