Gene Paes_1794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1794 
SymbolrpsA 
ID6458803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1959014 
End bp1960780 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content49% 
IMG OID642725779 
Product30S ribosomal protein S1 
Protein accessionYP_002016454 
Protein GI194334594 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000217401 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.053041 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGAAA CACAAACAAT CGAACAACCG AAGGTAACTG AGTCAGGTCA CGGTAATCAG 
CGGGTCAAAT TTTTTGCAGA GTACGAACTT TCCGAACTTC AGCAGATGGA GAAGCTCTAT
TCGAGCACGC TCAACGAGAT CACTGAAGAA GAGATCGTCA AAGGACGAAT TGTTGGTATT
TCCAACAAAG ACGTCACCAT CGATGTCGGC TACAAATCAG AAGGCATCGT TTCAAAACTT
GAGTTCCGCG ATGAAGATGA GCTCAAAGTC GGCGACGAGG TCGAGGTTTA TCTCGAAAAC
ATCGAAGACA AGATGGGTCA GCTTATTCTC TCCAAGAGGA AAGCAGATGT TCTCAGAATC
TGGGACAAGA TTTACGATTC AATCGAGAAC GATACCATCA TCAACGGAAA GATCATCAAC
CGCGTCAAAG GCGGCATGAC GGTTTCGCTT TCAGGAGTCG AAGCCTTCCT TCCGGGTTCT
CAGATCGATG TCAAGCCCGT CCGCGATTTC GACGCACTCG TCGGTCAGAC CATGGACTTC
AGGGTTGTCA AAATCAATCC GGTCACCCAG AACATCGTTG TCAGTCACAA GGTCATTCTT
GAAGAAGAGT ACGCAGCGAA ACGCGAAGAG ATGCTGGCCA ATATCAAGGT CGGCATGGTT
CTCGAAGGTA CGGTTAAAAA CATCACCGAC TTCGGTATTT TCGTTGATCT TGGCGGTCTT
GACGGTCTTG TTCACATTAC CGATATCACC TGGGGCCGTA TCAACCATCC TTCAGAAGTT
GTCGAACTTG ATCAGCCGAT CAAGGTTGTT GTTGTTGGCT TTGACGAAGA CACCAAGCGT
GTCTCTCTCG GCATGAAGCA GCTCGAGCCT CATCCGTGGG AAAATATCGA GATCAAATAC
CCTGTTGGAA CCAAAACCAA CGGTCGTGTT GTCTCCATTA CCGACTACGG TGCTTTTGTC
GAGATCGAGA AAGGCATCGA GGGTCTTGTT CACATTTCCG AAATGAGCTG GACGCAGCAT
ATCAAGCATC CAAGCCAGTT TGTTTCTCTC GGCCAGGAAG TCGAAGTCGT TATCCTCAAC
ATCGACAAGG ATCACACCAA GCTTTCACTC TCCATGAAAC GCGTCACCGA AGATCCGTGG
ATCGCGCTTT CCGAGAAATA TATCGAAGCG TCCCTGCACA AGGGCACTGT CAGCAACATC
ACCGATTTCG GTGTCTTTGT TGAGCTTGAA CCCGGTGTCG ATGGCCTGGT ACACATTTCA
GACCTCTCAT GGACCAAGAA AATCCGTCAT CCCAGCGAAC TGGTTAAAAA GAATCAGGAT
CTTGAAGTCA AGGTGCTCAA ATTCGACGTC AACGCACGCC GTATCGCCCT TGGTCACAAG
CAGATCAACC CGGATCCATG GGATGAATTC GAGCAGAAAT ACGCAGTCGG CGCCGAATGC
GCAGGCGAGA TCTCCCAGAT CATTGAAAAA GGCGTGATCG TCATCCTTCC TGGTGACGTC
GATGGTTTTG TCCCGGTATC GCACCTGCTT CAGGGCGGTG TCAAGGACAT CCACACCTCA
TTCAAAGTCG GTGATGCACT CCCGCTTCGC GTGATCGAGT TCGACAAAGA GAACAAACGA
ATCATCCTCT CCGCGCTCGA GTACTTCAAA GACAAGAGCA AGGAAGAGAT CGAAGAGTAC
CTGCAGGCTC ATCCGAACGA GAAGAAAGAG ATCGAAGATG CCAGCGCATC ACTCGAATCT
CAGCCAAAAT CCTCGAAAAA GGCCTAA
 
Protein sequence
MSETQTIEQP KVTESGHGNQ RVKFFAEYEL SELQQMEKLY SSTLNEITEE EIVKGRIVGI 
SNKDVTIDVG YKSEGIVSKL EFRDEDELKV GDEVEVYLEN IEDKMGQLIL SKRKADVLRI
WDKIYDSIEN DTIINGKIIN RVKGGMTVSL SGVEAFLPGS QIDVKPVRDF DALVGQTMDF
RVVKINPVTQ NIVVSHKVIL EEEYAAKREE MLANIKVGMV LEGTVKNITD FGIFVDLGGL
DGLVHITDIT WGRINHPSEV VELDQPIKVV VVGFDEDTKR VSLGMKQLEP HPWENIEIKY
PVGTKTNGRV VSITDYGAFV EIEKGIEGLV HISEMSWTQH IKHPSQFVSL GQEVEVVILN
IDKDHTKLSL SMKRVTEDPW IALSEKYIEA SLHKGTVSNI TDFGVFVELE PGVDGLVHIS
DLSWTKKIRH PSELVKKNQD LEVKVLKFDV NARRIALGHK QINPDPWDEF EQKYAVGAEC
AGEISQIIEK GVIVILPGDV DGFVPVSHLL QGGVKDIHTS FKVGDALPLR VIEFDKENKR
IILSALEYFK DKSKEEIEEY LQAHPNEKKE IEDASASLES QPKSSKKA