Gene Sare_3369 details

Gene Information

Locus tag: Sare_3369
Symbol: rpsA
ID: 5707993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3890376
End bp: 3891857
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 67%
IMG OID: 641272795
Product: 30S ribosomal protein S1
Protein accession: YP_001538162
Protein GI: 159038909
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0539] Ribosomal protein S1
TIGRFAM ID: [TIGR00717] ribosomal protein S1
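
The start, end, and length values listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single stop codon at the end of the CDS; the record does not state either convention explicitly, and the variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 3890376
end_bp = 3891857

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not
# counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # matches the reported Gene Length
print(protein_length, "aa")  # matches the reported Protein Length
```

Under these assumptions the computed values agree with the reported gene length of 1482 bp and protein length of 493 aa.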


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.412554
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0109188
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAGCA GCATCGAGGC CCCCTCGAGC GCCACCAAGG TCACCGTCGA CGATCTCGGG 
AGCGAAGAGG CTTTCCTCGC CGCGATCGAC GAGACCATCA AGTACTTCAA CGACGGCGAC
ATTGTCGAAG GCACCGTCGT CAAGGTCGAT CGGGACGAGG TCCTGCTCGA CATCGGCTAC
AAGACCGAGG GCGTCATCCC CTCTCGAGAG CTGTCGATCA AGCACGACGT GGACCCCGCC
GAGGTTGTCT CGGTCGGTGA CCACATCGAG GCCCTTGTCC TTCAGAAGGA GGACAAGGAG
GGGCGTCTGA TCCTCTCGAA GAAGCGGGCG CAGTACGAGC GCGCCTGGGG CACGATCGAG
AAGATCAAGG ACGAGGACGG TGTCGTCCGT GGCTCGGTCA TCGAGGTGGT CAAGGGTGGC
CTGATCCTCG ACATCGGGCT GCGTGGCTTC CTGCCGGCTT CCCTGGTCGA GATGCGCCGC
GTGCGGGACC TCCAGCCGTA TGTCGGACGC GAGCTCGAAG CCAAGATCAT CGAGTTGGAC
AAGAACCGCA ACAACGTGGT TCTGTCCCGC CGGGCCTGGC TCGAGCAGAC GCAGTCCGAG
GTGCGCACCG AGTTCCTCAA CAAGCTGCAG AAGGGTCAGG TCCGCAAGGG CGTCGTGTCG
TCGATCGTCA ACTTCGGTGC GTTCGTCGAC CTGGGCGGCG TGGACGGCCT GGTGCACGTC
TCCGAGTTGT CCTGGAAGCA CATCGACCAC CCGTCCGAGG TGGTCGAGGT GGGCCAGGAG
GTCGAGGTCG AGGTCCTGGA CGTTGACCTG GACCGCGAGC GGGTCTCGCT GTCGCTGAAG
GCGACCCAGG AGGACCCGTG GCGCCAGTTC GCCCGCACCC ACGCGATCCA GCAGATCGTG
CCCGGTAAGG TCACCAAGCT GGTGCCGTTC GGCGCGTTCG TCCGGGTCGA CGACGGCATT
GAGGGCCTGG TCCACATCTC CGAGCTGGCC GAGCGGCACG TGGAGATCCC GGAGCAGGTC
GTTCAGGTCG GCTCCGAGGT CATGGTCAAG GTCATCGACA TCGACCTGGA GCGGCGCCGG
ATCTCGCTGT CGCTCAAGCA GGCCAACGAG GGCTTCGTCG AGGGTGAGGA GCACTTCGAC
CCGACCCTCT ACGGCATGGC CGCGACCTAC GACGCCGAGG GCAACTACAT CTACCCCGAG
GGTTTCGACC CGGAGACGGG CGAGTGGCTC GAGGGCTACG AGAAGCAGCG CGAGACCTGG
GAGCAGCAGT ACGCCGAGGC GCGTCAGCGC TGGGAGGCCC ACCAGAAGCA GGTGCAGGCA
TCCCGCGCCG CCGACGCCGA GGCCGCTGCC AACCCGCCGG CTGCCGGCCC CACCACCACG
ACGACCGCGG CCCCGAGCCG GCCGGCGGAG GAGCCGGCTG GCACCCTCGC CACCGACGAG
GCGCTCGCCG CCCTGCGGGA GAAGCTCGCT GGCGGCAAGT GA
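
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, not the database's own method; `gc_percent` is a hypothetical helper name, and it assumes the sequence contains only A, C, G, and T plus the spacing used in the listing above.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks used in the
    listing) is removed before counting.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Example: the first line of the gene sequence above.
first_line = "ATGACGAGCA GCATCGAGGC CCCCTCGAGC GCCACCAAGG TCACCGTCGA CGATCTCGGG"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence instead of the first line gives the value to compare against the GC content in the record.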
 
Protein sequence
MTSSIEAPSS ATKVTVDDLG SEEAFLAAID ETIKYFNDGD IVEGTVVKVD RDEVLLDIGY 
KTEGVIPSRE LSIKHDVDPA EVVSVGDHIE ALVLQKEDKE GRLILSKKRA QYERAWGTIE
KIKDEDGVVR GSVIEVVKGG LILDIGLRGF LPASLVEMRR VRDLQPYVGR ELEAKIIELD
KNRNNVVLSR RAWLEQTQSE VRTEFLNKLQ KGQVRKGVVS SIVNFGAFVD LGGVDGLVHV
SELSWKHIDH PSEVVEVGQE VEVEVLDVDL DRERVSLSLK ATQEDPWRQF ARTHAIQQIV
PGKVTKLVPF GAFVRVDDGI EGLVHISELA ERHVEIPEQV VQVGSEVMVK VIDIDLERRR
ISLSLKQANE GFVEGEEHFD PTLYGMAATY DAEGNYIYPE GFDPETGEWL EGYEKQRETW
EQQYAEARQR WEAHQKQVQA SRAADAEAAA NPPAAGPTTT TTAAPSRPAE EPAGTLATDE
ALAALREKLA GGK
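
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand rather than relying on a library; the amino acid assignments of translation table 11 are the same as those of the standard code (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cleaned), 3):
        amino_acid = CODON_TABLE[cleaned[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGACGAGCA GCATCGAGGC CCCCTCGAGC"))  # MTSSIEAPSS
print(translate("GGCGGCAAGT GA"))                      # GGK
```

The two fragments translate to the first ten and last three residues of the protein sequence listed above.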