Gene Saro_1322 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1322 
SymbolrpsA 
ID3917771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1365649 
End bp1367349 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content65% 
IMG OID640444059 
Product30S ribosomal protein S1 
Protein accessionYP_496600 
Protein GI87199343 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.85252 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAGCA ATCCCTCGCG CGACGATTTC GCCGCGCTTC TTGACGAATC GCTTGGCGGC 
GCCGCCAACG GTGGCTTCGA AGGCCGCGTC GTCAAGGGCA CCATCACCGC CATCGAAAAC
GACAAGGCCG TCATCGACGT GGGCCTGAAG AGCGAGGGCC GCGTTGCCCT GCGCGAATTC
GCCGCCCCCG GCCAGCCGCA CGGCCTCAAG GTCGGCGACG AAGTCGAAGT CTACGTCGAC
CGCGTCGAGA ACGCCGACGG CGAAGCCATG CTGTCGCGCG ACCGCGCTCG CCGCGAAGCC
GCGTGGGACA AGCTGGAAAG CGAATTTGGC GAAGGCAAGC GCGTTGAAGG CGTGATCTTC
GGCCGCGTGA AGGGTGGCTT CACCGTCGAC CTCGACGGCG CCGTGGCCTT CCTCCCCGGC
TCGCAGGTCG ACATCCGCCC GGTCCGCGAC GTGACCCCGC TGATGGACAT GCCGCAGCCG
TTCCAGATCC TCAAGATGGA CCGCCGCCGC GGCAACATCG TCGTCTCGCG CCGCGCGGTG
CTGGAAGAAA CCCGCGCCGA ACAGCGCTCG GGCCTGATCC AGAACCTCAA GGAAGGCCAG
ATCATCGACG GCGTCGTCAA GAACATCACC GACTACGGTG CGTTCGTCGA CCTCGGCGGC
ATCGACGGCC TGCTCCATGT CACCGACATG AGCTACAAGC GCGTCAACCA CCCGTCGGAA
GTGATCGCCA TCGGCGATAC CGTCCGCGTC CAGATCATCC GCATCAACCA GGACACGCAG
CGCATCAGCC TCGGCATGAA GCAGCTTGAA AGCGATCCGT GGGATGGCGT CGCCGCCAAG
TACCCGGTCG GCGCGAAGCT GCGTGGCACT GTCACCAACA TCACCGAATA CGGCGCGTTC
GTCGAGCTGG AAGCCGGCAT CGAAGGCCTC GTCCACGTTT CGGAAATGTC CTGGACCAAG
AAGAACGTCC ACCCCGGCAA GATCGTCTCG ACCTCGCAGG AAGTCGACGT CATGGTGCTG
GAAGTCGACA GCGACAAGCG CCGCATCAGC CTCGGCCTCA AGCAGGCCCA GCAGAACCCC
TGGGAAGCCT TTGCAGAAAA GCACCCGGTC GGTTCGACCG TGGAAGGCGA AGTCAAGAAC
GCGACCGAAT TCGGCCTGTT CATCGGCCTC GACGGCGACG TCGACGGCAT GGTCCACATG
TCGGACATCG CCTGGGGCAT CTCGGGCGAG GACGCGCTGG CGCTGCACCG CAAGGGCGAG
CAGGTCTCGG CCGTGGTTCT CGACGTCGAC GTCGAGAAGG AACGCATCAG CCTCGGCATG
AAGCAGCTTG AAAAGGGCGC TCCGGCGGCC GGCGGCGTTG CTTCCTCGGG CTCGCTGCGT
CGTGGCGAAG TCGTCACCGT CACCGTTCTC GAAGTCCGCG ATGGCGGCCT CGAAGTGCAG
GCTGGCGAAG ACGGCGCGAC CGGCTTCATC AAGCGCTCGG ACCTCGGCCG CGACCGCGAC
GAGCAGCGTC CGGACCGCTT CCAGGTCGGC CAGAAGATCG ACGCCATGGT CACCGGCTTC
GATCGTTCGA AGAAGCCGAA CTTCTCGGTC AAGGCGCGCC AGCTCGCAGA AGAGAAGGAA
GCCGTGGAAC AGTACGGCTC GTCGGATGCC GGCGCTTCGC TGGGCGACAT CCTCGGCGCC
GCGCTGAAGG CGAAGCAGTA A
 
Protein sequence
MASNPSRDDF AALLDESLGG AANGGFEGRV VKGTITAIEN DKAVIDVGLK SEGRVALREF 
AAPGQPHGLK VGDEVEVYVD RVENADGEAM LSRDRARREA AWDKLESEFG EGKRVEGVIF
GRVKGGFTVD LDGAVAFLPG SQVDIRPVRD VTPLMDMPQP FQILKMDRRR GNIVVSRRAV
LEETRAEQRS GLIQNLKEGQ IIDGVVKNIT DYGAFVDLGG IDGLLHVTDM SYKRVNHPSE
VIAIGDTVRV QIIRINQDTQ RISLGMKQLE SDPWDGVAAK YPVGAKLRGT VTNITEYGAF
VELEAGIEGL VHVSEMSWTK KNVHPGKIVS TSQEVDVMVL EVDSDKRRIS LGLKQAQQNP
WEAFAEKHPV GSTVEGEVKN ATEFGLFIGL DGDVDGMVHM SDIAWGISGE DALALHRKGE
QVSAVVLDVD VEKERISLGM KQLEKGAPAA GGVASSGSLR RGEVVTVTVL EVRDGGLEVQ
AGEDGATGFI KRSDLGRDRD EQRPDRFQVG QKIDAMVTGF DRSKKPNFSV KARQLAEEKE
AVEQYGSSDA GASLGDILGA ALKAKQ