Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1322 |
Symbol | rpsA |
ID | 3917771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 1365649 |
End bp | 1367349 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640444059 |
Product | 30S ribosomal protein S1 |
Protein accession | YP_496600 |
Protein GI | 87199343 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0539] Ribosomal protein S1 |
TIGRFAM ID | [TIGR00717] ribosomal protein S1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.85252 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAGCA ATCCCTCGCG CGACGATTTC GCCGCGCTTC TTGACGAATC GCTTGGCGGC GCCGCCAACG GTGGCTTCGA AGGCCGCGTC GTCAAGGGCA CCATCACCGC CATCGAAAAC GACAAGGCCG TCATCGACGT GGGCCTGAAG AGCGAGGGCC GCGTTGCCCT GCGCGAATTC GCCGCCCCCG GCCAGCCGCA CGGCCTCAAG GTCGGCGACG AAGTCGAAGT CTACGTCGAC CGCGTCGAGA ACGCCGACGG CGAAGCCATG CTGTCGCGCG ACCGCGCTCG CCGCGAAGCC GCGTGGGACA AGCTGGAAAG CGAATTTGGC GAAGGCAAGC GCGTTGAAGG CGTGATCTTC GGCCGCGTGA AGGGTGGCTT CACCGTCGAC CTCGACGGCG CCGTGGCCTT CCTCCCCGGC TCGCAGGTCG ACATCCGCCC GGTCCGCGAC GTGACCCCGC TGATGGACAT GCCGCAGCCG TTCCAGATCC TCAAGATGGA CCGCCGCCGC GGCAACATCG TCGTCTCGCG CCGCGCGGTG CTGGAAGAAA CCCGCGCCGA ACAGCGCTCG GGCCTGATCC AGAACCTCAA GGAAGGCCAG ATCATCGACG GCGTCGTCAA GAACATCACC GACTACGGTG CGTTCGTCGA CCTCGGCGGC ATCGACGGCC TGCTCCATGT CACCGACATG AGCTACAAGC GCGTCAACCA CCCGTCGGAA GTGATCGCCA TCGGCGATAC CGTCCGCGTC CAGATCATCC GCATCAACCA GGACACGCAG CGCATCAGCC TCGGCATGAA GCAGCTTGAA AGCGATCCGT GGGATGGCGT CGCCGCCAAG TACCCGGTCG GCGCGAAGCT GCGTGGCACT GTCACCAACA TCACCGAATA CGGCGCGTTC GTCGAGCTGG AAGCCGGCAT CGAAGGCCTC GTCCACGTTT CGGAAATGTC CTGGACCAAG AAGAACGTCC ACCCCGGCAA GATCGTCTCG ACCTCGCAGG AAGTCGACGT CATGGTGCTG GAAGTCGACA GCGACAAGCG CCGCATCAGC CTCGGCCTCA AGCAGGCCCA GCAGAACCCC TGGGAAGCCT TTGCAGAAAA GCACCCGGTC GGTTCGACCG TGGAAGGCGA AGTCAAGAAC GCGACCGAAT TCGGCCTGTT CATCGGCCTC GACGGCGACG TCGACGGCAT GGTCCACATG TCGGACATCG CCTGGGGCAT CTCGGGCGAG GACGCGCTGG CGCTGCACCG CAAGGGCGAG CAGGTCTCGG CCGTGGTTCT CGACGTCGAC GTCGAGAAGG AACGCATCAG CCTCGGCATG AAGCAGCTTG AAAAGGGCGC TCCGGCGGCC GGCGGCGTTG CTTCCTCGGG CTCGCTGCGT CGTGGCGAAG TCGTCACCGT CACCGTTCTC GAAGTCCGCG ATGGCGGCCT CGAAGTGCAG GCTGGCGAAG ACGGCGCGAC CGGCTTCATC AAGCGCTCGG ACCTCGGCCG CGACCGCGAC GAGCAGCGTC CGGACCGCTT CCAGGTCGGC CAGAAGATCG ACGCCATGGT CACCGGCTTC GATCGTTCGA AGAAGCCGAA CTTCTCGGTC AAGGCGCGCC AGCTCGCAGA AGAGAAGGAA GCCGTGGAAC AGTACGGCTC GTCGGATGCC GGCGCTTCGC TGGGCGACAT CCTCGGCGCC GCGCTGAAGG CGAAGCAGTA A
|
Protein sequence | MASNPSRDDF AALLDESLGG AANGGFEGRV VKGTITAIEN DKAVIDVGLK SEGRVALREF AAPGQPHGLK VGDEVEVYVD RVENADGEAM LSRDRARREA AWDKLESEFG EGKRVEGVIF GRVKGGFTVD LDGAVAFLPG SQVDIRPVRD VTPLMDMPQP FQILKMDRRR GNIVVSRRAV LEETRAEQRS GLIQNLKEGQ IIDGVVKNIT DYGAFVDLGG IDGLLHVTDM SYKRVNHPSE VIAIGDTVRV QIIRINQDTQ RISLGMKQLE SDPWDGVAAK YPVGAKLRGT VTNITEYGAF VELEAGIEGL VHVSEMSWTK KNVHPGKIVS TSQEVDVMVL EVDSDKRRIS LGLKQAQQNP WEAFAEKHPV GSTVEGEVKN ATEFGLFIGL DGDVDGMVHM SDIAWGISGE DALALHRKGE QVSAVVLDVD VEKERISLGM KQLEKGAPAA GGVASSGSLR RGEVVTVTVL EVRDGGLEVQ AGEDGATGFI KRSDLGRDRD EQRPDRFQVG QKIDAMVTGF DRSKKPNFSV KARQLAEEKE AVEQYGSSDA GASLGDILGA ALKAKQ
|
| |