Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_3034 |
Symbol | rpsA |
ID | 8448647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3328262 |
End bp | 3329719 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645042118 |
Product | 30S ribosomal protein S1 |
Protein accession | YP_003202360 |
Protein GI | 258653204 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0539] Ribosomal protein S1 |
TIGRFAM ID | [TIGR00717] ribosomal protein S1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.000095518 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00573325 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCATCT CTACCGTCGC CGCCCCACAG GTCGCCATCA ACGACATCGG GTCGGCTGAG GACTTCCTCG CCGCCATCGA TTCCACGATC AAGTACTTCA ACGATGGCGA CATCGTCGAA GGCACCATCG TCAAGGTCGA TCGCGATGAG GTCCTGCTCG ACATCGGCTA CAAGACCGAG GGCGTCATCC CCTCGCGTGA GCTGTCCATC AAGCACGATG TCGACCCCAG CGAGGTCGTC GAGATCGGCG AGCACGTCGA GGCCCTCGTC CTGCAGAAGG AGGACAAGGA AGGCCGCCTC ATCCTGTCCA AGAAGCGAGC TCAGTACGAG CGCGCCTGGG GCACGATCGA GAAGATCAAG GAAGAGGACG GCGTCGTCTC CGGCACCGTC ATCGAGGTCG TCAAGGGCGG CCTGATCCTG GACATCGGCC TGCGCGGCTT CCTGCCCGCG TCCCTGGTCG AGATGCGCCG TGTCCGCGAC CTGCAGCCCT ACGTGGGCCG CCAGCTGGAC GCCAAGATCA TCGAGCTGGA CAAGAACCGC AACAACGTGG TGCTGTCCCG CCGGCAGTGG CTCGAGCAGA CCCAGTCCGA GGTGCGCAGC GAGTTCCTCA ACCAGCTGGG CAAGGGCCAG GTCCGCAAGG GCGTCGTGTC CTCCATCGTC AACTTCGGTG CGTTCGTCGA CCTGGGTGGC GTGGACGGCC TGGTGCACGT CTCCGAGCTG TCCTGGAAGC ACATCGACCA CCCGTCCGAG GTCGTCGAGG TCGGCCAGGA GGTCACCGTC GAGGTCCTGG ACGTCGACAT GGACCGCGAG CGCGTCTCGC TGTCGCTCAA GGCGACCCAG GAGGATCCGT GGCGGCACTT CGCCCGGACC CACGCCATCG GCCAGGTCGT CCCCGGCAAG GTCACCAAGC TGGTTCCGTT CGGCGCGTTC GTGCGCGTCT ACGACGGCAT CGAGGGCCTG GTGCACATCT CGGAGCTGGC CGGCCGCCAC GTGGAGGTCC CCGAGCAGGT CGTGACCGTC GACGACGAGA TCTTCGTCCG CGTCATCGAC ATCGACCTGG AGCGTCGTCG GATCTCGCTG TCGCTCAAGC AGGCCAACGA GGGCATCACC GCCGAGACCG AGTTCGACGA CGTGCGGGCC CAGTACGGCG TGGTCGACCA CTACGACGAG CAGGGCAACT TCGTGCCGCC GGAGGGCTTC GACGTCGAGA CCGGCGAGTG GCTCGAGGGC TTCGACACGC AGCGCGAGGC GTGGGAGAAG CAGTACGCCG ACGCGCACGC GGTGTACGAG TCGCACCTCA AGCAGATCGC GGCCGCCCAG GTGGCGGACG CGGAGTCGGC CGAGCCGAGC AACTACTCGT CCGACACGGC TGCGGCCGCC GGCGAGCAGG TCCCGGCCGG TTCGCTGGTC AACGACGAGC AGCTCGCTGC GCTGCGGGAG CGTCTGGCCG GTAACTGA
|
Protein sequence | MSISTVAAPQ VAINDIGSAE DFLAAIDSTI KYFNDGDIVE GTIVKVDRDE VLLDIGYKTE GVIPSRELSI KHDVDPSEVV EIGEHVEALV LQKEDKEGRL ILSKKRAQYE RAWGTIEKIK EEDGVVSGTV IEVVKGGLIL DIGLRGFLPA SLVEMRRVRD LQPYVGRQLD AKIIELDKNR NNVVLSRRQW LEQTQSEVRS EFLNQLGKGQ VRKGVVSSIV NFGAFVDLGG VDGLVHVSEL SWKHIDHPSE VVEVGQEVTV EVLDVDMDRE RVSLSLKATQ EDPWRHFART HAIGQVVPGK VTKLVPFGAF VRVYDGIEGL VHISELAGRH VEVPEQVVTV DDEIFVRVID IDLERRRISL SLKQANEGIT AETEFDDVRA QYGVVDHYDE QGNFVPPEGF DVETGEWLEG FDTQREAWEK QYADAHAVYE SHLKQIAAAQ VADAESAEPS NYSSDTAAAA GEQVPAGSLV NDEQLAALRE RLAGN
|
| |