Gene Namu_3034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3034 
SymbolrpsA 
ID8448647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3328262 
End bp3329719 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content67% 
IMG OID645042118 
Product30S ribosomal protein S1 
Protein accessionYP_003202360 
Protein GI258653204 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.000095518 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00573325 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCATCT CTACCGTCGC CGCCCCACAG GTCGCCATCA ACGACATCGG GTCGGCTGAG 
GACTTCCTCG CCGCCATCGA TTCCACGATC AAGTACTTCA ACGATGGCGA CATCGTCGAA
GGCACCATCG TCAAGGTCGA TCGCGATGAG GTCCTGCTCG ACATCGGCTA CAAGACCGAG
GGCGTCATCC CCTCGCGTGA GCTGTCCATC AAGCACGATG TCGACCCCAG CGAGGTCGTC
GAGATCGGCG AGCACGTCGA GGCCCTCGTC CTGCAGAAGG AGGACAAGGA AGGCCGCCTC
ATCCTGTCCA AGAAGCGAGC TCAGTACGAG CGCGCCTGGG GCACGATCGA GAAGATCAAG
GAAGAGGACG GCGTCGTCTC CGGCACCGTC ATCGAGGTCG TCAAGGGCGG CCTGATCCTG
GACATCGGCC TGCGCGGCTT CCTGCCCGCG TCCCTGGTCG AGATGCGCCG TGTCCGCGAC
CTGCAGCCCT ACGTGGGCCG CCAGCTGGAC GCCAAGATCA TCGAGCTGGA CAAGAACCGC
AACAACGTGG TGCTGTCCCG CCGGCAGTGG CTCGAGCAGA CCCAGTCCGA GGTGCGCAGC
GAGTTCCTCA ACCAGCTGGG CAAGGGCCAG GTCCGCAAGG GCGTCGTGTC CTCCATCGTC
AACTTCGGTG CGTTCGTCGA CCTGGGTGGC GTGGACGGCC TGGTGCACGT CTCCGAGCTG
TCCTGGAAGC ACATCGACCA CCCGTCCGAG GTCGTCGAGG TCGGCCAGGA GGTCACCGTC
GAGGTCCTGG ACGTCGACAT GGACCGCGAG CGCGTCTCGC TGTCGCTCAA GGCGACCCAG
GAGGATCCGT GGCGGCACTT CGCCCGGACC CACGCCATCG GCCAGGTCGT CCCCGGCAAG
GTCACCAAGC TGGTTCCGTT CGGCGCGTTC GTGCGCGTCT ACGACGGCAT CGAGGGCCTG
GTGCACATCT CGGAGCTGGC CGGCCGCCAC GTGGAGGTCC CCGAGCAGGT CGTGACCGTC
GACGACGAGA TCTTCGTCCG CGTCATCGAC ATCGACCTGG AGCGTCGTCG GATCTCGCTG
TCGCTCAAGC AGGCCAACGA GGGCATCACC GCCGAGACCG AGTTCGACGA CGTGCGGGCC
CAGTACGGCG TGGTCGACCA CTACGACGAG CAGGGCAACT TCGTGCCGCC GGAGGGCTTC
GACGTCGAGA CCGGCGAGTG GCTCGAGGGC TTCGACACGC AGCGCGAGGC GTGGGAGAAG
CAGTACGCCG ACGCGCACGC GGTGTACGAG TCGCACCTCA AGCAGATCGC GGCCGCCCAG
GTGGCGGACG CGGAGTCGGC CGAGCCGAGC AACTACTCGT CCGACACGGC TGCGGCCGCC
GGCGAGCAGG TCCCGGCCGG TTCGCTGGTC AACGACGAGC AGCTCGCTGC GCTGCGGGAG
CGTCTGGCCG GTAACTGA
 
Protein sequence
MSISTVAAPQ VAINDIGSAE DFLAAIDSTI KYFNDGDIVE GTIVKVDRDE VLLDIGYKTE 
GVIPSRELSI KHDVDPSEVV EIGEHVEALV LQKEDKEGRL ILSKKRAQYE RAWGTIEKIK
EEDGVVSGTV IEVVKGGLIL DIGLRGFLPA SLVEMRRVRD LQPYVGRQLD AKIIELDKNR
NNVVLSRRQW LEQTQSEVRS EFLNQLGKGQ VRKGVVSSIV NFGAFVDLGG VDGLVHVSEL
SWKHIDHPSE VVEVGQEVTV EVLDVDMDRE RVSLSLKATQ EDPWRHFART HAIGQVVPGK
VTKLVPFGAF VRVYDGIEGL VHISELAGRH VEVPEQVVTV DDEIFVRVID IDLERRRISL
SLKQANEGIT AETEFDDVRA QYGVVDHYDE QGNFVPPEGF DVETGEWLEG FDTQREAWEK
QYADAHAVYE SHLKQIAAAQ VADAESAEPS NYSSDTAAAA GEQVPAGSLV NDEQLAALRE
RLAGN