Gene Arth_2065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2065 
SymbolrpsA 
ID4445404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2327596 
End bp2329071 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content62% 
IMG OID639689873 
Product30S ribosomal protein S1 
Protein accessionYP_831545 
Protein GI116670612 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00632138 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATCA CCTCCACCGA GAAGCCCGGT ACCCCCGTAG TCGCGATTAA CGACATCGGT 
ACCGCTGAGG ACTTCCTCGC AGCAGTCGAC GCCACCATCA AGTACTTCAA CGACGGAGAC
CTCGTCGAAG GTACCGTCGT CAAGGTCGAC CGCGACGAAG TTCTGCTCGA CATCGGTTAC
AAGACCGAAG GTGTCATCCC CTCCCGCGAG CTGTCCATCA AGCACGACGT TGATCCCGGA
GACGTCGTCT CCGTCGGCGA TCAGGTCGAA GCCCTGGTGC TCACCAAGGA AGACAAAGAA
GGCCGCCTGA TCCTCTCCAA GAAGCGTGCT CAGTACGAGC GCGCCTGGGG CGACATCGAG
AAGGTCAAGG AAGAAGACGG TGTCGTCACC GGTACCGTCA TCGAGGTTGT CAAGGGTGGT
CTTATCCTCG ACATCGGTCT GCGCGGCTTC CTGCCCGCAT CCCTCGTCGA GATGCGCCGT
GTGCGCGACC TTGCTCCGTA CATCGGTCAG CAGATCGAAG CCAAGATCAT CGAGCTGGAC
AAGAACCGCA ACAACGTTGT GCTGTCCCGC CGTGCATGGC TCGAGCAGAC CCAGTCCGAG
GTCCGCTCCA CGTTCCTCAA CAAGCTGGAA AAGGGCCAGG TTCGTCCGGG CGTCGTTTCC
TCCATCGTCA ACTTCGGTGC CTTCGTGGAC CTGGGCGGCG TAGACGGCCT GGTTCACGTT
TCCGAGCTGT CCTGGAAGCA CATCGACCAC CCGTCCGAGG TTGTCGAAGT TGGCCAGGAA
GTCACTGTCG AGGTTCTCGA GGTCGACCTG GACCGCGAGC GCGTTTCCCT GTCGCTCAAG
GCTACGCAGG AAGATCCGTG GCAGACCTTC GCCCGCACCC ACGCCCTCGG CCAGGTTGTT
CCGGGTAAGG TCACCAAGCT CGTTCCGTTC GGTGCGTTCG TTCGCGTCGA AGACGGCATC
GAAGGCCTCG TTCACATCTC CGAGCTGGCA GTCCGCCACG TGGAGCTGGC AGAGCAGGTT
GTCTCCGTTG GTGACGAGCT GTTCGTCAAG GTCATCGACA TCGACCTTGA GCGCCGCCGC
ATCTCCCTCT CCCTCAAGCA GGCTAACGAG GGCGTTGACG CCGACAGCAC CGAATTCGAT
CCCGCTCTCT ACGGCATGGC CGCTGAGTAC GACGAAGAGG GCAACTACAA GTACCCGGAG
GGCTTCGACC CGGAGTCCAA CGAGTGGCTT GAAGGCTACG AGAACCAGCG CGCAGCCTGG
GAGCAGCAGT ACGCTGACGC CCAGACCCGC TGGGAAGCAC ACAAGAAGCA GGTTGCCCAG
CACGCTGCCG ACGACGCTGC AGCTGCAACG TCCGGTGACA GCGATTCCGG CACCACCAGC
TACTCCTCCG AGCCGGCTGC AGCCGAGACC GGTGCAGGCA CGCTTGCTTC GGACGAGGCT
CTTGCAGCAC TGCGCGAGAA GCTGACCGGC AACTAA
 
Protein sequence
MTITSTEKPG TPVVAINDIG TAEDFLAAVD ATIKYFNDGD LVEGTVVKVD RDEVLLDIGY 
KTEGVIPSRE LSIKHDVDPG DVVSVGDQVE ALVLTKEDKE GRLILSKKRA QYERAWGDIE
KVKEEDGVVT GTVIEVVKGG LILDIGLRGF LPASLVEMRR VRDLAPYIGQ QIEAKIIELD
KNRNNVVLSR RAWLEQTQSE VRSTFLNKLE KGQVRPGVVS SIVNFGAFVD LGGVDGLVHV
SELSWKHIDH PSEVVEVGQE VTVEVLEVDL DRERVSLSLK ATQEDPWQTF ARTHALGQVV
PGKVTKLVPF GAFVRVEDGI EGLVHISELA VRHVELAEQV VSVGDELFVK VIDIDLERRR
ISLSLKQANE GVDADSTEFD PALYGMAAEY DEEGNYKYPE GFDPESNEWL EGYENQRAAW
EQQYADAQTR WEAHKKQVAQ HAADDAAAAT SGDSDSGTTS YSSEPAAAET GAGTLASDEA
LAALREKLTG N