Gene TM1040_0200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0200 
SymbolrpsA 
ID4078648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp218325 
End bp220001 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content59% 
IMG OID638005494 
Product30S ribosomal protein S1 
Protein accessionYP_612195 
Protein GI99080041 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.977671 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCAAA CAACATCGAT GGAGGAATTC GAAGCCCTCC TGCAAGAAAG CTTCGAAATG 
GACACCCCCG AAGAGGGTTC CGTTGTCAAA GGCAAGATCA TTGCCATTGA GGCCGGTCAG
GCCATCATCG ACGTCGGCTA CAAGATGGAA GGTCGCGTCG AACTCAAAGA ATTCGCAAAC
CCCGGCGAAG CTCCTGAAGT TTCCGTTGGT GACGAGGTAG AAGTCTACCT GCGCGCCGCT
GAAAACGCCC GTGGCGAAGC TGTTATCTCC CGCGAGATGG CACGCCGCGA AGAAGCATGG
GACCGCCTGG AAAAAGCATA CGCAGACGAC CAGCGCGTCG AAGGTGCAAT CTTCGGTCGC
GTCAAAGGTG GCTTCACCGT CGATCTCGGC GGCGCCGTGG CCTTCCTGCC CGGCTCGCAA
GTTGACGTGC GCCCCGTGCG CGACGCAGGC CCGCTGATGG GTCTGAAGCA GCCGTTCCAG
ATCCTGAAAA TGGACCGTCG TCGTGGCAAC ATCGTTGTCT CCCGTCGTGC GATCCTCGAA
GAGTCCCGCG CCGAGCAGCG TGCCGAAGTC ATCGGCAACC TGTCCGAAGG CCAGGCGGTC
GACGGTGTGG TCAAGAACAT CACCGAATAC GGTGCGTTTG TTGACCTAGG CGGTGTTGAC
GGCTTGCTGC ACGTCACCGA CATGGCATGG CGCCGTGTGA ACCACCCCTC CGAGATCCTG
TCCATTGGCG AGACCGTCAA GGTTCAGGTC ATCAAGATCA ACAAAGAGAC TCACCGTATC
TCCCTCGGCA TGAAGCAGCT GCAGGAAGAT CCGTGGGATC TGGTTGGCGC CAAGTACCCG
CTGGAATCCG TTCACAAGGG TCGCGTCACC AACATCACCG ATTACGGTGC ATTTGTTGAG
CTGGAGCCCG GTGTCGAAGG TCTGGTCCAC GTCTCCGAGA TGTCCTGGAC CAAGAAAAAC
GTGCACCCCG GCAAGATCGT TTCCACCTCG CAGGAAGTGG ACGTCATGGT TCTGGAAATC
GACGGCGCCA AGCGTCGCGT GTCCCTGGGC CTCAAGCAGA CCATGCGTAA CCCGTGGGAA
GTGTTTGCAG AAACACACCC CGAGGGCACT CAGGTCGAAG GCGAAGTCAA GAACATCACC
GAATTCGGTC TGTTCATCGG CCTCGACGGC GACATCGACG GCATGGTTCA CCTCTCCGAC
CTCAGCTGGG ACGAGCGTGG CGAAGATGCG ATCCAGAACT ACCGCAAAGG CGACATGGTT
TCGGCCGTTG TCTCCGAAGT GGACGTTGAA AAAGAGCGTA TCTCCCTGTC GATCAAAGCC
CTGGGCGGTG ACAAGTTCGC AGACGCCGTT GGCGGCGTGA AGCGTGGCTC CATCGTGACC
GTGGAAGTGA CCGCGATCGA AGATGGTGGC ATCGAAGTGG AATATGAAGG CATGAAGTCC
TTTATCCGCC GCTCCGACCT CAGCCGTGAC CGTGCCGAGC AGCGCCCCGA GCGTTTCTCT
GTCGGTGACA AGGTCGACGT CCGCGTCACC AACATCGACT CCAAGACTCG TCGTCTGGGC
CTGTCGATCA AGGCACGCGA GATCGCAGAA GAGAAAGAAG CCGTCGAACA GTATGGTTCT
TCCGACTCCG GCGCGTCTCT TGGCGACATC CTGGGCGCAG CGCTCAAGAG CGAGTAA
 
Protein sequence
MAQTTSMEEF EALLQESFEM DTPEEGSVVK GKIIAIEAGQ AIIDVGYKME GRVELKEFAN 
PGEAPEVSVG DEVEVYLRAA ENARGEAVIS REMARREEAW DRLEKAYADD QRVEGAIFGR
VKGGFTVDLG GAVAFLPGSQ VDVRPVRDAG PLMGLKQPFQ ILKMDRRRGN IVVSRRAILE
ESRAEQRAEV IGNLSEGQAV DGVVKNITEY GAFVDLGGVD GLLHVTDMAW RRVNHPSEIL
SIGETVKVQV IKINKETHRI SLGMKQLQED PWDLVGAKYP LESVHKGRVT NITDYGAFVE
LEPGVEGLVH VSEMSWTKKN VHPGKIVSTS QEVDVMVLEI DGAKRRVSLG LKQTMRNPWE
VFAETHPEGT QVEGEVKNIT EFGLFIGLDG DIDGMVHLSD LSWDERGEDA IQNYRKGDMV
SAVVSEVDVE KERISLSIKA LGGDKFADAV GGVKRGSIVT VEVTAIEDGG IEVEYEGMKS
FIRRSDLSRD RAEQRPERFS VGDKVDVRVT NIDSKTRRLG LSIKAREIAE EKEAVEQYGS
SDSGASLGDI LGAALKSE