Gene GM21_0995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0995 
SymbolrpsA 
ID8136316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1175332 
End bp1177086 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content61% 
IMG OID644868609 
Product30S ribosomal protein S1 
Protein accessionYP_003020818 
Protein GI253699629 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones104 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGACG AAAAAAGGAC GTTCAAGAAA AAAGATATGC CTATCAGGCG GTTACACGAT 
AATGACGAGG AGCTGGATCA GGCCGAGATG GGTGGCGAGT TTGCCGACCT TTTCCAGGAC
AGTCTGAGGC AGCCGCAGAG CGGCGAGGTA GTCAAGGCCG TCGTGGTCCA GATCGAGCAG
GACGTTGTGC TCGTGGACGT CGGTTACAAG TCCGAAGGCG CTATCCGCAT CGGCGAGTTC
ATCGACGAAA GCGGCGAACT CAACGTCAAG GTCGGTGACG AAGTGAACGT TTACTTCGAG
CGCGGCGAGA ACATCCGCGG CCACATGGTC CTCTCGAAGA AGAAAGCCGA TTCCCAGGTT
GCCTGGGAAA CCATCGCCGC AGCCGGCGAG GGTGGCGTCA TCGAAGGCAA GATCACCGGC
AAGGTGAAGG GCGGCATGAC CGTCGACGTC GGCGTCGAGG CATTCCTTCC CGCGTCGCAG
GTTGACCTTC GTCCCGGCGG CAACATGGAT CGCTTCGTCG GCCAGACCTA CCAGTTCAGG
ATCCTGAAGC TGAACAGGAA GCGCGGCAAC CTCGTTCTTT CCCGTCGCGT GCTGCTCGAG
GAAGAGCGCG AGAAGGCAAG GACCGAGACC CTGGCGACCC TGAAGGAAGG GGACATCGTC
AACGGCGTGG TCAAGAACAT CGCCGAGTAC GGCGCGTTCG TCGACCTGGG CGGCGTGGAC
GGCCTGCTGC ACGTAACCGA CATGTCCTGG GGCCGCCTCG GCCACCCCTC CGAGATGGTC
AAAGTGGGCG ACACCCTGAA CGTGATGGTC CTTAAGTACG ACCGCGAGAA AGGGAAGATC
TCCCTCGGCC TCAAGCAGAC CGTGCCCGAT CCCTGGCTCA ACGTCGGCGA TCGCTACAGG
GAAGGCGAGA GGGTGAGCGG CAAAGTCGTG AGCCTGACCG ACTACGGCGC ATTCATCTCC
CTGGAAGACG GCATCGAAGG TCTGGTGCAC GTTTCCGAGA TGTCCTGGAC CAGGAGGGTG
CGTCACCCGT CCGAAATCCT CAAGGTCGGA GAGGAAGTCG AAGCAGTCAT CCTGGGCGTC
GATCCGGGCA ACCGCAGGAT CTCCCTGGGG CTCAAGCAGA CCGAGATCAA CCCCTGGACC
GTGATCGGGG AGCGTTACCC GGTAGGTACC AAGATCGAAG GTCAGATCAA GAACATCACC
GACTTCGGCG TCTTCATCGG CATCGAGGAC GGCATCGACG GCCTGGTGCA TGTCTCCGAC
ATCTCCTGGA CCCGCCGCGT GAAGCACCCG GGCGAGATGT TCACCAAAGG GCAGACCGTG
CAGGCAGTCG TTCTCAACAT CGACGTCGAG AACGAGCGTC TCTCCCTGGG CATCAAGCAG
CTGGCGGCCG ATCCGTGGGA AGAGATCCCC AGGAAGTACC GTCCGGGCTC CAAGGTCAAA
GGGCGCGTCA CCTCCGTGAC CGACTTCGGC ATCTTCGTCG AGATCGAAGA AGGGATCGAG
GGACTGATCC ACGTCTCCGA GATCTCCTAC GAGAAAGTCG CCTCCCCGAA AGACTTCGCC
AACGTGGGCG ATGAGCTTGA GGCGGTAGTG CTGAACGTGG ACATGGTCGA GAAGAAGATC
GCCCTCTCCA TCAAGGCGCT GCAGACCGCC ATGGAGAAGG CAGAGATGGC CTCCTACATG
GGTAGCCAGG GTGAGGCCAC CTCCAGCTTC GGCGACCTGC TGAAAGAGAA GCTGAAGAAG
AGCACCGAGG AATAG
 
Protein sequence
MGDEKRTFKK KDMPIRRLHD NDEELDQAEM GGEFADLFQD SLRQPQSGEV VKAVVVQIEQ 
DVVLVDVGYK SEGAIRIGEF IDESGELNVK VGDEVNVYFE RGENIRGHMV LSKKKADSQV
AWETIAAAGE GGVIEGKITG KVKGGMTVDV GVEAFLPASQ VDLRPGGNMD RFVGQTYQFR
ILKLNRKRGN LVLSRRVLLE EEREKARTET LATLKEGDIV NGVVKNIAEY GAFVDLGGVD
GLLHVTDMSW GRLGHPSEMV KVGDTLNVMV LKYDREKGKI SLGLKQTVPD PWLNVGDRYR
EGERVSGKVV SLTDYGAFIS LEDGIEGLVH VSEMSWTRRV RHPSEILKVG EEVEAVILGV
DPGNRRISLG LKQTEINPWT VIGERYPVGT KIEGQIKNIT DFGVFIGIED GIDGLVHVSD
ISWTRRVKHP GEMFTKGQTV QAVVLNIDVE NERLSLGIKQ LAADPWEEIP RKYRPGSKVK
GRVTSVTDFG IFVEIEEGIE GLIHVSEISY EKVASPKDFA NVGDELEAVV LNVDMVEKKI
ALSIKALQTA MEKAEMASYM GSQGEATSSF GDLLKEKLKK STEE