Gene Mext_2073 details

Gene Information

Locus tag: Mext_2073
Symbol: rpsB
ID: 5832416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 2313303
End bp: 2314370
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 69%
IMG OID: 641367871
Product: 30S ribosomal protein S2
Protein accession: YP_001639540
Protein GI: 163851497
COG category: [J] Translation, ribosomal structure and biogenesis; [S] Function unknown
COG ID: [COG0052] Ribosomal protein S2; [COG3743] Uncharacterized conserved protein
TIGRFAM ID: [TIGR01011] ribosomal protein S2, bacterial type
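
The start, end, and length values in this record are mutually consistent if the coordinates are read as 1-based and inclusive, and if the gene length counts the stop codon. Both readings are assumptions, since the record does not state them. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2313303, 2314370)
print(length_bp)                  # 1068
print(protein_length(length_bp))  # 355
```

Both results match the Gene Length and Protein Length fields listed in the record.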


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.193389
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.160852
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCCGTCG ATTTTTCTAT GCGCCAGCTC CTCGAGGCCG GCGCCCATTT CGGGCACCAG 
TCGCACCGCT GGAACCCGAA GATGCAGCCC TACATCTTCG GCACCCGTAA CAACATCCAC
ATCATCGACC TCGCCCAGAC CGTGCCGGCG CTCCACGCCG CGCTTCAGGC GGTGAGCGAC
ACGGTCGCCC GCGGCGGTCG CGTGCTGTTC GTCGGCACCA AGCGCCAGGC GGCGGATTCC
ATCGCCGAAG CGGCCAAGCG CTCTGCTCAG TACTACGTCA ACTCCCGCTG GCTCGGCGGC
ATGCTGACCA ACTGGAAGAC CATCTCCGGC TCGATCCAGC GCCTGCGCAA GGTCGATGAG
ACGCTCGAGG GCGGGGCCGT CGGCCTCACC AAGAAGGAGC GCCTCATGCT GACCCGTGAG
AAGGACAAGC TCGAGAAGGC GCTCGGCGGC ATCAAGGACA TGGGCGGCGT GCCCGACCTG
CTGTTCGTGA TCGACACCAA CAAGGAGCAG CTCGCGATCA AGGAGGCCCA GCGCCTCGGC
ATCCCGGTCG CTGCCATCGT CGACACGAAC TGCAACCCGG ACGGCATCTC CTACATCGTC
CCCGCCAACG ACGACGCCGG CCGCGCCATC GCGCTGTACT GCGACCTGAT CGCCCGCGCG
GCGATCGAGG GCATCGGCCG CGGCCAGGGC GCGCTCGGCC TCGACGTCGG CGCCTCCGAG
GAGCCGACCG CCGAGGAACT GCCGCCGGCC AACGACGACG TCGCGGTGTC CGTGGCCTCC
GACGCGATCG CCCCGGCCGA CGTCGCCGCG CTCGCCGAGT CGACCGAGCA CTTCGAGCAG
CTCGCTGCCC CGCGCGGTGC GCCGGACGAC CTGACCAAGC TCAACGGTGT CGGCCCGCAG
CTCGTGCAGA AGCTCAACGA CGCCGGCGTG TGGCACTACT GGCAGATCGC CGCCATGCAG
CCCGAGGACG TGGCCAAGCT CGACGCCGAC CTGAAGCTCA ACGGCCGCTT CGCTCGCGAC
GGTTGGGTCG AGCAGTCCCG CGCCTTCGTC GAGGCTTCCG CGGCCTAA
 
Protein sequence
MAVDFSMRQL LEAGAHFGHQ SHRWNPKMQP YIFGTRNNIH IIDLAQTVPA LHAALQAVSD 
TVARGGRVLF VGTKRQAADS IAEAAKRSAQ YYVNSRWLGG MLTNWKTISG SIQRLRKVDE
TLEGGAVGLT KKERLMLTRE KDKLEKALGG IKDMGGVPDL LFVIDTNKEQ LAIKEAQRLG
IPVAAIVDTN CNPDGISYIV PANDDAGRAI ALYCDLIARA AIEGIGRGQG ALGLDVGASE
EPTAEELPPA NDDVAVSVAS DAIAPADVAA LAESTEHFEQ LAAPRGAPDD LTKLNGVGPQ
LVQKLNDAGV WHYWQIAAMQ PEDVAKLDAD LKLNGRFARD GWVEQSRAFV EASAA
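
The sequences above are printed in blocks of ten characters, so they need to be joined before any calculation. The following Python sketch shows one way to do that, then compute GC content and check that a gene sequence looks like a complete coding sequence. The function names are hypothetical, and the start-codon check is a simplification: it accepts only ATG, although translation table 11 also permits alternative start codons.

```python
# Stop codons used by translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, it starts with ATG, and ends with a stop codon.

    Simplification: alternative start codons (for example GTG or TTG) are rejected.
    """
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# Toy input for illustration; paste the full gene sequence from the record instead.
toy = clean_sequence("ATGGCC GTCTAA\n")
print(toy)
print(looks_like_complete_cds(toy))
print(gc_percent(toy))
```

Running the same functions on the full gene sequence gives a value that can be compared with the GC content field in the record.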