Gene Mext_3105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3105 
Symbol 
ID5833920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3454668 
End bp3455528 
Gene Length861 bp 
Protein Length286 aa 
Translation table11 
GC content73% 
IMG OID641368905 
Productpeptidase S49 
Protein accessionYP_001640564 
Protein GI163852521 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.246798 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCTGA CCATCCCGAC CTGGATGCGC CGCCTCCTGC CTCGCCGCTT CCGCGAGACC 
CCGCCGCGGG TCGCCGTGGT GCGCCTGAGC GGGGCGATCG GCGCCGTCTC GCCGATCCGG
GCGGGCCTTT CCATCGGCAC GGTGGCGCCG AGCCTGGAGC GCGCCTTCAC CATGCCGGGC
CTGTCGGCGG TGGCGCTCGT CATCAATTCC CCCGGCGGAT CGCCGGTGCA GTCGCACCTG
ATCTACCGGC GCATCCGGGC GCTGGCGGCG GAGAAGGAGA TCAAGGTCTT CGCCTTCGTC
GAGGATGCCG CGGCCTCGGG CGGCTACATG ATCGCGTGCG CCGCCGACGA GATCGTCGCC
GATCCCGCCT CGCTCGTCGG CTCCATCGGC GTGGTCTCGG CCGGCTTCGG TTTCGACCGG
CTGATCGAGC GCATCGGCAT CGAGCGCCGC GTCCACACCC AGGGCGAGGC CAAGGCGATG
CTCGACCCGT TCCGCCCGGA GAACCCGCTG GACATCGCCC GGCTGAAGGA GATCCAGGCC
GACGTGCAGG CCCTGTTCTC CGGCCTCGTG CGCGAGCGCC GGCCGACGCT CGACGCTAGC
CGCGACCTGT TCACCGGCGC GGTCTGGACC GGGCGGCAGG CGCTCGAGCT CGGCCTCGTC
GATGCAATCG GCGACCTGCG CGGCACCCTG CGCGCCCGTT ACGGCGAGAA GGTCGATCTG
CGGCTCGTGG CCGAGAATCG CGGCTCCTGG CTCGCCCGCC TGCTCCGCCG CGCCGGTCCG
GGCCAGACTG CGGCCGGACT CCCCGATGCG CTGATCGCGG CGGTGGAGGA GCGGGCCGCC
TGGGCACGGC TCGGGCTGTA G
 
Protein sequence
MPLTIPTWMR RLLPRRFRET PPRVAVVRLS GAIGAVSPIR AGLSIGTVAP SLERAFTMPG 
LSAVALVINS PGGSPVQSHL IYRRIRALAA EKEIKVFAFV EDAAASGGYM IACAADEIVA
DPASLVGSIG VVSAGFGFDR LIERIGIERR VHTQGEAKAM LDPFRPENPL DIARLKEIQA
DVQALFSGLV RERRPTLDAS RDLFTGAVWT GRQALELGLV DAIGDLRGTL RARYGEKVDL
RLVAENRGSW LARLLRRAGP GQTAAGLPDA LIAAVEERAA WARLGL