Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3105 |
Symbol | |
ID | 5833920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 3454668 |
End bp | 3455528 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641368905 |
Product | peptidase S49 |
Protein accession | YP_001640564 |
Protein GI | 163852521 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.246798 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCTGA CCATCCCGAC CTGGATGCGC CGCCTCCTGC CTCGCCGCTT CCGCGAGACC CCGCCGCGGG TCGCCGTGGT GCGCCTGAGC GGGGCGATCG GCGCCGTCTC GCCGATCCGG GCGGGCCTTT CCATCGGCAC GGTGGCGCCG AGCCTGGAGC GCGCCTTCAC CATGCCGGGC CTGTCGGCGG TGGCGCTCGT CATCAATTCC CCCGGCGGAT CGCCGGTGCA GTCGCACCTG ATCTACCGGC GCATCCGGGC GCTGGCGGCG GAGAAGGAGA TCAAGGTCTT CGCCTTCGTC GAGGATGCCG CGGCCTCGGG CGGCTACATG ATCGCGTGCG CCGCCGACGA GATCGTCGCC GATCCCGCCT CGCTCGTCGG CTCCATCGGC GTGGTCTCGG CCGGCTTCGG TTTCGACCGG CTGATCGAGC GCATCGGCAT CGAGCGCCGC GTCCACACCC AGGGCGAGGC CAAGGCGATG CTCGACCCGT TCCGCCCGGA GAACCCGCTG GACATCGCCC GGCTGAAGGA GATCCAGGCC GACGTGCAGG CCCTGTTCTC CGGCCTCGTG CGCGAGCGCC GGCCGACGCT CGACGCTAGC CGCGACCTGT TCACCGGCGC GGTCTGGACC GGGCGGCAGG CGCTCGAGCT CGGCCTCGTC GATGCAATCG GCGACCTGCG CGGCACCCTG CGCGCCCGTT ACGGCGAGAA GGTCGATCTG CGGCTCGTGG CCGAGAATCG CGGCTCCTGG CTCGCCCGCC TGCTCCGCCG CGCCGGTCCG GGCCAGACTG CGGCCGGACT CCCCGATGCG CTGATCGCGG CGGTGGAGGA GCGGGCCGCC TGGGCACGGC TCGGGCTGTA G
|
Protein sequence | MPLTIPTWMR RLLPRRFRET PPRVAVVRLS GAIGAVSPIR AGLSIGTVAP SLERAFTMPG LSAVALVINS PGGSPVQSHL IYRRIRALAA EKEIKVFAFV EDAAASGGYM IACAADEIVA DPASLVGSIG VVSAGFGFDR LIERIGIERR VHTQGEAKAM LDPFRPENPL DIARLKEIQA DVQALFSGLV RERRPTLDAS RDLFTGAVWT GRQALELGLV DAIGDLRGTL RARYGEKVDL RLVAENRGSW LARLLRRAGP GQTAAGLPDA LIAAVEERAA WARLGL
|
| |