Gene Information |
Locus tag | M446_1718 |
Symbol | |
ID | 6135399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 1930229 |
End bp | 1931644 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641641976 |
Product | protease Do |
Protein accession | YP_001768645 |
Protein GI | 170739990 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0759509 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTGTC CGATGCGCCT TGCCCTGGTC GCCGCGCTCG CCCTCGGCCC CGCCGCGGCC CTGGCCGACC CCGCCCCGCC GACCCGCGCC CTGCCCGAGA GCCGCGCGCA GGTGCAGCTC TCCTTCGCGC CGATCGTGCG CAAGGCGGCG CCCTCGGTGG TCAACGTCTA CGGCGCCCAC GTCGAGAAGC GCTCGGCCAA CCGCAACGCC ATGGACGAGT TCCTGCGCCG CTTCTTCGGG GAATCCGGCC CCGGCGGCAC GCAGGAGCGG GCGCAGCGCT CGCTGGGCTC GGGCGTGATC GTCGACGCCT CCGGGCTGAT CGTCACCAAC AACCACGTCG TCGAGAACAT GAACGAGGTG AAGGTGGCGC TGACCGACCG GCGGGAATTC CCGGCCGAGA TCGTGCTGCG CGACCCCCGC ACCGACCTCG CGGTCCTGCG CATCAAGGCG CCGGGCGGGA TCGCCGCGAT GGAGTTCGGC GATTCCGAGG CGCTGCAGGT CGGCGACTTC GTGATCGCCA TCGGCAACCC CTTCGGGGTC GGCCAGACGG TGACGCAGGG CATCGTCTCG GCCCTCGCCC GCACGCAGGT GGGCTCGGCC GATTACCAGT TCTTCATCCA GACGGACGCG GCCATCAACC CGGGCAATTC GGGGGGCGCC CTGGTCGACC TGTCGGGCGC GCTGGTCGGC ATCAACACGG CGATCTTCTC GCAATCGGGC GGCAGCCACG GCATCGGCTT CGCCATCCCC GCCAGCATGG TGCGGGCGGT GGTGGAGACG GCGCGGGGCG GCGGGCGGAT CGTGCGCCGG CCCTGGCTCG GGGCGCGGCT GCAGAACGTG ACGCCCGACA TCGCCGACAG CGTCGGCCTC GACCACCCGA CCGGGGTGCT GGTGGCCGGC ATGCTGGGCA AGAGTCCGGC GGAGGAATCC GGCCTCAAGC GCGGCGACGT GATCCTGAGC GTGGACGGCC AGCCGGTCGA CGACCCGGAG GCCTTCGGCT ACCGCTTCGC CCTCAAGGGC ATCAGCGGCG AGACGAAGCT CGCGGTGCTG CGCGGGTCGA ACCGCATCAC GCTGCCGGTG CGCCTCGCCC CGGCGCCCGA GACGCGCCCG CGCGACACGC TCAAGATCCG CACCCGCTCG CCCTTCCTGG GGGCGACGGC GGTCAACCTC TCGCCGGCCG TCGCCGAGGA GCTGCAGCTC GACCTCCCGG CGGACGGGGT GGTGATCGCC GAGGTCGACG GCGGCAGCAT CGCGGCCCGG GCCGGGCTGC AGAAGGGCGA CGTGATCGTG GCGGTGAACG GGGCCTCGGT CGCCAGCACC AGGGACCTCG ACCGCATCAC CCGCAACAGC CTCTCGGCCT GGGAGGTGAC GATCAACCGC GGCGGCCAGC AGCTCACCTC GCTGTTCAGC GGCTGA
|
Protein sequence | MRCPMRLALV AALALGPAAA LADPAPPTRA LPESRAQVQL SFAPIVRKAA PSVVNVYGAH VEKRSANRNA MDEFLRRFFG ESGPGGTQER AQRSLGSGVI VDASGLIVTN NHVVENMNEV KVALTDRREF PAEIVLRDPR TDLAVLRIKA PGGIAAMEFG DSEALQVGDF VIAIGNPFGV GQTVTQGIVS ALARTQVGSA DYQFFIQTDA AINPGNSGGA LVDLSGALVG INTAIFSQSG GSHGIGFAIP ASMVRAVVET ARGGGRIVRR PWLGARLQNV TPDIADSVGL DHPTGVLVAG MLGKSPAEES GLKRGDVILS VDGQPVDDPE AFGYRFALKG ISGETKLAVL RGSNRITLPV RLAPAPETRP RDTLKIRTRS PFLGATAVNL SPAVAEELQL DLPADGVVIA EVDGGSIAAR AGLQKGDVIV AVNGASVAST RDLDRITRNS LSAWEVTINR GGQQLTSLFS G
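
The coordinate, length, and GC fields in this record are related by simple arithmetic, and a short sketch can make that relationship checkable. The Python below is an illustrative sanity check, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(seq: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean_sequence(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


# Values copied from the record above.
length_bp = gene_length(1930229, 1931644)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)
```

With the values from this record, the computed length agrees with the listed Gene Length of 1416 bp, and the derived residue count agrees with the listed Protein Length of 471 aa. Passing the Gene sequence field to `gc_percent` gives a value to compare against the listed GC content.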