Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_3148 |
Symbol | |
ID | 6135106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 3483021 |
End bp | 3484871 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641643336 |
Product | pepF/M3 family oligoendopeptidase |
Protein accession | YP_001769988 |
Protein GI | 170741333 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR02290] oligoendopeptidase, pepF/M3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00629046 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGACCC GAGCCGCGGT GCCGTCGGAA GGTTCCTTGA GCACGGCCCA GGCGACGGCC CGCGCCGTCG ATCTCGGGCC GCTGCCGGAG TGGGACCTCT CGGATCTCTA CGCGGGCCTC GACGACCCGG CCTTCGCCCG CGACCTCGCC CGCGCCGAGG CGGAGTGCCG GAGCTTCGCC GAGACCTATC GGGGCCGCGT CGCCGCGCTG GCGGGCGGGG AGGGCGCCGC CGACCAGCTC GGCACGGCGG TGGCCGCCTA CGAGGCCATC GAGGACCTCA TGGGCCGGCT GATGTCCTTC GCGGGCCTCG TCTATTCGGG CAACACGACC GACCCCGTCC GCGCCAAGTT CTACGGCGAC ACCCAGGAGC GCCTGACCGC CGCGTCGAGC GACCTCCTGT TCTTCACGCT CGAACTGAAC CGGGTGCCGG ACGCGGACAT CGACGCCGCC GCCGCCCTGC CGCCGCTCGC CCGCTACCGA CCCTGGCTGG AGGATATTCG CCGCGAGAAG CCCCACCAGC TCTCCGACGA CCTCGAGAAG CTGCTGCTGG AGAAGTCGGT GACCGGCCGG TCGGCCTGGA ACCGGCTCTT CGACGAGACC ATCGCCTCCC TGCGCTTCCC CCTGCGCGGC GAGCAGCTGA CCCTGGAGCC GACCCTCAAC AAGCTCCAGG ACGCCGACGA GGGCCTGCGC CGCGACGCCG CCGAGGCCCT GAGCGGGGTG TTCCGGGCGA ACCTGCGCGT CTTCACCCTG ATCACCAACA CGCTCGCCAA GGACAAGGAG ATCTCGGACC GCTGGCGGCG CTTCGGCGAC GTGGCGGATT CGCGCCACCT CGCCAACCGC GTCGAGCCCG AGGTGGTGGC CGCCCTGGTC GAGGCGGTGA CGGCGGCCTA TCCGCGCCTC TCGCACCGCT ACTACCGGCT GAAGGCCCGC TGGTTCCAGC GCGACAGCCT CGCCTACTGG GACCGCAACG CGCCCCTGCC GAAGGTCGAG CAGCGCACGA TTCCCTGGGC CGAGGCCCGC GAGACCGTGC TCTCCGCCTA CGGCGCCTTC TCGCCCCGGA TGGCCGAGAT CGCCCGCACC TTCTTCGAGG GCGGCTGGAT CGACGCGCCG GTGCGCCCCG GCAAGGCCCC GGGCGCCTTC GCGCACCCGA CCGTGCCCTC CGCCCATCCC TACGTGCTGG TGAACTACCA GGGCAAGCCG CGCGACGTGA TGACCCTCGC CCACGAACTC GGGCACGGCG TCCACCAGGT GCTGGCGGCC GGGAACGGCG CCCTGATGGC CCCGACCCCG CTGACGCTCG CCGAGACCGC GAGCGTGTTC GGCGAGATGC TGACCTTCCG CCGCGTCCTC GACGCCACCC GGGAGCCGCA TCAGCGCCGG GCGCTCCTCG CCGCCAAGGT GGAGGACATG ATCAACACGG TGGTGCGCCA GATCGCCTTC TACGTCTTCG AGCGCCGGCT CCACCTCGCG CGCCGGGACG GCGAACTCAC GGCCGAGCAG ATCTGCGCGC TGTGGATGTC GGTCCAGGCC GAGAGCCTCG GGCCGGCGAT CCGCCTCGAC GAGGGCTACG AGCCGTTCTG GGCCTACATC CCGCACTTCA TCCACTCGCC GTTCTACGTC TACGCCTACG CCTTCGGCGA TTGCCTGGTG AACTCCCTGT ACGGGGTCTA CCAGCGAGCC GAGGAGGGCT TCGTCGCGCG CTACTTCGCG CTGCTCTCGG CCGGCGGCAC CAAGCCCTAC GGCGAACTCC TGGCGCCCTT CGGGCTCGAT GCCCGCGACC CCTCCTTCTG GCAGATCGGC CTCTCGATGA TCGAGGGCAT GATCGCCGAG CTCGAAGCCA TGGAGGCGTG A
|
Protein sequence | MSTRAAVPSE GSLSTAQATA RAVDLGPLPE WDLSDLYAGL DDPAFARDLA RAEAECRSFA ETYRGRVAAL AGGEGAADQL GTAVAAYEAI EDLMGRLMSF AGLVYSGNTT DPVRAKFYGD TQERLTAASS DLLFFTLELN RVPDADIDAA AALPPLARYR PWLEDIRREK PHQLSDDLEK LLLEKSVTGR SAWNRLFDET IASLRFPLRG EQLTLEPTLN KLQDADEGLR RDAAEALSGV FRANLRVFTL ITNTLAKDKE ISDRWRRFGD VADSRHLANR VEPEVVAALV EAVTAAYPRL SHRYYRLKAR WFQRDSLAYW DRNAPLPKVE QRTIPWAEAR ETVLSAYGAF SPRMAEIART FFEGGWIDAP VRPGKAPGAF AHPTVPSAHP YVLVNYQGKP RDVMTLAHEL GHGVHQVLAA GNGALMAPTP LTLAETASVF GEMLTFRRVL DATREPHQRR ALLAAKVEDM INTVVRQIAF YVFERRLHLA RRDGELTAEQ ICALWMSVQA ESLGPAIRLD EGYEPFWAYI PHFIHSPFYV YAYAFGDCLV NSLYGVYQRA EEGFVARYFA LLSAGGTKPY GELLAPFGLD ARDPSFWQIG LSMIEGMIAE LEAMEA
|
| |