Gene M446_3148 details

Gene Information

Locus tag: M446_3148
Symbol:
ID: 6135106
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 3483021
End bp: 3484871
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 71%
IMG OID: 641643336
Product: pepF/M3 family oligoendopeptidase
Protein accession: YP_001769988
Protein GI: 170741333
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1164] Oligoendopeptidase F
TIGRFAM ID: [TIGR02290] oligoendopeptidase, pepF/M3 family
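
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one stop codon that is not translated; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3483021
end_bp = 3484871
gene_length_bp = 1851
protein_length_aa = 616

# Assumption: coordinates are 1-based and inclusive, so both ends count.
computed_length = end_bp - start_bp + 1

# Assumption: the unspliced CDS ends with a single untranslated stop codon.
computed_protein_length = computed_length // 3 - 1

print(computed_length == gene_length_bp)             # lengths agree with the coordinates
print(computed_protein_length == protein_length_aa)  # protein length agrees with the CDS length
```

Under these assumptions both comparisons hold for this record, since 1851 bp corresponds to 617 codons, one of which is the stop codon.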


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00629046
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCGACCC GAGCCGCGGT GCCGTCGGAA GGTTCCTTGA GCACGGCCCA GGCGACGGCC 
CGCGCCGTCG ATCTCGGGCC GCTGCCGGAG TGGGACCTCT CGGATCTCTA CGCGGGCCTC
GACGACCCGG CCTTCGCCCG CGACCTCGCC CGCGCCGAGG CGGAGTGCCG GAGCTTCGCC
GAGACCTATC GGGGCCGCGT CGCCGCGCTG GCGGGCGGGG AGGGCGCCGC CGACCAGCTC
GGCACGGCGG TGGCCGCCTA CGAGGCCATC GAGGACCTCA TGGGCCGGCT GATGTCCTTC
GCGGGCCTCG TCTATTCGGG CAACACGACC GACCCCGTCC GCGCCAAGTT CTACGGCGAC
ACCCAGGAGC GCCTGACCGC CGCGTCGAGC GACCTCCTGT TCTTCACGCT CGAACTGAAC
CGGGTGCCGG ACGCGGACAT CGACGCCGCC GCCGCCCTGC CGCCGCTCGC CCGCTACCGA
CCCTGGCTGG AGGATATTCG CCGCGAGAAG CCCCACCAGC TCTCCGACGA CCTCGAGAAG
CTGCTGCTGG AGAAGTCGGT GACCGGCCGG TCGGCCTGGA ACCGGCTCTT CGACGAGACC
ATCGCCTCCC TGCGCTTCCC CCTGCGCGGC GAGCAGCTGA CCCTGGAGCC GACCCTCAAC
AAGCTCCAGG ACGCCGACGA GGGCCTGCGC CGCGACGCCG CCGAGGCCCT GAGCGGGGTG
TTCCGGGCGA ACCTGCGCGT CTTCACCCTG ATCACCAACA CGCTCGCCAA GGACAAGGAG
ATCTCGGACC GCTGGCGGCG CTTCGGCGAC GTGGCGGATT CGCGCCACCT CGCCAACCGC
GTCGAGCCCG AGGTGGTGGC CGCCCTGGTC GAGGCGGTGA CGGCGGCCTA TCCGCGCCTC
TCGCACCGCT ACTACCGGCT GAAGGCCCGC TGGTTCCAGC GCGACAGCCT CGCCTACTGG
GACCGCAACG CGCCCCTGCC GAAGGTCGAG CAGCGCACGA TTCCCTGGGC CGAGGCCCGC
GAGACCGTGC TCTCCGCCTA CGGCGCCTTC TCGCCCCGGA TGGCCGAGAT CGCCCGCACC
TTCTTCGAGG GCGGCTGGAT CGACGCGCCG GTGCGCCCCG GCAAGGCCCC GGGCGCCTTC
GCGCACCCGA CCGTGCCCTC CGCCCATCCC TACGTGCTGG TGAACTACCA GGGCAAGCCG
CGCGACGTGA TGACCCTCGC CCACGAACTC GGGCACGGCG TCCACCAGGT GCTGGCGGCC
GGGAACGGCG CCCTGATGGC CCCGACCCCG CTGACGCTCG CCGAGACCGC GAGCGTGTTC
GGCGAGATGC TGACCTTCCG CCGCGTCCTC GACGCCACCC GGGAGCCGCA TCAGCGCCGG
GCGCTCCTCG CCGCCAAGGT GGAGGACATG ATCAACACGG TGGTGCGCCA GATCGCCTTC
TACGTCTTCG AGCGCCGGCT CCACCTCGCG CGCCGGGACG GCGAACTCAC GGCCGAGCAG
ATCTGCGCGC TGTGGATGTC GGTCCAGGCC GAGAGCCTCG GGCCGGCGAT CCGCCTCGAC
GAGGGCTACG AGCCGTTCTG GGCCTACATC CCGCACTTCA TCCACTCGCC GTTCTACGTC
TACGCCTACG CCTTCGGCGA TTGCCTGGTG AACTCCCTGT ACGGGGTCTA CCAGCGAGCC
GAGGAGGGCT TCGTCGCGCG CTACTTCGCG CTGCTCTCGG CCGGCGGCAC CAAGCCCTAC
GGCGAACTCC TGGCGCCCTT CGGGCTCGAT GCCCGCGACC CCTCCTTCTG GCAGATCGGC
CTCTCGATGA TCGAGGGCAT GATCGCCGAG CTCGAAGCCA TGGAGGCGTG A
 
Protein sequence
MSTRAAVPSE GSLSTAQATA RAVDLGPLPE WDLSDLYAGL DDPAFARDLA RAEAECRSFA 
ETYRGRVAAL AGGEGAADQL GTAVAAYEAI EDLMGRLMSF AGLVYSGNTT DPVRAKFYGD
TQERLTAASS DLLFFTLELN RVPDADIDAA AALPPLARYR PWLEDIRREK PHQLSDDLEK
LLLEKSVTGR SAWNRLFDET IASLRFPLRG EQLTLEPTLN KLQDADEGLR RDAAEALSGV
FRANLRVFTL ITNTLAKDKE ISDRWRRFGD VADSRHLANR VEPEVVAALV EAVTAAYPRL
SHRYYRLKAR WFQRDSLAYW DRNAPLPKVE QRTIPWAEAR ETVLSAYGAF SPRMAEIART
FFEGGWIDAP VRPGKAPGAF AHPTVPSAHP YVLVNYQGKP RDVMTLAHEL GHGVHQVLAA
GNGALMAPTP LTLAETASVF GEMLTFRRVL DATREPHQRR ALLAAKVEDM INTVVRQIAF
YVFERRLHLA RRDGELTAEQ ICALWMSVQA ESLGPAIRLD EGYEPFWAYI PHFIHSPFYV
YAYAFGDCLV NSLYGVYQRA EEGFVARYFA LLSAGGTKPY GELLAPFGLD ARDPSFWQIG
LSMIEGMIAE LEAMEA
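
The sequences above are laid out in blocks of 10 characters separated by spaces. As a rough sketch of how such a listing could be cleaned up and how a GC content figure could be computed from it, the following uses two hypothetical helper functions, `clean_sequence` and `gc_fraction`, which are not part of any database tool. Only the first row of the gene sequence is used as an excerpt, so its result will not necessarily match the 71% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence shown above, used here only as an excerpt.
excerpt = "ATGTCGACCC GAGCCGCGGT GCCGTCGGAA GGTTCCTTGA GCACGGCCCA GGCGACGGCC"
seq = clean_sequence(excerpt)

# Print the excerpt length and its GC content as a whole-number percentage.
print(len(seq), round(100 * gc_fraction(seq)))
```

Passing the complete gene sequence through `clean_sequence` before calling `gc_fraction` would give the figure for the whole gene.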