Gene M446_3043 details


Gene Information

Locus tag: M446_3043
Symbol:
ID: 6134892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 3369831
End bp: 3371696
Gene Length: 1866 bp
Protein Length: 621 aa
Translation table: 11
GC content: 76%
IMG OID: 641643234
Product: peptidase M23B
Protein accession: YP_001769888
Protein GI: 170741233
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0739] Membrane proteins related to metalloendopeptidases
TIGRFAM ID:
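
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record, and the function name `expected_lengths` is illustrative only.

```python
# Values copied from the record above.
START_BP = 3369831
END_BP = 3371696
REPORTED_GENE_LENGTH = 1866     # bp
REPORTED_PROTEIN_LENGTH = 621   # aa


def expected_lengths(start, end):
    """Return (gene_length_bp, protein_length_aa) for a CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends
    with a stop codon (both are assumptions, not stated in the record).
    """
    gene_length = end - start + 1
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and encodes no residue.
    return gene_length, codons - 1


gene_length, protein_length = expected_lengths(START_BP, END_BP)
print(gene_length == REPORTED_GENE_LENGTH)
print(protein_length == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions both reported lengths agree with the coordinates: 1866 bp is 622 codons, of which one is the stop codon.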


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.022206
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGGCCG GTGCCTCCGA CATCCTCGAC GACACCGCCT TCTCGCGTCC GGGCGGCCAG 
AAGGTCAGCC TGCGCTGGCT CGCCGCCTGC CTGCTCACCG GCATCGCCGG CGCCGCCCTG
CTCGGCCTCG CCCTCGACCT CGCCGCCTCG GACGGAATCG TCGCGGTCCC GCCCGACCTC
GCCCTGCACC CGCGTGCCGA GGCGGGCGCG CCCGGCGAGG CCGGGACCGC GCCCGCGGAA
TTCGCCGCGC GCAAGGGCGA CCGGCTGGTG CGCGACGAGG TCGTGGTGGC GGCCAAGCGC
GAGTTCCGCG CACCGCTGAG CGAGACCGTC GGCGAGCGCG AGGTGATCCG CGTCCACCAA
GTGGTGCAGG TGGCCACGGA CCTGTCGCTG CGCGCCCCCG AGGCGCCGAT CCCGCCCTTC
GACCCGATGC GCGGCGCCGG GAGCGAGGCC GTCACCGAGG AGACCGGCAG CGATCCGGCC
GAGACCGCCG TGACCCTGGT GCGCTCGCCC CTGGCGGAGG CGCCGGAGAC CGAGGCCCCC
GCCCTCTCGG ACGAGGAGGT CGACGCGCTC GTGGCCGAGA CGCAGCGCCT CGCCGGGGGG
CCGGGGCCGC TGCCCCCTGC CTTCCCGCCG GAGCGGATGC TCTCGCGGGC CCTGCGCCTC
GGGGCGGGCC GCGAGGAGGA CGAGGCCCCC GCGGGGGCGA TCGACGTGAA GATCCTGCCG
GAGAACCTGA CCGAGATCGC CGAGACCGCG GCGGCCGCGG CCGGCCCCCT GTTCGAGACC
CGCGAGGCGG TGATCGGCAA GGATCAGGCC CTGGCGGCGA TCCTGCGCGA GAACGGCGCC
GGGCCCGACC GGGTCGCCGC GATCCTGGCG GCGCTGAGCC CGCGCGCGCG GGACGAGCTG
CCCGAGGGCC AGCACCTGCG CCTGCTGATC GCGCGCGAGG GCCCGACGCC GGGCATCGCC
CGGGTGACGC TCTACGGCGA GGACGGGATC GAGGAGATCG CCGCCGCCAA CGACCGCGGC
GGCTTCGTGT CGGTGGCGCC GCCGCGGCCG GGGGCGCCGG CGGCCGAGGA GGAGGGCGGC
GTCAGCCTCT ACGAGAGCCT CTACGGCGCG GCGCTGAAGA ACGGGGTCCC GCCCGGCGTC
ATCGAGGACC TCGTGCGCGT CCTGGCCTCC GGCAGCGACC TGCAGAGCCG CACCGGGCCC
GGCGACCACG TCGAACTCCT CTTCACCGCC GACGAGGATT CCAAGCCGGA ACTGCTCTTC
GCGGCCCTGC GCAGCCGCGG GGAGACGCAG AGCCTCTACC GGTTCCGGGG GCCGGGAACG
GGCGAGGTCG AGTACCTCGA CGCGGAGGGC CGCTCGACGC GCAAGTTCCT GATCCGGCGG
CCGGTGGCCG AGGGGCGGAT CAGCTCCCCG TACGGGGCGC GCCTGCACCC GATCCTCGGC
TATTACCGGA TGCACAACGG CGTCGATTGG GCGGCGACCC GGGGCACGCC GATCATGGCG
ACGGGCGACG GCGTGGTGAT CGCGGCGGGC GCCCGCTCGG GCTACGGCAA CCGCGTCGAG
ATCCAGCACG CCAACAACTA CGTCACCGCC TACAACCACA TGGCCCGGAT CGCGCGCGGC
ATCGTGCCGG GGGCGCGGGT GCATCTCGGC CAAGTGATCG GGTCGGTCGG CACCACCGGC
CTCTCGACGG GGCCGCACGT CCACTACGAG GTCGCGATCA ACGGGCGCTT CGTCGACCCG
ATGAAGATCC GGCTGCCGAG CGCGCACGCG CTCACCGGGC CGGCCCTCGC CGCGTTCCGG
GCGGTCGAGG AGCAGGTCGA CGGGCTGCGC CACCGCCACG GCACCGCCGC CGCGGCGCGG
ATGTGA
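
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before computing statistics such as the GC content reported in the Gene Information section. A minimal sketch of one way to do that follows; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence above.
fragment = "GTGCAGGCCG GTGCCTCCGA"
print(len(clean_sequence(fragment)))  # 20
print(gc_fraction(fragment))          # 0.75
```

Applying `gc_fraction` to the complete 1866 bp sequence would give the figure to compare with the reported GC content.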
 
Protein sequence
MQAGASDILD DTAFSRPGGQ KVSLRWLAAC LLTGIAGAAL LGLALDLAAS DGIVAVPPDL 
ALHPRAEAGA PGEAGTAPAE FAARKGDRLV RDEVVVAAKR EFRAPLSETV GEREVIRVHQ
VVQVATDLSL RAPEAPIPPF DPMRGAGSEA VTEETGSDPA ETAVTLVRSP LAEAPETEAP
ALSDEEVDAL VAETQRLAGG PGPLPPAFPP ERMLSRALRL GAGREEDEAP AGAIDVKILP
ENLTEIAETA AAAAGPLFET REAVIGKDQA LAAILRENGA GPDRVAAILA ALSPRARDEL
PEGQHLRLLI AREGPTPGIA RVTLYGEDGI EEIAAANDRG GFVSVAPPRP GAPAAEEEGG
VSLYESLYGA ALKNGVPPGV IEDLVRVLAS GSDLQSRTGP GDHVELLFTA DEDSKPELLF
AALRSRGETQ SLYRFRGPGT GEVEYLDAEG RSTRKFLIRR PVAEGRISSP YGARLHPILG
YYRMHNGVDW AATRGTPIMA TGDGVVIAAG ARSGYGNRVE IQHANNYVTA YNHMARIARG
IVPGARVHLG QVIGSVGTTG LSTGPHVHYE VAINGRFVDP MKIRLPSAHA LTGPALAAFR
AVEEQVDGLR HRHGTAAAAR M
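
The gene sequence begins with GTG, yet the protein sequence begins with M. This is consistent with translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 bases of the gene; it is a simplified hand-written translator, not the tool that produced this record, and `START_CODONS` lists only three of the initiators that table 11 permits.

```python
from itertools import product

# Codon-to-residue assignments in TCAG order; table 11 uses the
# same assignments as the standard code for elongation.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplification: only the most common table 11 initiators.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(text):
    """Translate a CDS, reading a recognized start codon as M."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGCAGGCCG GTGCCTCCGA CATCCTCGAC"))  # MQAGASDILD
```

The output matches the first block of the protein sequence, MQAGASDILD.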