Gene Mpe_B0340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_B0340 
Symbol 
ID4787968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008826 
Strand
Start bp287830 
End bp289251 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content64% 
IMG OID640092772 
ProductC-5 cytosine-specific DNA methylase 
Protein accessionYP_001023350 
Protein GI124262880 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCAC TGCATACCTA CGCAGTACGC AAGATTGGCA GCCACCGTGG TTCGCCACGG 
CTCTGGCTGG AGGGCAGGGA GCCGACCAAG GGAGGTTTCC TGCCGGGCAC GCGCTTCAAC
ACGCGTGTCG ACACCGGCCG GGCACTCCTG GTGCTGGAGG CGGTCGAAGA TGGTGTTCGC
ATCGTCTCTG GCAAGCAGCG CGGCGACCGG CAGATCCCGG TCATCGACAT CAACAGCAAG
GAACTGCTCG ACATCTTCAC GGGTATCGAG GCGGTCCGCG TGATCGTCCA GGAGGGTGTC
ATCAGCATCC TGCCGCTGGC CTCCGAACTG CGCGCGCGCG AGCGCGTCAT TCGGCTGAAG
GACGGACTGG CGAACGGAAC CCTTTCCACC GGCTCGGTTT CGAGCGGCAT CGGGGTGCTT
GACCGTGCGG CGCACGAGGG CCTTGAACAG GCCGGCGTGG AGTGCCGCCT GGCCTTCGCG
AACGAGATCC GGGAAGACTG CGTCGAGCAC ATGTGCGATC ACAACCCGAT CGTGGACCAG
CACACCGTGA CCCTGACGGC GCCGATGCAG GAGCTCGCGT TCGACGAGTG GGCGATGAGC
CGCTTGCCGA AGGTGGACGT TCTGGTCGGC GGCATCCCTT GTTCCGGCGC AAGCAGGGCA
GGGCGCGCGA AGCGGGGCGC CTCGCACGCG GAAGCGCACC CCGAAGTCGG TCACCTGATC
GTGGCCTTCC TCGCCATCAT CGCCAAGGTG AACCCGTCGG CCATCGTGCT GGAGAACGTC
CCCGTCTGGG GAACCTCTGC TTCGATGTTC ATCCTGCGCA ACCAGCTGCG GGACCTGGGA
TACGACGTCC ACGAGACGAT CGTCAACTCG GCCGAATGGA ACGTGCTCGA GCACCGGGAG
CGCCTGTGTG TCGTGGCGGT GACCAAGGGG ATCGAGTTCA GCTTCGACGG CCTAGAGCGG
CCGGAGCCCG TGAGTCGCCG TCTCGGCGAG ATCATGGACG AAGTGCCGGT GGACGCGCCG
TGCTGGAGCG AGATGGCCTA CCTGAAGGAC AAGCGTGCCC GCGACGAGGC CAAAGGCAAC
AACTTCAAGA TGACGGTGCT CACGCCCGAC AGCGAGAAGG TGCCTTGCCT GAACAAGTCC
TTGTGGAAGC GCCAGAGTTC TGGCAGTTTC TGGAAGCATC CGGACGACAG CAACCTTCTG
AGGCTGCCCA CAGTGCGTGA ACACGCGCGC TGCAAGGGTG TCTGGGAGGA TCTGGTCGAA
GGTGTCGGCC TGACCTTCGG GCACGAGGCT CTCGGACAAT CCGTCACGGT TCCGCCGTTC
ATCTCGATCT TCAAGCTGCT CGGCCAGGCG CTCAAACGCT TTGCCAGTGA GGCTGAGGCT
TCGATCCAAC CCTTCGCGCT CCGCGAGCTC AAGGCAGCCT GA
 
Protein sequence
MTSLHTYAVR KIGSHRGSPR LWLEGREPTK GGFLPGTRFN TRVDTGRALL VLEAVEDGVR 
IVSGKQRGDR QIPVIDINSK ELLDIFTGIE AVRVIVQEGV ISILPLASEL RARERVIRLK
DGLANGTLST GSVSSGIGVL DRAAHEGLEQ AGVECRLAFA NEIREDCVEH MCDHNPIVDQ
HTVTLTAPMQ ELAFDEWAMS RLPKVDVLVG GIPCSGASRA GRAKRGASHA EAHPEVGHLI
VAFLAIIAKV NPSAIVLENV PVWGTSASMF ILRNQLRDLG YDVHETIVNS AEWNVLEHRE
RLCVVAVTKG IEFSFDGLER PEPVSRRLGE IMDEVPVDAP CWSEMAYLKD KRARDEAKGN
NFKMTVLTPD SEKVPCLNKS LWKRQSSGSF WKHPDDSNLL RLPTVREHAR CKGVWEDLVE
GVGLTFGHEA LGQSVTVPPF ISIFKLLGQA LKRFASEAEA SIQPFALREL KAA