Gene Mpe_A1246 details


Gene Information

Locus tag: Mpe_A1246
Symbol:
ID: 4785147
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1342170
End bp: 1343111
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 69%
IMG OID: 640089812
Product: putative histone deacetylase-family protein
Protein accession: YP_001020443
Protein GI: 124266439
COG category: [B] Chromatin structure and dynamics; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein
TIGRFAM ID:
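
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 1342170
end_bp = 1343111

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 942 313
```

Under these assumptions the result agrees with the listed Gene Length (942 bp) and Protein Length (313 aa).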


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0489566
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.18397
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCACCG CCTACTACTC CCACCCCGCC TGCCGCACCC ATGACATGGG CGATGGCCAC 
CCCGAGTGCC CGCAGCGGCT CGACGCCATT GCCGACCACC TGATCGCCAC CGGCCTGGAC
ATCGCGCTCG AGCAGCGCGA GGCGCCGCCG GTCGACGAGG GCACCTTGGC GCTGGCCCAT
GAGTCGCTGT ACGTCGCCAA CCTGCGCGGC TTCATGGAGC AGGTGGAGGC CACCGGCCAG
CGCCGCGCGC TGGACCCCGA CACCGTCGCC GGCCCCGGCA CCTGGGCTGC GGCGCTGCAT
GCGGCCGGGG CCGCGGTGGC GGCGACCGAT GCGGTGCTCG ATGGCGAGAT CGAGAACGCC
TTCTGCGCGG TGCGCCCGCC GGGCCACCAT GCCACCCGGC GCCAGGCCAT GGGTTTCTGC
TTCTTCAACA ACGTCGCCGT CGCGGCACGC CATGCACTCG ACCGGCGCGG GCTGGAGCGT
GTGGCGATCA TCGATTTCGA TGTCCACCAC GGCAACGGCA CCGAGGACAT CATCGCCAAC
GACGATCGCG TGCTGATGTG CAGCTTCTTC CAGCATCCGC TGTACCCGAA CTCCGGGGCC
CAGCCGCTGG GCGACAACAT GGTGAACCTG CCGGTGCCGG CATATACCAA GGGATCCGAG
ATCCGCGAAC TGATCGACCG GCACTGGTTG CCGCGGCTCG AGGCCTTCCG GCCGCAGATG
CTGTTCATTT CGGCGGGTTT CGACGCTCAC CGGGAGGACG ATCTCGGTCA ACTCGGGTTG
GTGGAGAGCG ACTACGCCTG GATCACGCGG CGCCTCAAGG GTGTGGCCGA ACGCCATGCC
GACGGTCGCA TCGTCTCGTG TCTGGAGGGG GGCTACGCAC TGAGTGCGCT GGCGCGCAGC
GTGGCCAGCC ACGTGCGAGT GCTGGCCGGA CTGACCGATT GA
 
Protein sequence
MATAYYSHPA CRTHDMGDGH PECPQRLDAI ADHLIATGLD IALEQREAPP VDEGTLALAH 
ESLYVANLRG FMEQVEATGQ RRALDPDTVA GPGTWAAALH AAGAAVAATD AVLDGEIENA
FCAVRPPGHH ATRRQAMGFC FFNNVAVAAR HALDRRGLER VAIIDFDVHH GNGTEDIIAN
DDRVLMCSFF QHPLYPNSGA QPLGDNMVNL PVPAYTKGSE IRELIDRHWL PRLEAFRPQM
LFISAGFDAH REDDLGQLGL VESDYAWITR RLKGVAERHA DGRIVSCLEG GYALSALARS
VASHVRVLAG LTD
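
The sequences above are printed in space-separated blocks of 10 residues, so whitespace has to be removed before any length or composition check. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used here only as sample input.
first_line = "ATGGCCACCG CCTACTACTC CCACCCCGCC TGCCGCACCC ATGACATGGG CGATGGCCAC"
seq = clean_sequence(first_line)

print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full gene sequence through the same two helpers gives a value that can be compared with the GC content reported in the record.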