Gene Mpe_A0416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0416 
Symbol 
ID4785169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp448760 
End bp449959 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content71% 
IMG OID640088974 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001019613 
Protein GI124265609 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAGC AAGTTCAAGA CGCCTACATC GTCGCCGCCA CCCGCACCCC GATCGGCAAG 
TCCGGGCGGG GCTACTTCCG CAACACGCGG CCCGACGATC TGCTGGTCGC GGCGGTGCAG
AGTGCGCTGC GGCAGGTGCC CACGCTCGAC CCGAAGGCGA TCGAGGACGC CATCGTCGGC
TGCTCCTTCC CCGAGGGCGA GCAGGGCATG AACATCGCGC GCGCCGCGAT GCTGCTGGCC
GGCCTGCCGC AGTCGGTGGG CGGCGTCACG GTCAACCGCT TCTGCGCCTC GGGCCTGACC
GCGCTGCAGA TGGCCGCCGA CCGCATCCGC ATCGGCGAGG CCGACGTGAT GATCGCCGGC
GGCGCCGAGT CGATGAGCCT GGTGCCGATG GGCGGCAACA AGCCCTCGTT CAACCCCGCC
GTGTTCGAGA AGGACGAGAA CGTGGGCATC GCCTACGGCA TGGGCCTCAC CGCCGAGAAG
GTCGCGGCGC AGTGGAAGGT GAGCCGCGAG GCGCAGGACG CCTTCGCGCT GCAGTCGCAC
CAGCGGGCGC TCGCGGCCCA GGCGGCCGGC GAGTTCACCG ACGAGATGAC GCCCATCGAC
GTGGTCGACC GCTTTCCGAA TCTGGCCACC GGCGAGGTCG GCAGCAAGAC CCGTACGGTC
ACCCTCGACG AAGGCCCGCG CCCCGACACC TCGCTGGAAG GCCTGGCCCG GCTGCGCCCG
GTGTTCGCCG CCAAGGGGTC GGTGACCGCG GGCAACAGCT CGCAGACCAG CGACGGCGCC
GGCGCGCTGA TCCTCGCCAG CGAGAAGGCG GTGCGGCAGT TCGACCTGAA GCCGCTGGCC
CGCTTCGTCA GCTATGCGAT CCGCGGCGTG CCGCCCGAGA TCATGGGCAT CGGTCCGATC
GAGGCGATCC CGCTGGCGTT GAAGCACGCC GGGCTGAAGC TCGACGACCT GGGCTGGATC
GAGCTGAACG AGGCCTTCGC GGCCCAGGCG CTGGCGGTCA TCGGAAGCGT CGGGCTCGAC
CCGGCCAAGG TCAACCCGAT GGGCGGCGCC ATCGCCCTCG GCCACCCGCT GGGCGCCACC
GGCGCCATCC GCTCGGCCAC CGTCGTGCAC GCGCTGCAAC GCCACAACCT GAAGTACGGC
ATGGTGACGA TGTGCGTCGG CATGGGCCAG GGGGCGGCGG GCATCCTTGA ACGCGTCTGA
 
Protein sequence
MAKQVQDAYI VAATRTPIGK SGRGYFRNTR PDDLLVAAVQ SALRQVPTLD PKAIEDAIVG 
CSFPEGEQGM NIARAAMLLA GLPQSVGGVT VNRFCASGLT ALQMAADRIR IGEADVMIAG
GAESMSLVPM GGNKPSFNPA VFEKDENVGI AYGMGLTAEK VAAQWKVSRE AQDAFALQSH
QRALAAQAAG EFTDEMTPID VVDRFPNLAT GEVGSKTRTV TLDEGPRPDT SLEGLARLRP
VFAAKGSVTA GNSSQTSDGA GALILASEKA VRQFDLKPLA RFVSYAIRGV PPEIMGIGPI
EAIPLALKHA GLKLDDLGWI ELNEAFAAQA LAVIGSVGLD PAKVNPMGGA IALGHPLGAT
GAIRSATVVH ALQRHNLKYG MVTMCVGMGQ GAAGILERV