Gene Mpe_A1869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1869 
Symbol 
ID4786749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1995955 
End bp1997136 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content68% 
IMG OID640090439 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001021062 
Protein GI124267058 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.101968 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCG ACATCGTGAT CGTCAGCGCC GCCCGCACCG CCGTGGGCAA GTTCGGCGGC 
ACGCTCGCCA AGACCCCGGC GCCGGAACTC GGCGCCGCAG TGATCCAGGC CCTGCTGGCG
CGCAGCGGCC TGTCCGGCGA GCAGATCAGC GAGGTGATCC TCGGTCAGGT GCTCACTGCA
GGCAGTGGCC AGAACCCGGC GCGCCAATCG GTCATCAAGG CCGGCCTGCC GCAGGGCGTG
CCGGCCATGA CCATCAACAA GGTCTGCGGC TCCGGCCTGA AGGCCGTGAT GCTGGCAGCC
CAGGCCATCC GCGACGGCGA TGCCGAGATC GTGATCGCCG GCGGCCAGGA GAACATGAGC
CTGGCGCCGC ACGTGCTGCC GGGTTCACGC GACGGTCAGC GCATGGGGGA CTGGAAGCTG
ATCGACACGA TGATCGTCGA CGGCCTGTGG GATGTGTACA ACCAGTACCA CATGGGCATC
ACCGCCGAAA ACGTGGCGAA GAAATACGGC ATCACGCGCG AACAGCAGGA CGCATTGGCG
TTGGGTTCGC AGCAGAAGGC GGCCGCAGCC CAGGATGCCG GCAAGTTCAA GGACGAGATC
GTGCCGTTCA GCATCGCCCA GAAGAAGGGC GACCCGATCG TGTTCGCGGC CGACGAGTTC
ATCAACCGCA AGACCAACGC CGACGTGCTG GCCGGCCTGC GCCCGGCCTT CGACAAGGCG
GGCGGCGTGA CCGCCGGCAA CGCCTCGGGC CTGAACGACG GCGCGGCCGC GGTGCTGGTC
ATGAGCGCGA AGAAGGCCGA CCAGCTCGGC CTGAAGCCGC TTGCACGCAT CGCCTCCTAC
GCGAGCGCCG GCCTCGACCC GTCGCTGATG GGCATGGGCC CGGTGCCTGC CAGCAAGCGT
GCCTTGGAGC GGGCGGGCTG GAAGCCGGCC GACCTGGATC TGCTGGAGAT CAACGAGGCC
TTCGCTGCCC AGGCCTGCGC GGTCAACAAC GAGATGGGCT GGGACACCAG CAAGGTCAAC
GTCAACGGAG GGGCGATCGC CATCGGCCAC CCGATCGGTG CGTCGGGTTG CCGCGTGCTG
GTGACGCTGC TGCACGAGAT GCAGCGCCGC GGCAGCAAGC GCGGCATTGC GTCGCTGTGC
ATCGGCGGCG GCATGGGTGT TGCATTGACC GTGGAGCGCT GA
 
Protein sequence
MTTDIVIVSA ARTAVGKFGG TLAKTPAPEL GAAVIQALLA RSGLSGEQIS EVILGQVLTA 
GSGQNPARQS VIKAGLPQGV PAMTINKVCG SGLKAVMLAA QAIRDGDAEI VIAGGQENMS
LAPHVLPGSR DGQRMGDWKL IDTMIVDGLW DVYNQYHMGI TAENVAKKYG ITREQQDALA
LGSQQKAAAA QDAGKFKDEI VPFSIAQKKG DPIVFAADEF INRKTNADVL AGLRPAFDKA
GGVTAGNASG LNDGAAAVLV MSAKKADQLG LKPLARIASY ASAGLDPSLM GMGPVPASKR
ALERAGWKPA DLDLLEINEA FAAQACAVNN EMGWDTSKVN VNGGAIAIGH PIGASGCRVL
VTLLHEMQRR GSKRGIASLC IGGGMGVALT VER