Gene Mext_3469 details


Gene Information

Locus tag: Mext_3469
Symbol:
ID: 5831345
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 3848189
End bp: 3849373
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 70%
IMG OID: 641369267
Product: acetyl-CoA acetyltransferase
Protein accession: YP_001640925
Protein GI: 163852882
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
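
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names gene_length and protein_length are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


# Values taken from the Gene Information record for Mext_3469.
length_bp = gene_length(3848189, 3849373)
print(length_bp)                  # 1185
print(protein_length(length_bp))  # 394
```

Under these assumptions both results agree with the listed Gene Length (1185 bp) and Protein Length (394 aa).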


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid unclonability p-value: 0.753408
Num covering fosmid clones: 36
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCCA GTGAAGATAT CGTCATTGTC GGTGCGGCGC GTACGCCCGT CGGATCGTTC 
GCCGGTGCCT TCGGTTCCGT GCCGGCCCAC GAACTCGGCG CCACGGCGAT CAAGGCCGCA
CTGGAGCGCG CGGGCGTTTC GCCGGACGAC GTGGACGAGG TGATCTTCGG CCAGGTGCTC
ACCGCTGCCG CCGGGCAGAA CCCGGCCCGT CAGGCCGCCA TCGCCGCAGG CATCCCCGAG
AAGGCGACCG CCTGGGGTCT CAATCAGGTC TGCGGCTCGG GCCTGCGCAC CGTCGCGGTC
GGCATGCAGC AGATCGCCAA CGGCGACGCC AAGGTGATCG TGGCCGGCGG CCAGGAGTCG
ATGTCGCTCA GCCCGCACGC CCAGTACCTG CGCGGCGGCC AGAAGATGGG CGATCTCAAG
CTCGTCGACA CCATGATCAA GGACGGCCTG TGGGACGCCT TCAACGGCTA CCACATGGGC
CAGACCGCCG AGAACGTCGC CCAGGCCTTC CAGCTCACCC GCGAGCAGCA GGACCAGTTC
GCGGTTCGCT CGCAGAACAA GGCCGAGGCC GCCCGCAAGG AAGGCCGCTT CAAGGAAGAG
ATCGTCCCCG TCACCGTGAA GGGCCGCAAG GGCGACACGG TCGTCGACAC CGACGAGTAC
ATCCGCGACG GCGCCACCGT CGAGGCGATG GCCAAGCTCA AGCCCGCCTT CGCCAAGGAC
GGCACCGTGA CCGCGGCCAA CGCCTCGGGC CTCAACGACG GCGCCGCCGC GCTGGTGCTG
ATGTCGGCCT CCGAGGCCGA GCGCCGGGGC ATCACGCCGC TCGCCCGGAT CGTGTCCTGG
GCGACCGCCG GCGTCGATCC CAAGGTGATG GGCACGGGCC CGATCCCGGC CTCGCGCAAG
GCCCTGGAGA AGGCCGGCTG GAAGCCCGCC GACCTCGACC TGATCGAGGC GAACGAGGCT
TTCGCCGCTC AGGCGCTGGC CGTGAACAAG GACATGGGCT GGGACGACGA GAAGGTGAAC
GTCAATGGCG GCGCCATCGC CATCGGCCAC CCGATCGGTG CCTCCGGCGC CCGCGTCCTC
ATCACCCTGC TGCACGAGCT GAAGCGCCGC GACGCCAAGA AGGGCCTCGC CACGCTCTGC
ATCGGCGGCG GCATGGGTGT CGCCATGTGT GTCGAGCGGG TCTGA
 
Protein sequence
MAASEDIVIV GAARTPVGSF AGAFGSVPAH ELGATAIKAA LERAGVSPDD VDEVIFGQVL 
TAAAGQNPAR QAAIAAGIPE KATAWGLNQV CGSGLRTVAV GMQQIANGDA KVIVAGGQES
MSLSPHAQYL RGGQKMGDLK LVDTMIKDGL WDAFNGYHMG QTAENVAQAF QLTREQQDQF
AVRSQNKAEA ARKEGRFKEE IVPVTVKGRK GDTVVDTDEY IRDGATVEAM AKLKPAFAKD
GTVTAANASG LNDGAAALVL MSASEAERRG ITPLARIVSW ATAGVDPKVM GTGPIPASRK
ALEKAGWKPA DLDLIEANEA FAAQALAVNK DMGWDDEKVN VNGGAIAIGH PIGASGARVL
ITLLHELKRR DAKKGLATLC IGGGMGVAMC VERV
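
The relationship between the gene sequence, the protein sequence, and the GC content figure can be illustrated with a short script. This is a sketch under stated assumptions, not the method the database used: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons), and the helper names clean, gc_content, and translate are hypothetical. Only the first and last stretches of the gene sequence are translated here as a spot check.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Amino acid assignments are those of the standard code, which translation
# table 11 shares; alternative start codons are NOT handled in this sketch.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence listed above.
print(translate("ATGGCAGCCA GTGAAGATAT CGTCATTGTC"))  # MAASEDIVIV
print(translate("GTCGAGCGGG TCTGA"))                  # VERV
```

The two printed fragments match the start (MAASEDIVIV) and end (VERV) of the listed protein sequence. Passing the full gene sequence to gc_content gives a fraction that can be compared with the listed GC content of 70%.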