Gene Mpe_A3321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3321 
Symbol 
ID4786420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3527173 
End bp3528249 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content69% 
IMG OID640091894 
Product4-hydroxy-2-ketovalerate aldolase 
Protein accessionYP_001022509 
Protein GI124268505 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR03217] 4-hydroxy-2-oxovalerate aldolase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.111533 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCA CACGTTCGAT GGAACCCGAC CTGCGTGGCC GCAAGGTCCT GCTGCACGAC 
ATGTGCCTGC GCGACGGCAT GCATGCCAAG CGCGAGCAGA TCCCGGTGGA GCAGATGGTC
AAGGTGGCGA TGGCACTCGA CGCCGCCGGC GTGCCGCTGA TCCAGGTGAC GCACGGCGCC
GGACTGGGCG GCAACTCGCT GCAGCACGGC TTCGCGCTGG CCAGCAACGA GGCCTACCTC
AGCGCCGTGG CGCCGAAGAT GAAGCAGGCC AAGGTCTCGG TGCTGCTGAT CCCCGGCCTG
GGCACGATGC GCGAGCTGCA GTCGGCCTAC AACTGCGGCG CGCGCAGCGT CACCGTGGCC
ACCCACTGCA CCGAGGCCGA CACCGCGCCG CAGCACATCG CCTACGCGCG CAAGCTCGGC
ATGGACACGG TGGGCTTCCT GATGATGGCG CACCTGAACG ACCCGGAAGG GCTCGCGAAG
CAGGGCAAGC TGATGGAGGA CTACGGCGCG CAGACCGTCT ACGTGACCGA CTCCGCCGGC
TACATGCTGC CGGCCGACGT GCGCGCCCGC GTGGCCGCAC TGCGCGCGGT GCTGAAGCCC
GAGACCGAGA TCGGCTTCCA CGGCCACCAC AACCTGGGCA TGGGCATCGC CAACTCCATC
GCCGCCATCG AGGAGGGCGC GAGCCGCATC GACGGCTCGG TGGCCGGGCT GGGTGCCGGC
GCCGGCAACA CGCCGCTGGA GGTGTTCCTC GCGGTCTGCG ACCGCATGGG CATCGAGACC
GGTGTCGATC TCTTCAAGCT GATGGACGTG GCCGAGGACG TGATCGTGCC GATGATGGAC
CACCTGGTGC GCGTGGACCG CGAGTCGCTG ACGCTGGGCT TCGCCGGCGT GTACTCCACC
TTCCTGCTGC ACGCCAAGCG CGCGGCGGCG CGCTTCGGCG TGCCGGCGCG CGAGATCCTG
GTCGAGCTGG GCCGCCGCAA GATGATCGGC GGCCAGGAAG ACATGATCGA GGACACCGCG
ATGAGCATGG CCAAGGAACG CGGCCTGCTG AAGGACGTGA GCCGCAAGGC CGCTTGA
 
Protein sequence
MSATRSMEPD LRGRKVLLHD MCLRDGMHAK REQIPVEQMV KVAMALDAAG VPLIQVTHGA 
GLGGNSLQHG FALASNEAYL SAVAPKMKQA KVSVLLIPGL GTMRELQSAY NCGARSVTVA
THCTEADTAP QHIAYARKLG MDTVGFLMMA HLNDPEGLAK QGKLMEDYGA QTVYVTDSAG
YMLPADVRAR VAALRAVLKP ETEIGFHGHH NLGMGIANSI AAIEEGASRI DGSVAGLGAG
AGNTPLEVFL AVCDRMGIET GVDLFKLMDV AEDVIVPMMD HLVRVDRESL TLGFAGVYST
FLLHAKRAAA RFGVPAREIL VELGRRKMIG GQEDMIEDTA MSMAKERGLL KDVSRKAA