Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3321 |
Symbol | |
ID | 4786420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 3527173 |
End bp | 3528249 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640091894 |
Product | 4-hydroxy-2-ketovalerate aldolase |
Protein accession | YP_001022509 |
Protein GI | 124268505 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR03217] 4-hydroxy-2-oxovalerate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.111533 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCA CACGTTCGAT GGAACCCGAC CTGCGTGGCC GCAAGGTCCT GCTGCACGAC ATGTGCCTGC GCGACGGCAT GCATGCCAAG CGCGAGCAGA TCCCGGTGGA GCAGATGGTC AAGGTGGCGA TGGCACTCGA CGCCGCCGGC GTGCCGCTGA TCCAGGTGAC GCACGGCGCC GGACTGGGCG GCAACTCGCT GCAGCACGGC TTCGCGCTGG CCAGCAACGA GGCCTACCTC AGCGCCGTGG CGCCGAAGAT GAAGCAGGCC AAGGTCTCGG TGCTGCTGAT CCCCGGCCTG GGCACGATGC GCGAGCTGCA GTCGGCCTAC AACTGCGGCG CGCGCAGCGT CACCGTGGCC ACCCACTGCA CCGAGGCCGA CACCGCGCCG CAGCACATCG CCTACGCGCG CAAGCTCGGC ATGGACACGG TGGGCTTCCT GATGATGGCG CACCTGAACG ACCCGGAAGG GCTCGCGAAG CAGGGCAAGC TGATGGAGGA CTACGGCGCG CAGACCGTCT ACGTGACCGA CTCCGCCGGC TACATGCTGC CGGCCGACGT GCGCGCCCGC GTGGCCGCAC TGCGCGCGGT GCTGAAGCCC GAGACCGAGA TCGGCTTCCA CGGCCACCAC AACCTGGGCA TGGGCATCGC CAACTCCATC GCCGCCATCG AGGAGGGCGC GAGCCGCATC GACGGCTCGG TGGCCGGGCT GGGTGCCGGC GCCGGCAACA CGCCGCTGGA GGTGTTCCTC GCGGTCTGCG ACCGCATGGG CATCGAGACC GGTGTCGATC TCTTCAAGCT GATGGACGTG GCCGAGGACG TGATCGTGCC GATGATGGAC CACCTGGTGC GCGTGGACCG CGAGTCGCTG ACGCTGGGCT TCGCCGGCGT GTACTCCACC TTCCTGCTGC ACGCCAAGCG CGCGGCGGCG CGCTTCGGCG TGCCGGCGCG CGAGATCCTG GTCGAGCTGG GCCGCCGCAA GATGATCGGC GGCCAGGAAG ACATGATCGA GGACACCGCG ATGAGCATGG CCAAGGAACG CGGCCTGCTG AAGGACGTGA GCCGCAAGGC CGCTTGA
|
Protein sequence | MSATRSMEPD LRGRKVLLHD MCLRDGMHAK REQIPVEQMV KVAMALDAAG VPLIQVTHGA GLGGNSLQHG FALASNEAYL SAVAPKMKQA KVSVLLIPGL GTMRELQSAY NCGARSVTVA THCTEADTAP QHIAYARKLG MDTVGFLMMA HLNDPEGLAK QGKLMEDYGA QTVYVTDSAG YMLPADVRAR VAALRAVLKP ETEIGFHGHH NLGMGIANSI AAIEEGASRI DGSVAGLGAG AGNTPLEVFL AVCDRMGIET GVDLFKLMDV AEDVIVPMMD HLVRVDRESL TLGFAGVYST FLLHAKRAAA RFGVPAREIL VELGRRKMIG GQEDMIEDTA MSMAKERGLL KDVSRKAA
|
| |