Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A2275 |
Symbol | |
ID | 4785114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 2435115 |
End bp | 2436569 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640090843 |
Product | putative 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase oxidoreductase protein |
Protein accession | YP_001021466 |
Protein GI | 124267462 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR03216] 2-hydroxymuconic semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAGT TCCTGAACTT CATCGATGGC GAATTCGTCG CCACGGACAA GACCTTCGCC AACCGTGCGC CGGTCGACAA CCGGGTGCTG GGCTTGGTGC ACGAAGCCGG CCGCGCCGAG GTCGACGCGG CGGTGGCCGC GGCGCGCGGT GCACTGAAGG GCGAGTGGGG CCGCATGCCC GTCGCCAAGC GCGTCGAGCT GCTGTATGCG GTGGCCGACG AGATCAACCG CCGCTTCGAC GACTTCCTGG CGGCCGAGAT CGCCGACACC GGCAAGCCGC TGAGCCTGGC CTCGCACATC GACATCCCGC GCGGCGCGGC CAACTTCAAG GTGTTCGCCG ACATCATCAA GAACGTGCCG GCCGAGACCT TCGAGATGGC CACGCCCGAC GGCGGCCAGG CGCTGAACTA CGCGGTGCGC ACGCCGGTAG GTGTGGTCGG CGTGGTTTGC CCGTGGAACC TGCCGCTGCT GCTGATGACC TGGAAGGTCG GCCCGGCGCT GGCCTGCGGC AACACCGTGG TGGTCAAGCC CTCCGAGGAG ACGCCGGCCA CTGCCACGCT GCTCGGCGAG GTGATGCAGA AGGTGGGCAT GCCCAAGGGC GTCTACAACG TCGTGCACGG CTTCGGCCCG GACTCGGCCG GAGCCTTCCT CACGCAGCAC CCGGACGTCG ACGCGATCAC CTTCACCGGC GAGACGCGCA CCGGCGAGGC CATCATGGCC GCGGCGGCCA AGGGCGTGCG GCCGGTCAGC TTCGAGCTCG GCGGCAAGAA CGCCGGCATC GTGTTCGCCG ATGCCGACTT CGACAAGGCG GTGGCGGGCA TCACCCGCAG TGCCTTCGAG AACTGCGGCC AGGTCTGCCT GGGCACCGAG CGTGTCTACG TGCAGCGGCC GATCTTCGAG AAGTTCGTGC AGGCGCTCAA GGCCAAGGCC GAGGCGCTGA AGATCGGCCC GTCGGAGGAG CCCGGCGTGG GCCTGGGTCC GCTGATCTCG GCCGAGCACC GCGACAAGGT GCTGAGCTAC TACCGCAAGG CGGTGGAGCA GGGCGCCACC GTCGTCACCG GCGGCGGCGT GCCGAAGATG AGTGGCGCGC TGGCCGAAGG CCATTGGGTG CAGCCGACGA TCTGGACCGG CCTGCCCGAG TCGGCCGCAG TGATCCGCGA GGAGATCTTC GGCCCGTGCT GCCACATCGC GCCGTTCGAC ACCGAAGAGG AGGCGATCGC GCTGGCCAAT GCCACCGACT ACGGACTCGC CACCACGGTG TGGACCCAGA ACCTCGGCAC CGCGCACCGC GTGGCCCGGC AAGTCGAGGT CGGCATCTGC TGGATCAACA GCTGGTTCCT GCGCGACCTT CGCACCGCCT TCGGCGGCGC CAAGGCCTCG GGCATCGGCC GCGAAGGCGG CGTGCACTCG CTCGAGTTCT ACACCGAGCT GCGCAATGTG TGCGTGAAGC TGTGA
|
Protein sequence | MKQFLNFIDG EFVATDKTFA NRAPVDNRVL GLVHEAGRAE VDAAVAAARG ALKGEWGRMP VAKRVELLYA VADEINRRFD DFLAAEIADT GKPLSLASHI DIPRGAANFK VFADIIKNVP AETFEMATPD GGQALNYAVR TPVGVVGVVC PWNLPLLLMT WKVGPALACG NTVVVKPSEE TPATATLLGE VMQKVGMPKG VYNVVHGFGP DSAGAFLTQH PDVDAITFTG ETRTGEAIMA AAAKGVRPVS FELGGKNAGI VFADADFDKA VAGITRSAFE NCGQVCLGTE RVYVQRPIFE KFVQALKAKA EALKIGPSEE PGVGLGPLIS AEHRDKVLSY YRKAVEQGAT VVTGGGVPKM SGALAEGHWV QPTIWTGLPE SAAVIREEIF GPCCHIAPFD TEEEAIALAN ATDYGLATTV WTQNLGTAHR VARQVEVGIC WINSWFLRDL RTAFGGAKAS GIGREGGVHS LEFYTELRNV CVKL
|
| |