Gene Mpe_A2275 details


Gene Information

Locus tag: Mpe_A2275
Symbol:
ID: 4785114
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2435115
End bp: 2436569
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 69%
IMG OID: 640090843
Product: putative 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase oxidoreductase protein
Protein accession: YP_001021466
Protein GI: 124267462
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR03216] 2-hydroxymuconic semialdehyde dehydrogenase
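
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive (an assumption, though it agrees with the reported length) and that the final codon is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2435115, 2436569)
print(length)                     # 1455, matching "Gene Length"
print(protein_length_aa(length))  # 484, matching "Protein Length"
```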


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACAGT TCCTGAACTT CATCGATGGC GAATTCGTCG CCACGGACAA GACCTTCGCC 
AACCGTGCGC CGGTCGACAA CCGGGTGCTG GGCTTGGTGC ACGAAGCCGG CCGCGCCGAG
GTCGACGCGG CGGTGGCCGC GGCGCGCGGT GCACTGAAGG GCGAGTGGGG CCGCATGCCC
GTCGCCAAGC GCGTCGAGCT GCTGTATGCG GTGGCCGACG AGATCAACCG CCGCTTCGAC
GACTTCCTGG CGGCCGAGAT CGCCGACACC GGCAAGCCGC TGAGCCTGGC CTCGCACATC
GACATCCCGC GCGGCGCGGC CAACTTCAAG GTGTTCGCCG ACATCATCAA GAACGTGCCG
GCCGAGACCT TCGAGATGGC CACGCCCGAC GGCGGCCAGG CGCTGAACTA CGCGGTGCGC
ACGCCGGTAG GTGTGGTCGG CGTGGTTTGC CCGTGGAACC TGCCGCTGCT GCTGATGACC
TGGAAGGTCG GCCCGGCGCT GGCCTGCGGC AACACCGTGG TGGTCAAGCC CTCCGAGGAG
ACGCCGGCCA CTGCCACGCT GCTCGGCGAG GTGATGCAGA AGGTGGGCAT GCCCAAGGGC
GTCTACAACG TCGTGCACGG CTTCGGCCCG GACTCGGCCG GAGCCTTCCT CACGCAGCAC
CCGGACGTCG ACGCGATCAC CTTCACCGGC GAGACGCGCA CCGGCGAGGC CATCATGGCC
GCGGCGGCCA AGGGCGTGCG GCCGGTCAGC TTCGAGCTCG GCGGCAAGAA CGCCGGCATC
GTGTTCGCCG ATGCCGACTT CGACAAGGCG GTGGCGGGCA TCACCCGCAG TGCCTTCGAG
AACTGCGGCC AGGTCTGCCT GGGCACCGAG CGTGTCTACG TGCAGCGGCC GATCTTCGAG
AAGTTCGTGC AGGCGCTCAA GGCCAAGGCC GAGGCGCTGA AGATCGGCCC GTCGGAGGAG
CCCGGCGTGG GCCTGGGTCC GCTGATCTCG GCCGAGCACC GCGACAAGGT GCTGAGCTAC
TACCGCAAGG CGGTGGAGCA GGGCGCCACC GTCGTCACCG GCGGCGGCGT GCCGAAGATG
AGTGGCGCGC TGGCCGAAGG CCATTGGGTG CAGCCGACGA TCTGGACCGG CCTGCCCGAG
TCGGCCGCAG TGATCCGCGA GGAGATCTTC GGCCCGTGCT GCCACATCGC GCCGTTCGAC
ACCGAAGAGG AGGCGATCGC GCTGGCCAAT GCCACCGACT ACGGACTCGC CACCACGGTG
TGGACCCAGA ACCTCGGCAC CGCGCACCGC GTGGCCCGGC AAGTCGAGGT CGGCATCTGC
TGGATCAACA GCTGGTTCCT GCGCGACCTT CGCACCGCCT TCGGCGGCGC CAAGGCCTCG
GGCATCGGCC GCGAAGGCGG CGTGCACTCG CTCGAGTTCT ACACCGAGCT GCGCAATGTG
TGCGTGAAGC TGTGA
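
The gene sequence is printed in blocks of 10 bases separated by spaces, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage like the "GC content" field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the usage line covers only the first 10 bases of the gene, so its result is not expected to match the 69% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First block of the gene sequence only (3 of 10 bases are G or C).
print(gc_percent("ATGAAACAGT"))  # 30.0
```

Passing the full sequence text, with its spaces and line breaks left in, to `gc_percent` gives the figure to compare against the GC content field.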
 
Protein sequence
MKQFLNFIDG EFVATDKTFA NRAPVDNRVL GLVHEAGRAE VDAAVAAARG ALKGEWGRMP 
VAKRVELLYA VADEINRRFD DFLAAEIADT GKPLSLASHI DIPRGAANFK VFADIIKNVP
AETFEMATPD GGQALNYAVR TPVGVVGVVC PWNLPLLLMT WKVGPALACG NTVVVKPSEE
TPATATLLGE VMQKVGMPKG VYNVVHGFGP DSAGAFLTQH PDVDAITFTG ETRTGEAIMA
AAAKGVRPVS FELGGKNAGI VFADADFDKA VAGITRSAFE NCGQVCLGTE RVYVQRPIFE
KFVQALKAKA EALKIGPSEE PGVGLGPLIS AEHRDKVLSY YRKAVEQGAT VVTGGGVPKM
SGALAEGHWV QPTIWTGLPE SAAVIREEIF GPCCHIAPFD TEEEAIALAN ATDYGLATTV
WTQNLGTAHR VARQVEVGIC WINSWFLRDL RTAFGGAKAS GIGREGGVHS LEFYTELRNV
CVKL
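
The protein sequence can be derived from the gene sequence by codon translation. The record specifies translation table 11, whose amino acid assignments are the same as the standard code (the two differ in which codons may act as start codons). The sketch below is a simplified illustration, not the database's own method: it translates in frame from the first base, stops at the first stop codon, and does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G with the third base varying fastest),
# paired with the amino acids of the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate DNA in frame from its first base up to the first stop codon."""
    seq = "".join(raw_dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAAACAGT TCCTGAACTT CATCGATGGC"))  # MKQFLNFIDG
# Last 15 bases, ending in the TGA stop codon.
print(translate("TGCGTGAAGC TGTGA"))  # CVKL
```

Both fragments agree with the corresponding ends of the protein sequence listed above.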