Gene Mpe_A2417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2417 
Symbol 
ID4784313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2581884 
End bp2583293 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content71% 
IMG OID640090987 
Productethanolamine ammonia-lyase heavy chain 
Protein accessionYP_001021607 
Protein GI124267603 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4303] Ethanolamine ammonia-lyase, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.739153 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTACG CGCACTCCAT CGGCCCGCGG CGCTACAGCT TCGACGACCT GCGCACGTTG 
ATGGCCCGGG CTTCGCCCGC GCGCTCCGGC GACGAACTCG CCGGGGTCGC GGCGCGCAGC
AGCGAGGAGC GCGCCGCGGC GCAGATCGCG CTGGCGGCGC TGCCGCTCAA GACCTTCCTG
AACGACGCGC TGGTGCCCTA CGAGGACGAC GAGGTCACGC GCCTCATCCT CGACAGCCAA
GACGCCGCCG CCTTTGCCCC GATCGCCCAC CTCACTGTGG GCGACCTGCG CGACTGGCTG
CTGTCCGACG CGGTCGACGG CGACACGCTG CGCGCCGCGG CGCCCGGCCT GACGCCCGAG
ATGACCGCAG CGGTCAGCAA GATCATGCGC AACCAGGACC TGATCCTGGT CGCGCGCAAA
TGCCGCGTGC GCACGCGCTT TCGCGACACC ATTGGCCTGC CCGGCCGGCT GTCCACCCGG
CTGCAGCCCA ACCACCCGAC CGACAACGCC GCCGGCATCG CCGCCAGCAC GCTCGACGGC
CTTCTCTACG GCAGCGGCGA CGCGGTGATC GGCATCAACC CGGCCACCGA CAACGTGGCC
CAGGTGACGC GGCTGCTGCA GATGCTGGAC GCGGTGATCC AGCGCTACGA GATCCCCACG
CAGAGCTGCG TGCTGACCCA CGTGACCAAC ACGCTGCAGT GCATCGAGCG CGGCGCGCCG
GTGGACCTGG TGTTCCAGTC GATCGGGGGC ACCGAAGCCA CCAACCGCAG CTTCGGCATC
GACCTGGCGC TGCTCGCCGA GGCGCACGAC GCCGCACTGT CGCTGCAGCG CGGCGCGCTG
TTCTCCGACG ACGGCCAGCG CGGCCGCAAC GTCATGTACT TCGAGACCGG CCAGGGCAGC
GCGCTGTCAG CCCAGGCCCA CCACGGCTGC GACCAGCAGA CGATCGAGGC GCGGGCCTAC
GCGGTGGCAC GGCGCTTCGA GCCGCTGCTG GTGAACAGCG TGGTCGGCTT CATCGGCCCC
GAGTACCTGT ACGACGGCAA GCAGATCATC CGCGCCGGCC TCGAAGACCA TTTCTGCGCC
AAGCTGCTCG GCCTGCCGAT GGGCTGCGAC ATCTGCTACA CCAACCACGC CGAGGCCGAC
CAGAACGACA TGGACGTGCT GCTCACGTTG CTGGGCGTGG CCGGCTGCAG CTTCATCATG
GGCATCCCGG GCTCGGACGA CGTGATGCTG AACTACCAGA CCACATCCTT CCACGACGCG
CTCTACGCAC GCCGCGTGCT GGGCCTGCGG CCCGCGCCGG AGTTCGAGCG CTGGCTGGAG
GCCATGCGCA TCACCGAGCC CGGCGCGCCC GATCGGCTGA CCGACCTGAT GCCGCCGGCC
CTGGCCCGCC TGCTGGAGCA CCCGCGCTGA
 
Protein sequence
MAYAHSIGPR RYSFDDLRTL MARASPARSG DELAGVAARS SEERAAAQIA LAALPLKTFL 
NDALVPYEDD EVTRLILDSQ DAAAFAPIAH LTVGDLRDWL LSDAVDGDTL RAAAPGLTPE
MTAAVSKIMR NQDLILVARK CRVRTRFRDT IGLPGRLSTR LQPNHPTDNA AGIAASTLDG
LLYGSGDAVI GINPATDNVA QVTRLLQMLD AVIQRYEIPT QSCVLTHVTN TLQCIERGAP
VDLVFQSIGG TEATNRSFGI DLALLAEAHD AALSLQRGAL FSDDGQRGRN VMYFETGQGS
ALSAQAHHGC DQQTIEARAY AVARRFEPLL VNSVVGFIGP EYLYDGKQII RAGLEDHFCA
KLLGLPMGCD ICYTNHAEAD QNDMDVLLTL LGVAGCSFIM GIPGSDDVML NYQTTSFHDA
LYARRVLGLR PAPEFERWLE AMRITEPGAP DRLTDLMPPA LARLLEHPR