Gene Mpe_A1435 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1435 
Symbol 
ID4783717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1547702 
End bp1549360 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content69% 
IMG OID640090001 
Productputative CoA ligase (AMP-forming) 
Protein accessionYP_001020632 
Protein GI124266628 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.760271 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCA ACATCAGCAT CTACGAACAG GGGCTGGACC AGACCACGGC GAACTTCGTC 
GCGCTCTCGC CCGTCAGTTT TGTCGAACGC AGCGCCGAGG TGTTCGGCGA CCTGCCGGCC
GTCGTGCACG GGGCGCGCCG ACAGACCTGG GCGCAGACGC GCGAGCGTTC GGCGCGGCTC
GCCGCGGCGC TGCGTGCACT CGGCGTGGCG CGCGGCAGCA CCGTCAGCGT GATGCTGCCC
AACACGCCGG AGATGGTGGA GGCGCACTAC GCGGTGCCGG CGCTGAACGC GGTGCTGAAC
ACGCTGAACA CCCGGCTCGA CGCCGCGCTG CTGGCCTGGC AGATGAACCA CTGCGAGGCC
CAGGTGCTGA TCACCGACCG CGAGTTCGCG CCGACCATCG CCGAGGCGCT GCGGCTGCTG
CACAGCGAGC ACGGCCGCAC ACCGATCGTC ATCGACGTCT GCGACAGCGA GTACGCCGGT
CCGGGCGACC GGCTCGGCAC GCACGAGTAC GAGGCATTGT TGGCCGCCCA CGCGCCGCTG
GCGCGGCTCG ATGGTCCGGC CGACGAATGG GACGCCATCG CCGTCAGCTA CACGTCGGGG
ACCACCGGCG ACCCCAAGGG CGTGGTGACC CACCACCGCG GCGCCTACCT GAACGCGGTG
AGCAACGCGG CCACCTGGAC CATGCCGCAC TTCCCGATCT ACCTGTGGAC GCTGCCGATG
TTCCACTGCA ACGGCTGGTG CTTCCCGTGG ACGATCGCGA TGCTGGGGGG CACCCACGTG
TGCCTGCGCC GGGTCGATGC GCCCAGCATC CTCGGCGCGA TGCGCGAGCA CCGCGTCGAT
CACTACTGCG CTGCCCCGAT CGTGCACAAC CTGCTGATCG CCGCGCCCGA CGAGCTGCGC
GCCGGCATCA CGCAGAAGGT GCGCGGCATG GTGGCGGGTG CCGCGCCGCC GGCCGCGATG
ATCGAGGGCA TGGCGAAACT GGGCTTCGAT ATCACCCATG TCTACGGCCT CACCGAGGTC
TACGGCCCAG CCGCCGTGGC CGTGAAGCGC GCCAGCTGGG CCGGCGAGAG CCTGTCCGAG
CAGACGCGGC TCAACGGCCG CCAGGGCGTG CGCTACGCGC TGCAGGAGGG CATGACGGTG
CTGGACCCCG AGACGATGGT CGAGACGCCG GCCGACGGCC AGACGATGGG CGAGATCATG
TTCCGCGGCA ACATCGTGAT GAAGGGCTAC CTGAAGAACC CCCAGGCCAG CGCTGCGGCT
TTCGCGGGCG GCTGGTTCCA CACCGGCGAC CTGGCGGTGA TGGAACCGGA CCGCTACGTC
AAGATCAAGG ACCGCAGCAA GGACATCATC ATCTCCGGCG GCGAGAACAT CAGCTCCATC
GAGGTCGAGG ACGCGCTCTA CCGGCACCCG GCGGTGATGG CCTGCGCGGT GGTCGCGAGA
CCCGACCCGA AGTGGGGCGA GACGCCGGTC GCCTACGTGG AGCTCAAGCC CGGCGCCGAG
GTGAGCGCGG CGGAACTGGT CACCCACTGC AAGTCGCTGC TGGCTGGCTA CAAGGCGCCG
AAGGAGGTGC GCTTCGAAGC CATCCCCAAG ACCTCGACCG GGAAGATCCA GAAATTCCAG
CTGCGTGAGC GGGCCCGCTC GACGCAGGCG ATCGAATAG
 
Protein sequence
MSSNISIYEQ GLDQTTANFV ALSPVSFVER SAEVFGDLPA VVHGARRQTW AQTRERSARL 
AAALRALGVA RGSTVSVMLP NTPEMVEAHY AVPALNAVLN TLNTRLDAAL LAWQMNHCEA
QVLITDREFA PTIAEALRLL HSEHGRTPIV IDVCDSEYAG PGDRLGTHEY EALLAAHAPL
ARLDGPADEW DAIAVSYTSG TTGDPKGVVT HHRGAYLNAV SNAATWTMPH FPIYLWTLPM
FHCNGWCFPW TIAMLGGTHV CLRRVDAPSI LGAMREHRVD HYCAAPIVHN LLIAAPDELR
AGITQKVRGM VAGAAPPAAM IEGMAKLGFD ITHVYGLTEV YGPAAVAVKR ASWAGESLSE
QTRLNGRQGV RYALQEGMTV LDPETMVETP ADGQTMGEIM FRGNIVMKGY LKNPQASAAA
FAGGWFHTGD LAVMEPDRYV KIKDRSKDII ISGGENISSI EVEDALYRHP AVMACAVVAR
PDPKWGETPV AYVELKPGAE VSAAELVTHC KSLLAGYKAP KEVRFEAIPK TSTGKIQKFQ
LRERARSTQA IE