Gene Mpe_A0163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0163 
Symbol 
ID4784126 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp171600 
End bp173663 
Gene Length2064 bp 
Protein Length687 aa 
Translation table11 
GC content72% 
IMG OID640088711 
Productendothelin-converting protein 1 
Protein accessionYP_001019360 
Protein GI124265356 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3590] Predicted metalloendopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.847092 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCCCGC ACGACCCGCC CTTCCGTCCT GTTCACGCCT GCGGGCTGCA TCGTGCGCTG 
TTCGGCCCCG AGTCGGCGGC GGGCCAGGCC GAACCGGGCC GTGGCGCTGC GGCGTCGGGC
ATCGAGCAGA GCTGCTTCGA CCGCTCGGTG CGCGCGCAGG ACGACCTGTT CCGCCATGTC
AACGGCGGGT GGCTGAAGCA CACCGCCATT CCCGCCGACC GCGCGTCCAC CGGCGCCTTC
ATGCAGATCC ACGACCGCAT CCAGGACCAG TTGCTGGCCC TGATCGACGA GGCGTCCGCC
GAGGGCCAGG ACGGCGAGGC GCGGCAGATC GGCGATCTGT ACGCCAGCTT CATGGACGAG
GCCGCGATCG AGGCGCGTGG CCTCGCGCCG CTGCAGGACG AACTGGCGGC TGTTGCCGCG
ATCAACGATC GCGCGCAGTT CGGTGCATGG CTTGCCGACG CCGTCAGCGC CGGCCTGGGG
GTGCCGCTGG CCCTGCATAT CGGCCAGGAC GATCGCGATT CGACGCGCTA CGTGCCCTTC
CTGTCGCAAG GCGGCCTGGG CCTGCCCGAC CGCGACTACT ACCTGCTGGA GGACAACGCG
CGTTTCGGCG AGGTGCGCGC GCAGTACCGG GCGCACATGG CCGCGATGCT GGTGCTGGCC
GGCGAGCCTG CGGCCGCGGC CGAGGCGGCG GCGCAGGCCG TGTTGGCGCT CGAGACCGAG
CTGGCCCAGG CGCAGTGGTC GCGCGTCGAG AACCGTGACC CGGTGAAGAC CTACAACCGC
TGCGACTTCG CCACGCTGCG CGCGCTGGCC CCGGCGATCG ACTGGGACGG CTTCGCCGCG
CGCACCGGCC TGGCCGGTCG CGCCGAAGGG CTGGTGGTCG GCCAGCCGAG TTACCTGGCT
GCGCTGTCGG CGCGGCTCGC CGACGCGCCG CTCGACGCCT GGAAGGCCTA CGCGACGCTG
CGTGTGCTGT ATGCCTACGC GCCCTTCCTG GGCCGCGCGA TCGTCGACGC CCGTTTCGCC
TTCACCGGCA CCGTGCTGCG CGGCACGCCG GAGAACCTGC CGCGCTGGAA ACGCGGTGTC
GCGCTGGTCG AGGGCTGCCT TGGCGAGGGC CTGGGCCAGC TCTACGTGGC CCGACACTTC
CCGCCGGCCC ACAAGGCGCG CATGGAGGCG CTGGTCGCGC AACTGCTCGC GGCCTACCGA
CGGAACCTCG ACACGCTGGA CTGGATGGGG CCGGCCACGC GCGCCCAGGC GCAGGCCAAG
CTGGCCCGGC TCGTGACCAA GATCGGCTAC CCGGTGCGCT GGCGCGACTA CCGCGCGCTG
GAGATCCGCC GCGACGACGT GGTCGGCAAC GTGCGGCGCG TGCGTGCGTT CGAGCATGCG
CGCCAGCTCG CTCGGCTGGG CCAGCCGATC GACCGCGACG AGTGGGGCAT GACGCCGCAG
ACCGTGAACG CCTACTACAA CCCGTCGATG AACGAGATCG TGTTCCCGGC GTCCATCCTG
CAGCCGCCGT TCTTCGACGC GGACGCCGAC GACGCGGTGA ACTACGGCGC GATCGGTGCC
GTCATCGGCC ACGAGATCAG CCACGGCTTC GACGACATGG GCAGCCAATA CGACGCCGAC
GGCAATCTGC GCGACTGGTG GACTGCCGAG GACCGCGCCC GCTTCGCCGC CAAGACCAGC
GTGCTGGTGG CGCAGTACGG TGCCTACGAG CCGCTGCCGG GCTATCCGAT CGACGGCGCG
CTGTCGCTGG GCGAGAACAT TGCCGACAAC GCCGGCCTGG CGATCGCCTT CCAGGCCTAC
CAGCGCTCGC TCGGTGGCCG GCCGGCCCCG GTGATCGACG GGCTGGAGGG CGCGCAGCGC
TTCTTCTACG GCTTCGCTCA GGTGTGGCGC GGCAAGCAGC GCGAGGCGGC GCTGATCGAG
CAGATCAAGG CCGGCCCGCA TGCGCCCGGC GAGTTCCGCG CCAACGGCAC GGTGCGCAAC
CATCCCGGCT TCTACGCCAC CTTCGGCGTG CAGCCGGGCG ATGCGCTCTA CCTGCCCGAG
GCGCAGCGCG TCTCCGTCTG GTGA
 
Protein sequence
MSPHDPPFRP VHACGLHRAL FGPESAAGQA EPGRGAAASG IEQSCFDRSV RAQDDLFRHV 
NGGWLKHTAI PADRASTGAF MQIHDRIQDQ LLALIDEASA EGQDGEARQI GDLYASFMDE
AAIEARGLAP LQDELAAVAA INDRAQFGAW LADAVSAGLG VPLALHIGQD DRDSTRYVPF
LSQGGLGLPD RDYYLLEDNA RFGEVRAQYR AHMAAMLVLA GEPAAAAEAA AQAVLALETE
LAQAQWSRVE NRDPVKTYNR CDFATLRALA PAIDWDGFAA RTGLAGRAEG LVVGQPSYLA
ALSARLADAP LDAWKAYATL RVLYAYAPFL GRAIVDARFA FTGTVLRGTP ENLPRWKRGV
ALVEGCLGEG LGQLYVARHF PPAHKARMEA LVAQLLAAYR RNLDTLDWMG PATRAQAQAK
LARLVTKIGY PVRWRDYRAL EIRRDDVVGN VRRVRAFEHA RQLARLGQPI DRDEWGMTPQ
TVNAYYNPSM NEIVFPASIL QPPFFDADAD DAVNYGAIGA VIGHEISHGF DDMGSQYDAD
GNLRDWWTAE DRARFAAKTS VLVAQYGAYE PLPGYPIDGA LSLGENIADN AGLAIAFQAY
QRSLGGRPAP VIDGLEGAQR FFYGFAQVWR GKQREAALIE QIKAGPHAPG EFRANGTVRN
HPGFYATFGV QPGDALYLPE AQRVSVW