Gene Mpe_A0688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0688 
Symbol 
ID4784429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp718881 
End bp721031 
Gene Length2151 bp 
Protein Length716 aa 
Translation table11 
GC content67% 
IMG OID640089248 
ProductTonB-dependent receptor protein 
Protein accessionYP_001019885 
Protein GI124265881 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4773] Outer membrane receptor for ferric coprogen and ferric-rhodotorulic acid 
TIGRFAM ID[TIGR01783] TonB-dependent siderophore receptor 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGCC TGCATCGTGT TTCCCCGCTG ACCTTCGCCT GCCTCGCCCT GGCTGCCCAT 
GCGCAGTCCG AACCGCCTTC CCAGGTCCTG CCGCCGGTCC AGGTGCAGGG CCAGCGCGAC
GACTACCGCG CCCCCGAAAC CACCACCGGC AACCGGACGT CCACGCCGTC GCTCCAAAGC
CCTCAGAGCG TGCAGGTCGT GCCGCGCGCC GTCATCGAGG ACCAGAACGC ACTGAACCTC
GCCGAAGCGC TCCGCAACGT GTCCGGCGTG CAGTTCGATT TCGGGTTCAA CGGGACGGCC
ATGCCGCTGG TCGTGCTGCG AGGCTTTCCG AGCGTATCGA TGACCGCCAT GGGGCCGATG
TCGGGCAGCT CGACCTACTA CCTGGACGGC ACCAAGGTCA CCGGCGTCCC GATCAACATG
GCCAACGTGC TGGCCGTCGA GGTGATCAAG GGGCCGTCGA GCGTCCTGTA CGGGCGCGCC
GAACCGGGCG GGCTGGTCAA CGTGGTCAGC AAGCCGATCA GCGTCGTGCC CGCCATGAGC
CTGGAGCAGA CCGTCGGGGA GTACGGCCTG TCGCGCACGG CGGTGGAGGC ATCCGGCCCT
CTGAACGAGG AACGCAGTCT GCGTGGCCGG GCGTCCGCCT CGTACTACAC GGCCGATTCC
ATCCGCGACT TCGTCGAGGA CAAGCTCGGC GCCTTCGGCG CCAGCCTGAG CTGGCTGCCC
AGCGCGCAGA CCACCTTGAC GGGAACGCTG GACTACAGCC ACCAGCGCTA CCGCACCGAC
TACGGCGTGC CGGCCTTGGG CGACCGGCCC GCCGATCTGC CGTGGTCGCG GCAGTTCAAC
GACTCGCCGC AGCTCTCCAG CAGCAAGACC ACCACCCTGA AGCTGGAAGG CGAGCACCGC
CTGTCCGAGG CCTGGCAGCT CAAGGGCAAG CTCCTGACCC TGCGCAGCGA CACCTCCGAG
ATGGACATCT CGCCCTATCG CGCCGACTAC GGCATGGGCA TGACGCCCGA CGCCACCTGC
CCGGGCACGG GCAATCCGCT GTGCCGCTAC TACTTCTACG TGCGACCGGA CGGGCGCTAC
CGGCTGGACC AGTTCAACCT CGACCTGATC GGCAAGATCG ACACCGGCGG GATCCAGCAC
ACCGTGTTGC TCGGCGTCGA TGCCTACAGC GGCCGCAAGA CCGGCACGAC CTACTTCCAG
CAGATCGGCT CGGTGGACAT CTACACGCCG GCGCTGGGCA GCACGCCGCC GCTGGACCTG
GGCATGTCCA TGCCGATGGA CATCGAGGAT CGCAACCGCT GGACCAGCAT CTACGTGCAG
GATCAACTGG CCCTGGGCCA GGGCGTCTTC CTGACCGCCG CGTTGCGGCA CGACCGCACC
AGCGCCATCT ACGCCGCCCC GGGCACCGAG CCCAACAAGG CTTCGTTCAC CACGCCACGA
CTCGGTGCGG TCTGGCAGTT CGCCTCCAAC CAGTCGATCT ACGCCCAGTA CCAGGACGCC
GTGTCCGCCA ACAACGGCCG GGACACGGTG ACCGGGGCCG CTCTCAGCGC CGAGCGCGCC
AGGCAGTTCG AGATCGGCCA CAAGATCGAC TGGCTCGACG GCAAGCTCAG TTCGACCCTT
GCGGCGTACG AGCTGACCAA GCGCAACCGT GGGGGCTCGG TCCCGGTCGC GACGCCGCCC
TTCTACAACA CCGTCACGGT GGGCGAAGCC CGCTCCCGTG GCGTCGAATG GGATCTTTCG
GGGCAGGTCT CACGCAGTCT GTCGCTGATC GCCTCCTATG CCTACACCGA CACCCGCGTG
CTGGTCGATC CGACCTACCA GGGCAAGAAG CTGGCCAACG TGGCGCGGCA TACCGGCAGC
CTCTGGGCGC GCTACGCCAT CGACAGCCAG TGGAGCACGG GTGCCGGCGT CTTTGCACAA
GGTCAGCGCC AGGGCGATAC GGGCAATACC TTCCAGCTGC CGGGGTACGG ACGGGTCGAC
GCCATGCTCG CCTACCGCTT CGCGCTGCAG GATGCCCGGG CCGCGCTGCA GTTCAACGTC
GACAACGTGT TCGATCGCAA GTACTACACG GGCAGCCATC AGTTCGTGGC CGACTGGGTC
AAGCTGGGGT CACCGCGGAC AGTCAAGGCG ACGCTGCGGC TGGATTACTA G
 
Protein sequence
MNRLHRVSPL TFACLALAAH AQSEPPSQVL PPVQVQGQRD DYRAPETTTG NRTSTPSLQS 
PQSVQVVPRA VIEDQNALNL AEALRNVSGV QFDFGFNGTA MPLVVLRGFP SVSMTAMGPM
SGSSTYYLDG TKVTGVPINM ANVLAVEVIK GPSSVLYGRA EPGGLVNVVS KPISVVPAMS
LEQTVGEYGL SRTAVEASGP LNEERSLRGR ASASYYTADS IRDFVEDKLG AFGASLSWLP
SAQTTLTGTL DYSHQRYRTD YGVPALGDRP ADLPWSRQFN DSPQLSSSKT TTLKLEGEHR
LSEAWQLKGK LLTLRSDTSE MDISPYRADY GMGMTPDATC PGTGNPLCRY YFYVRPDGRY
RLDQFNLDLI GKIDTGGIQH TVLLGVDAYS GRKTGTTYFQ QIGSVDIYTP ALGSTPPLDL
GMSMPMDIED RNRWTSIYVQ DQLALGQGVF LTAALRHDRT SAIYAAPGTE PNKASFTTPR
LGAVWQFASN QSIYAQYQDA VSANNGRDTV TGAALSAERA RQFEIGHKID WLDGKLSSTL
AAYELTKRNR GGSVPVATPP FYNTVTVGEA RSRGVEWDLS GQVSRSLSLI ASYAYTDTRV
LVDPTYQGKK LANVARHTGS LWARYAIDSQ WSTGAGVFAQ GQRQGDTGNT FQLPGYGRVD
AMLAYRFALQ DARAALQFNV DNVFDRKYYT GSHQFVADWV KLGSPRTVKA TLRLDY