Gene Information |
Locus tag | Mpe_A0688 |
Symbol | |
ID | 4784429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 718881 |
End bp | 721031 |
Gene Length | 2151 bp |
Protein Length | 716 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640089248 |
Product | TonB-dependent receptor protein |
Protein accession | YP_001019885 |
Protein GI | 124265881 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4773] Outer membrane receptor for ferric coprogen and ferric-rhodotorulic acid |
TIGRFAM ID | [TIGR01783] TonB-dependent siderophore receptor |
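The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Both are common conventions but are not stated in this record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 718881
END_BP = 721031
GENE_LENGTH_BP = 2151
PROTEIN_LENGTH_AA = 716


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the table: 721031 - 718881 + 1 = 2151 bp, and 2151 / 3 - 1 = 716 aa.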
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAACCGCC TGCATCGTGT TTCCCCGCTG ACCTTCGCCT GCCTCGCCCT GGCTGCCCAT GCGCAGTCCG AACCGCCTTC CCAGGTCCTG CCGCCGGTCC AGGTGCAGGG CCAGCGCGAC GACTACCGCG CCCCCGAAAC CACCACCGGC AACCGGACGT CCACGCCGTC GCTCCAAAGC CCTCAGAGCG TGCAGGTCGT GCCGCGCGCC GTCATCGAGG ACCAGAACGC ACTGAACCTC GCCGAAGCGC TCCGCAACGT GTCCGGCGTG CAGTTCGATT TCGGGTTCAA CGGGACGGCC ATGCCGCTGG TCGTGCTGCG AGGCTTTCCG AGCGTATCGA TGACCGCCAT GGGGCCGATG TCGGGCAGCT CGACCTACTA CCTGGACGGC ACCAAGGTCA CCGGCGTCCC GATCAACATG GCCAACGTGC TGGCCGTCGA GGTGATCAAG GGGCCGTCGA GCGTCCTGTA CGGGCGCGCC GAACCGGGCG GGCTGGTCAA CGTGGTCAGC AAGCCGATCA GCGTCGTGCC CGCCATGAGC CTGGAGCAGA CCGTCGGGGA GTACGGCCTG TCGCGCACGG CGGTGGAGGC ATCCGGCCCT CTGAACGAGG AACGCAGTCT GCGTGGCCGG GCGTCCGCCT CGTACTACAC GGCCGATTCC ATCCGCGACT TCGTCGAGGA CAAGCTCGGC GCCTTCGGCG CCAGCCTGAG CTGGCTGCCC AGCGCGCAGA CCACCTTGAC GGGAACGCTG GACTACAGCC ACCAGCGCTA CCGCACCGAC TACGGCGTGC CGGCCTTGGG CGACCGGCCC GCCGATCTGC CGTGGTCGCG GCAGTTCAAC GACTCGCCGC AGCTCTCCAG CAGCAAGACC ACCACCCTGA AGCTGGAAGG CGAGCACCGC CTGTCCGAGG CCTGGCAGCT CAAGGGCAAG CTCCTGACCC TGCGCAGCGA CACCTCCGAG ATGGACATCT CGCCCTATCG CGCCGACTAC GGCATGGGCA TGACGCCCGA CGCCACCTGC CCGGGCACGG GCAATCCGCT GTGCCGCTAC TACTTCTACG TGCGACCGGA CGGGCGCTAC CGGCTGGACC AGTTCAACCT CGACCTGATC GGCAAGATCG ACACCGGCGG GATCCAGCAC ACCGTGTTGC TCGGCGTCGA TGCCTACAGC GGCCGCAAGA CCGGCACGAC CTACTTCCAG CAGATCGGCT CGGTGGACAT CTACACGCCG GCGCTGGGCA GCACGCCGCC GCTGGACCTG GGCATGTCCA TGCCGATGGA CATCGAGGAT CGCAACCGCT GGACCAGCAT CTACGTGCAG GATCAACTGG CCCTGGGCCA GGGCGTCTTC CTGACCGCCG CGTTGCGGCA CGACCGCACC AGCGCCATCT ACGCCGCCCC GGGCACCGAG CCCAACAAGG CTTCGTTCAC CACGCCACGA CTCGGTGCGG TCTGGCAGTT CGCCTCCAAC CAGTCGATCT ACGCCCAGTA CCAGGACGCC GTGTCCGCCA ACAACGGCCG GGACACGGTG ACCGGGGCCG CTCTCAGCGC CGAGCGCGCC AGGCAGTTCG AGATCGGCCA CAAGATCGAC TGGCTCGACG GCAAGCTCAG TTCGACCCTT GCGGCGTACG AGCTGACCAA GCGCAACCGT GGGGGCTCGG TCCCGGTCGC GACGCCGCCC TTCTACAACA CCGTCACGGT GGGCGAAGCC CGCTCCCGTG GCGTCGAATG GGATCTTTCG GGGCAGGTCT CACGCAGTCT GTCGCTGATC GCCTCCTATG CCTACACCGA CACCCGCGTG 
CTGGTCGATC CGACCTACCA GGGCAAGAAG CTGGCCAACG TGGCGCGGCA TACCGGCAGC CTCTGGGCGC GCTACGCCAT CGACAGCCAG TGGAGCACGG GTGCCGGCGT CTTTGCACAA GGTCAGCGCC AGGGCGATAC GGGCAATACC TTCCAGCTGC CGGGGTACGG ACGGGTCGAC GCCATGCTCG CCTACCGCTT CGCGCTGCAG GATGCCCGGG CCGCGCTGCA GTTCAACGTC GACAACGTGT TCGATCGCAA GTACTACACG GGCAGCCATC AGTTCGTGGC CGACTGGGTC AAGCTGGGGT CACCGCGGAC AGTCAAGGCG ACGCTGCGGC TGGATTACTA G
Protein sequence | MNRLHRVSPL TFACLALAAH AQSEPPSQVL PPVQVQGQRD DYRAPETTTG NRTSTPSLQS PQSVQVVPRA VIEDQNALNL AEALRNVSGV QFDFGFNGTA MPLVVLRGFP SVSMTAMGPM SGSSTYYLDG TKVTGVPINM ANVLAVEVIK GPSSVLYGRA EPGGLVNVVS KPISVVPAMS LEQTVGEYGL SRTAVEASGP LNEERSLRGR ASASYYTADS IRDFVEDKLG AFGASLSWLP SAQTTLTGTL DYSHQRYRTD YGVPALGDRP ADLPWSRQFN DSPQLSSSKT TTLKLEGEHR LSEAWQLKGK LLTLRSDTSE MDISPYRADY GMGMTPDATC PGTGNPLCRY YFYVRPDGRY RLDQFNLDLI GKIDTGGIQH TVLLGVDAYS GRKTGTTYFQ QIGSVDIYTP ALGSTPPLDL GMSMPMDIED RNRWTSIYVQ DQLALGQGVF LTAALRHDRT SAIYAAPGTE PNKASFTTPR LGAVWQFASN QSIYAQYQDA VSANNGRDTV TGAALSAERA RQFEIGHKID WLDGKLSSTL AAYELTKRNR GGSVPVATPP FYNTVTVGEA RSRGVEWDLS GQVSRSLSLI ASYAYTDTRV LVDPTYQGKK LANVARHTGS LWARYAIDSQ WSTGAGVFAQ GQRQGDTGNT FQLPGYGRVD AMLAYRFALQ DARAALQFNV DNVFDRKYYT GSHQFVADWV KLGSPRTVKA TLRLDY
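The gene and protein sequences above are related through translation table 11. A minimal sketch of that relationship follows. It assumes the sequences are pasted in with the 10-character spacing used in this record. It relies on table 11 assigning the same amino acids to codons as the standard code, and it does not handle the alternative start codons that table 11 allows (not needed here, since the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are illustrative, not part of any database tool.

```python
from itertools import product

# Codon order TCAG x TCAG x TCAG, matching the NCBI genetic-code layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing and line breaks used in the record; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence.
    print(translate("ATGAACCGCC TGCATCGTGT TTCCCCGCTG"))
```

Applied to the first three blocks of the gene sequence, `translate` yields `MNRLHRVSPL`, the first block of the protein sequence. Applied to the final four codons (`CTGGATTACTAG`), it yields `LDY`, which matches the end of the protein. Running `gc_fraction` on the full gene sequence is one way to compare against the GC content listed in the table.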