Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1750 |
Symbol | |
ID | 4784208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 1875858 |
End bp | 1878134 |
Gene Length | 2277 bp |
Protein Length | 758 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640090320 |
Product | putative extracellular solute-binding protein |
Protein accession | YP_001020944 |
Protein GI | 124266940 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGCCGC CGTCGAGTCG CTTCCTCCCC GCGCTGTCCC GGCGGCTGGC GTTGCTGCTG TGCGGCACGG CCCTCGTGGC CGGCTGCAAC AACAGCCCGC TGCCGGCCGG CGAGGCGGCC ACCAACACGC TGTTCACCGG CTTCCAGGAG AAATCGCCGC GCCACCTGGA CCCGACCGCG TCGTACTCGA ACGACGAGAC CAAGATCACC TACCAGGTCT ACGAACCGCT CTACGGCTAC CACTACCTGA AGCGGCCCTA TCAGCTGGTG CCCAAGGTCG CGGTGGCCGT CGTCGCCCCG AAATACTTCG ACAAAGCCGG CCAGCCGCTG CCCGACGACG CGCCGGGCGA GCAGGTCGCA CTCAGCGTCT ACGAGGTGCC GCTGAAGCAC GGCGTGCGCT ACGCACCGCA CCCGGCGTTC GCCAAGGACG GGCAGGGCCG CTACCGCTAC CACACCGACC ATGCGCTGAC GCGTGCCGAA CTCGGCGATC GCCACTCGCC GCTGGATTTC GAGCACCAGG GGACGCGCGA GCTGGTGGCC GACGACTTCG TCTATGCCAT CAAGCGCCAT GCCAGCACCC GGGTGCAGGC GCCGGTGTTC TCGGTGTTCT CGGCTTATGT GCTGGGGCTC AAGGACTACG GCGCGCTGCT GCACGCCGAG GACAGCAAGC TGCTCGCCGG CCTGCCCGCG TCGGCGCTCG ACAAGCCATT CCTTGACCTG CGCAGGTTCC CGCTGGCCGG CGCGGAGGCG CCGGACCCGC ACACGCTGCG CATCCGCATC CTCGGCAAGT ACCCGCAGTG GCCGTACTGG ATGGCGATGA CCTTCCTTGC CCCTATCCCC TGGGAGGCCG ATGCCTTCTA TGCCCAGCCC GGCATGGCGG GCCAGGGCAT GAGCCTGGAG ACCTGGCCGG TCGGCACCGG CCCCTACATG GCGACGGTCT ACGAGCAGGA CCGCCGCCAC GTGCTGGAGC GCAACCCGAA CTACCGGCAG GACGACCTGT ATCCCTGCGA GGGCGCGCCG GACGATACGC CCGAGCTGCT GGCCGACTGC GGCCAACGCA TGCCCTTCGT CGACCGCCTC GTGTTCCGTG CCGAGAAGGA GAAGGTGCCG ATCAAGTCGA AGTTCATCCA GGGCTATTCC GACGTGCCGG AGATCGAGCG CCCCGAGTGG GGCATCGAGT TCCATGCCGA CGCCGACGAC TCCGAGGCGA CGCGGCGGCT GTTCGCCGAG CGCGGCTTCC GCTTCCCGCG CGCGGTCGAC GTGTCGAACT GGTACGTCGG CTTCAACTGG CTCGATCCGG TGATCGGCAA GGGCGACACG CCGGAGCAGC AGGTGAAGAA CCGCAAGCTG CGGCAGGCGC TGTCGATCGC GATCGACTGG GAGGAGTACG TGCGCGTGTT TCCGAACAAG GGCGGCGAGC CGGCCCACGG CCCGCTGCCG GCCGGCATGT TCGGCTCGCG CCACGGCACG CCGGCCGGCT TCAACCCGGT CACGCATGTG CAGGTGAACG GCGGGATCAA GCGGCGACCG CTCGCCGATG CCGAGCGCCT GATCGCCGAG GCCGGCTATC CGGGCGGTCG CGACGCGACG AGCGGTCGGC CGCTGGTGCT GAACTACGAC TACCAGCGCA TCCCGACGCC CGAGCTCAAG GCCGAGATGG ACTGGATGGT CAAGCAGTTC GCGAAGCTCG GCGTCACGCT GGAGATCCGC GCGACCGACT ACAACCAGTT CCAGGACAAG ATGCGCAAGG GTCGGCAGCA GGTGTTCTGG TGGGGCTGGC TGGCCGATTA CCCGGACGCC GAGAACTTCC TGTTCCTGCT CTACGGCCCG AACGCCAAGG CCGGGAACGA CGGCGAGAAC GCCGCCAACT ACGCCAACGC CGAGTACGAC CGGCGCTACG AGCGCCTGCG CCTGCTCGAC GACGGGCCGC AGAAGCAGCA GCTGATCGAC GAGATGGTGG CCCTGCTGCG CGAGGACGCG CCCTGGACCT TCGGCTTCTT CCCGTACTCG GCCAGCGCCT TCCAGCCCTG GGTGCACAAC GGCAAGCCCG GCGTGATGGT GCGCGACATG GCGCGCTACT ACCGGGTCGA CCCGGCGCTG CGCGTCGCGA AGCAGGCCGA GTGGAACCGG CCGCAGTGGT GGCCGCTGGG ATTGATGGCG TTGGCCGCGC TCGCCGTGGC TTGGCTGGCG CGCCGGGTGT TCATGGCCCG CGAGCGCAGC ACCGCGCGAG GCCGCAGGGC CACCGGCGCG CGCGCGGGGG AGGGCGCCGG AGCATGA
|
Protein sequence | MSPPSSRFLP ALSRRLALLL CGTALVAGCN NSPLPAGEAA TNTLFTGFQE KSPRHLDPTA SYSNDETKIT YQVYEPLYGY HYLKRPYQLV PKVAVAVVAP KYFDKAGQPL PDDAPGEQVA LSVYEVPLKH GVRYAPHPAF AKDGQGRYRY HTDHALTRAE LGDRHSPLDF EHQGTRELVA DDFVYAIKRH ASTRVQAPVF SVFSAYVLGL KDYGALLHAE DSKLLAGLPA SALDKPFLDL RRFPLAGAEA PDPHTLRIRI LGKYPQWPYW MAMTFLAPIP WEADAFYAQP GMAGQGMSLE TWPVGTGPYM ATVYEQDRRH VLERNPNYRQ DDLYPCEGAP DDTPELLADC GQRMPFVDRL VFRAEKEKVP IKSKFIQGYS DVPEIERPEW GIEFHADADD SEATRRLFAE RGFRFPRAVD VSNWYVGFNW LDPVIGKGDT PEQQVKNRKL RQALSIAIDW EEYVRVFPNK GGEPAHGPLP AGMFGSRHGT PAGFNPVTHV QVNGGIKRRP LADAERLIAE AGYPGGRDAT SGRPLVLNYD YQRIPTPELK AEMDWMVKQF AKLGVTLEIR ATDYNQFQDK MRKGRQQVFW WGWLADYPDA ENFLFLLYGP NAKAGNDGEN AANYANAEYD RRYERLRLLD DGPQKQQLID EMVALLREDA PWTFGFFPYS ASAFQPWVHN GKPGVMVRDM ARYYRVDPAL RVAKQAEWNR PQWWPLGLMA LAALAVAWLA RRVFMARERS TARGRRATGA RAGEGAGA
|
| |