Gene Mpe_A1750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1750 
Symbol 
ID4784208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1875858 
End bp1878134 
Gene Length2277 bp 
Protein Length758 aa 
Translation table11 
GC content70% 
IMG OID640090320 
Productputative extracellular solute-binding protein 
Protein accessionYP_001020944 
Protein GI124266940 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGCCGC CGTCGAGTCG CTTCCTCCCC GCGCTGTCCC GGCGGCTGGC GTTGCTGCTG 
TGCGGCACGG CCCTCGTGGC CGGCTGCAAC AACAGCCCGC TGCCGGCCGG CGAGGCGGCC
ACCAACACGC TGTTCACCGG CTTCCAGGAG AAATCGCCGC GCCACCTGGA CCCGACCGCG
TCGTACTCGA ACGACGAGAC CAAGATCACC TACCAGGTCT ACGAACCGCT CTACGGCTAC
CACTACCTGA AGCGGCCCTA TCAGCTGGTG CCCAAGGTCG CGGTGGCCGT CGTCGCCCCG
AAATACTTCG ACAAAGCCGG CCAGCCGCTG CCCGACGACG CGCCGGGCGA GCAGGTCGCA
CTCAGCGTCT ACGAGGTGCC GCTGAAGCAC GGCGTGCGCT ACGCACCGCA CCCGGCGTTC
GCCAAGGACG GGCAGGGCCG CTACCGCTAC CACACCGACC ATGCGCTGAC GCGTGCCGAA
CTCGGCGATC GCCACTCGCC GCTGGATTTC GAGCACCAGG GGACGCGCGA GCTGGTGGCC
GACGACTTCG TCTATGCCAT CAAGCGCCAT GCCAGCACCC GGGTGCAGGC GCCGGTGTTC
TCGGTGTTCT CGGCTTATGT GCTGGGGCTC AAGGACTACG GCGCGCTGCT GCACGCCGAG
GACAGCAAGC TGCTCGCCGG CCTGCCCGCG TCGGCGCTCG ACAAGCCATT CCTTGACCTG
CGCAGGTTCC CGCTGGCCGG CGCGGAGGCG CCGGACCCGC ACACGCTGCG CATCCGCATC
CTCGGCAAGT ACCCGCAGTG GCCGTACTGG ATGGCGATGA CCTTCCTTGC CCCTATCCCC
TGGGAGGCCG ATGCCTTCTA TGCCCAGCCC GGCATGGCGG GCCAGGGCAT GAGCCTGGAG
ACCTGGCCGG TCGGCACCGG CCCCTACATG GCGACGGTCT ACGAGCAGGA CCGCCGCCAC
GTGCTGGAGC GCAACCCGAA CTACCGGCAG GACGACCTGT ATCCCTGCGA GGGCGCGCCG
GACGATACGC CCGAGCTGCT GGCCGACTGC GGCCAACGCA TGCCCTTCGT CGACCGCCTC
GTGTTCCGTG CCGAGAAGGA GAAGGTGCCG ATCAAGTCGA AGTTCATCCA GGGCTATTCC
GACGTGCCGG AGATCGAGCG CCCCGAGTGG GGCATCGAGT TCCATGCCGA CGCCGACGAC
TCCGAGGCGA CGCGGCGGCT GTTCGCCGAG CGCGGCTTCC GCTTCCCGCG CGCGGTCGAC
GTGTCGAACT GGTACGTCGG CTTCAACTGG CTCGATCCGG TGATCGGCAA GGGCGACACG
CCGGAGCAGC AGGTGAAGAA CCGCAAGCTG CGGCAGGCGC TGTCGATCGC GATCGACTGG
GAGGAGTACG TGCGCGTGTT TCCGAACAAG GGCGGCGAGC CGGCCCACGG CCCGCTGCCG
GCCGGCATGT TCGGCTCGCG CCACGGCACG CCGGCCGGCT TCAACCCGGT CACGCATGTG
CAGGTGAACG GCGGGATCAA GCGGCGACCG CTCGCCGATG CCGAGCGCCT GATCGCCGAG
GCCGGCTATC CGGGCGGTCG CGACGCGACG AGCGGTCGGC CGCTGGTGCT GAACTACGAC
TACCAGCGCA TCCCGACGCC CGAGCTCAAG GCCGAGATGG ACTGGATGGT CAAGCAGTTC
GCGAAGCTCG GCGTCACGCT GGAGATCCGC GCGACCGACT ACAACCAGTT CCAGGACAAG
ATGCGCAAGG GTCGGCAGCA GGTGTTCTGG TGGGGCTGGC TGGCCGATTA CCCGGACGCC
GAGAACTTCC TGTTCCTGCT CTACGGCCCG AACGCCAAGG CCGGGAACGA CGGCGAGAAC
GCCGCCAACT ACGCCAACGC CGAGTACGAC CGGCGCTACG AGCGCCTGCG CCTGCTCGAC
GACGGGCCGC AGAAGCAGCA GCTGATCGAC GAGATGGTGG CCCTGCTGCG CGAGGACGCG
CCCTGGACCT TCGGCTTCTT CCCGTACTCG GCCAGCGCCT TCCAGCCCTG GGTGCACAAC
GGCAAGCCCG GCGTGATGGT GCGCGACATG GCGCGCTACT ACCGGGTCGA CCCGGCGCTG
CGCGTCGCGA AGCAGGCCGA GTGGAACCGG CCGCAGTGGT GGCCGCTGGG ATTGATGGCG
TTGGCCGCGC TCGCCGTGGC TTGGCTGGCG CGCCGGGTGT TCATGGCCCG CGAGCGCAGC
ACCGCGCGAG GCCGCAGGGC CACCGGCGCG CGCGCGGGGG AGGGCGCCGG AGCATGA
 
Protein sequence
MSPPSSRFLP ALSRRLALLL CGTALVAGCN NSPLPAGEAA TNTLFTGFQE KSPRHLDPTA 
SYSNDETKIT YQVYEPLYGY HYLKRPYQLV PKVAVAVVAP KYFDKAGQPL PDDAPGEQVA
LSVYEVPLKH GVRYAPHPAF AKDGQGRYRY HTDHALTRAE LGDRHSPLDF EHQGTRELVA
DDFVYAIKRH ASTRVQAPVF SVFSAYVLGL KDYGALLHAE DSKLLAGLPA SALDKPFLDL
RRFPLAGAEA PDPHTLRIRI LGKYPQWPYW MAMTFLAPIP WEADAFYAQP GMAGQGMSLE
TWPVGTGPYM ATVYEQDRRH VLERNPNYRQ DDLYPCEGAP DDTPELLADC GQRMPFVDRL
VFRAEKEKVP IKSKFIQGYS DVPEIERPEW GIEFHADADD SEATRRLFAE RGFRFPRAVD
VSNWYVGFNW LDPVIGKGDT PEQQVKNRKL RQALSIAIDW EEYVRVFPNK GGEPAHGPLP
AGMFGSRHGT PAGFNPVTHV QVNGGIKRRP LADAERLIAE AGYPGGRDAT SGRPLVLNYD
YQRIPTPELK AEMDWMVKQF AKLGVTLEIR ATDYNQFQDK MRKGRQQVFW WGWLADYPDA
ENFLFLLYGP NAKAGNDGEN AANYANAEYD RRYERLRLLD DGPQKQQLID EMVALLREDA
PWTFGFFPYS ASAFQPWVHN GKPGVMVRDM ARYYRVDPAL RVAKQAEWNR PQWWPLGLMA
LAALAVAWLA RRVFMARERS TARGRRATGA RAGEGAGA