Gene Mpe_A1097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1097 
Symbol 
ID4783701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1175785 
End bp1177371 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content68% 
IMG OID640089660 
Productputative binding-dependent transport protein (periplasmic) 
Protein accessionYP_001020293 
Protein GI124266289 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGACCT TCCTTCGAGC CGCAACGTTG CTGCTGCTGG CCGCGCTGGC CACAGCCGGC 
TCGGCGCAGA CGCTGCGCTG GGCCAGCCAG GGCGATGTCC AGACCCTGGA CCCCCACTCC
CAGAACGAGT CGCTGACCAA CCAGATCAAC GGCCAGGTCT ACGAGTCGCT GGTCAATCGC
GACCGGCATC TCGCCATCGA GCCGGTGCTC GCCGCCGAGT GGCAGCAGAT CTCGCCGACC
GTGTGGCGGC TGAAGCTGCG GCCCCAGGTC CGCTTCCACG ACGGCAGCCC GTTCACCGCG
GACGACGTGG TGTTCTCGGT GCTGCGGGCG CGCGATCCCG CGTCGGCGAT CCGCGTCTAC
GCCAGCGCGT TGGGCGAGCC GAAGAAGCTC GACGCTCTGA CGGTGGAGTT CCGGCTGGCG
CAGGTGAACC CGATCTTCCT GCAGCACCTC AGCACGGTGC AGATCATGAG CCGCGCCTGG
AGCGAGAAGC ACGGTGCCGC CCGGCCGCTG GACTTCAAGA ACAAGCAGGA AGGTCATGCA
TCGCTGAACA CCAACGGCAC CGGCCCGTTC ATGCTGGTCT CGCGCCAGCC GGACGTGAAG
ACGGTCTACA AGCGCAACCC GGCCTGGTGG GGCCGCTTCG CGGGCAACCT GCAGCAGGTG
GTCTACACGC CGGTGAAGAG CGATGCGACG CGCTCTGCCG CGCTGATCTC CGGCGAGCTC
GACTTCGTGC TCGACCCGGC GCCGCAGGAC ATCGCCCGGC TGCGCAACAC GGCCGGCGTG
CAGGTCGTCG ACGGACCGGA GAATCGCGTC CTGTTCATCG GCATGGATCA GGCGCGCGAC
AAGCTGTTGT ACGGTCGGGT GCCCGGCGAC CGCAACCCCT TCAAGGACGT GCGGGTGCGC
AAGGCGCTTT ACCAGGCCAT CGACATCGAG ACGATCAAGA CGAAGCTGAT GCGTGGGCAG
GCCTTGCCCA CCGGCGGCAT CACGCCGTCT CCGCTGGGCG CCTACAACGA TGCGGCGCTG
GAGAAGCGCC TGCCGTTCGA CCTGGCGGCG GCCCGGCAGC TGATGCAGGC CGCAGGCTAC
CCCGAGGGCT TCGGGGTCAC GCTGGACTGC CCGAACAACC GCTACATCAA CGACGAGGAG
ATCTGCCTCG CGCTGGCGGC CATGTGGTCG CAGCTGAAGG TGAAGGTGCA GGTCAACGCG
ATGCCGCGCA GCACCTATTT CGCGAAGCTG GAGAAGCTGG ACACCTCGCT GTACCTGCTG
GGCTGGGGCG GTTCGATCAC CGATGCCGAG ACCACGCTGA CGCCGGTGCT GCGCAGCCGC
ATCCCGGGCG ATACGGGCGG TGTCGGGAGC TGGAACTTCG GCGGCGTCAA GGATGCCAGG
CTCGACGAAC TGGCCGCGGC GTCCAGCAGC GAGGCCGATC CGAAGAAGCG CGAGGCACTG
GTCAAGGCGG CGCTGGCCCG CCACAACGAG CTGGTGCTGC ACCTGCCGCT GCACCGGCAG
ATCGTGCCGT GGGCGGCGCG CAGCAACGTG ACGGTGGTCC ACCGCGCCGA CAACTGGCTC
GAGTGGTCGT GGATCAGTGT CAAGTGA
 
Protein sequence
MPTFLRAATL LLLAALATAG SAQTLRWASQ GDVQTLDPHS QNESLTNQIN GQVYESLVNR 
DRHLAIEPVL AAEWQQISPT VWRLKLRPQV RFHDGSPFTA DDVVFSVLRA RDPASAIRVY
ASALGEPKKL DALTVEFRLA QVNPIFLQHL STVQIMSRAW SEKHGAARPL DFKNKQEGHA
SLNTNGTGPF MLVSRQPDVK TVYKRNPAWW GRFAGNLQQV VYTPVKSDAT RSAALISGEL
DFVLDPAPQD IARLRNTAGV QVVDGPENRV LFIGMDQARD KLLYGRVPGD RNPFKDVRVR
KALYQAIDIE TIKTKLMRGQ ALPTGGITPS PLGAYNDAAL EKRLPFDLAA ARQLMQAAGY
PEGFGVTLDC PNNRYINDEE ICLALAAMWS QLKVKVQVNA MPRSTYFAKL EKLDTSLYLL
GWGGSITDAE TTLTPVLRSR IPGDTGGVGS WNFGGVKDAR LDELAAASSS EADPKKREAL
VKAALARHNE LVLHLPLHRQ IVPWAARSNV TVVHRADNWL EWSWISVK