Gene Mpe_A1021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1021 
Symbol 
ID4785623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1086944 
End bp1089280 
Gene Length2337 bp 
Protein Length778 aa 
Translation table11 
GC content69% 
IMG OID640089583 
Productputative ABC transporter 
Protein accessionYP_001020218 
Protein GI124266214 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.338231 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTCTTC GAGAACAAAT CAGAATGCAT CATCACCATC CAGTCGGCGT CCCAGCGGAC 
GGGGCCTCCC CCGCGCGCAT CGCGCTCCAG TCGCAGCTCG CACCGGAAGA GAACGTGCTG
GCGACGCTGG AGGTTGACCT GGACGCACGG CTGCATTTCG GCGCCGGGCT GCTGGCGCTC
ACCGACCGCC GCCTGCTGGT CCGCGAACCG GGCGCCGCCG CCTGGCAGGT CTGGCCGCTG
GACAGCGACC TGACGCTGCG CCACACCGAC CACGCCGGGG TCGGCAGCCT GGAACTGCAC
GATCCGGCAA CCCGGCGCGC CGGCTGGCGC TTCACGCTGC GTCACAACGT GCAGGCGCTG
CGCCTGCTGA AACTGTTCGA ATTGCAACAG ACCCGCCAGC GCACCGGCGA GTCGCCCGGC
GACGACGGCA CGCTCTGCCC CAGCTGCCAG GCGCCGCTGC CGCCCGACAG CGACGACTGC
CCGCTGTGCG CCCGGGAGCT GAACACCCCG CCCTCGACCT GGGTGCTGCT GCGGCTGTGG
CGCTTCGCCC GGCCCTACCG CAAGGAGCTG GCGATCGGCT TCGGGCTGAC GCTGGCCTCG
ACCGCGGCGA CGCTGGTGCC GCCCTACCTC ACCATCCCGC TGATGGACGA GGTGCTGATC
CCGTTCCAGA ACGGCCAGCA GATCGACCCA AAACTGGTGC TGATGCTGCT GGGCGGCCTG
CTGGCGGCAG CGCTGGCGGC CTGGGGCCTG AGCTGGGCGC GCACCTACCT GCTGGCGCTG
GTGTCCGAGC GCATCGGCGC CGATCTGCGC ACCACCACCT TCGAGCACCT GCTGCGGCTT
TCGCTCGACT ACTTCGGTGG CAAGCGCACC GGCGACCTGA TGGCGCGCAT CGGCTCCGAG
ACCGACCGCA TCTGCGTCTT CCTGTCGCTG CACGCGCTGG ATTTCGCGAC CGACGTGCTG
ATGATCGGCA TGACCGCGGT GATCCTGGCC TCGATCAACC CCTGGCTGGC GCTGGTGACC
CTGCTGCCGC TGCCCTTCAT CGCCTGGATG ATCCACCTGG TGCGCGACCG TCTGCGCACC
GGCTTCGAGA AGATCGACCG CGTGTGGTCC GAAGTGACCA ATGTGCTGGC CGACACGATC
CCCGGCATTC GCGTGGTGAA GGCCTTCGCC CAGGAACAGC GCGAGGCCGA CCGCTTCCGC
GACGCGAACC AGCACAACCT CGCGGTCAAC GACCGGCTCA ACAAGACCTG GAGCCTGTTC
TCGCCGACCG TGTCGCTGCT GACAGAGGTG GGCCTGCTGG TGGTGTGGGC CTTCGGCATC
TGGCAGGTCA GCAAGAACGA GATCACGGTC GGCGTGCTGA CCGCCTTCAT CGCCTACATC
GGCCGCTTCT ACGGCCGGCT CGACTCCATG AGCCGCATCG TCTCGGTCAC GCAGAAGGCG
GCCGCCGGGG CCAAGCGCAT CTTCGACATC CTCGACCATG TGTCGAACGT GCCCGAGCCT
GCCCATCCGC AGCCGATCGG TCGGCTGCAG GGCCGCATCG AGCTGGCCGA CATCGGCTTC
CGCTACGGCT CACGCACCGT GATCCGCGGG CTGCAACTCG ACATCCGACC GGGCGAGATG
ATCGGCCTGG TGGGCCACAG CGGCTCCGGC AAGAGCACGC TGGTCAACCT GATCTGCCGC
TTCTACGACG TGACCGATGG CGCGATCAAG GTCGACGGCA CCGACATCCG CCGCTTCGGC
GTGGCCGACT ACCGGCGCCA CATCGGCCTG GTGCTGCAGG AGCCCTTCCT GTTCTTCGGC
ACGATCGCCG AGAACATCGC CTACGGCAAG CCGGGCGCCA CGCGCGCCGA GATCGTGGCC
GCGGCCCGCG CCGCGCATGC GCACGAGTTC ATCCTGCGGC TGCCGCAGGG CTACGACTCG
CTGGTCGGCG AGCGCGGCCA GGGTCTCTCG GGCGGCGAGC GCCAGCGCAT CAGCATCGCG
CGCGCGCTGC TGATCGACCC GCGCATCCTG ATCCTCGACG AGGCCACCTC CTCGGTCGAT
ACCGAGACCG AGAAGGAAAT CCAGAAGGCG CTCGACAACC TGGTGCAGGG CCGCACCACC
ATTGCCATCG CGCACCGCCT TTCCACGCTG CGCAAGGCCG ACCGGCTGGT GGTGATGGAC
CGCGGCCGCA TCGTCGAGGT GGGTCCGCAC GACGCGCTGA TGGCGCAGCG CGGTGCCTAC
TGGCGGCTCT ACGAGGCGCA GCTGCGCCGC GTCGAAGGCG ACGACGGCGA GGCGGCGCTG
GACCAGCCTT CGCCGTCGAC ACTGTCGCCG TCGGCCCACC CGACCGAACT CGCATGA
 
Protein sequence
MGLREQIRMH HHHPVGVPAD GASPARIALQ SQLAPEENVL ATLEVDLDAR LHFGAGLLAL 
TDRRLLVREP GAAAWQVWPL DSDLTLRHTD HAGVGSLELH DPATRRAGWR FTLRHNVQAL
RLLKLFELQQ TRQRTGESPG DDGTLCPSCQ APLPPDSDDC PLCARELNTP PSTWVLLRLW
RFARPYRKEL AIGFGLTLAS TAATLVPPYL TIPLMDEVLI PFQNGQQIDP KLVLMLLGGL
LAAALAAWGL SWARTYLLAL VSERIGADLR TTTFEHLLRL SLDYFGGKRT GDLMARIGSE
TDRICVFLSL HALDFATDVL MIGMTAVILA SINPWLALVT LLPLPFIAWM IHLVRDRLRT
GFEKIDRVWS EVTNVLADTI PGIRVVKAFA QEQREADRFR DANQHNLAVN DRLNKTWSLF
SPTVSLLTEV GLLVVWAFGI WQVSKNEITV GVLTAFIAYI GRFYGRLDSM SRIVSVTQKA
AAGAKRIFDI LDHVSNVPEP AHPQPIGRLQ GRIELADIGF RYGSRTVIRG LQLDIRPGEM
IGLVGHSGSG KSTLVNLICR FYDVTDGAIK VDGTDIRRFG VADYRRHIGL VLQEPFLFFG
TIAENIAYGK PGATRAEIVA AARAAHAHEF ILRLPQGYDS LVGERGQGLS GGERQRISIA
RALLIDPRIL ILDEATSSVD TETEKEIQKA LDNLVQGRTT IAIAHRLSTL RKADRLVVMD
RGRIVEVGPH DALMAQRGAY WRLYEAQLRR VEGDDGEAAL DQPSPSTLSP SAHPTELA