Gene Mpe_A1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1044 
Symbol 
ID4785647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1116292 
End bp1117479 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content75% 
IMG OID640089606 
Productnodulation protein NolF 
Protein accessionYP_001020240 
Protein GI124266236 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID[TIGR01730] RND family efflux transporter, MFP subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.991845 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGCC GGGCGGCGTG GTGGATCGGC GCCGCCGTCG TGCTGCTGCT GGTCGGCGCG 
GCGCTGGGCG GCGGCCTGCT GGCACGCAAG GCCGAGCAGG GCCGCATGCT GGCGCGCCCT
GCCGACGTGG CCCTCGAACT CGCGCCGCTG GACGTGGCCC GGGTTCAGCC GCGCGAGCTG
CTGCGCACGC TCGAGATCAC CGGCGCACTG AAGGCGGTGA ATTCCGCGGT GGTGAAGGCC
AAGGTGGCGG GCGAGGTGCA GCAGCTCAGC GTGCGCGAGG GCGATCGTGT CACGGCCGGC
CAAGTGCTGG GTCGAATCGA CGCGACCGAG TACGCCTGGA AGCTGCGCCA GGCGCAGGAG
CAGTCGGCCC AGGCCGGCGC GCAGCTCGAC ATCGCCGAGC GCGCGCTGGA GAACAACCGC
GCCCTGGTGA ATCAGGGCTT CATCTCGAAG AACGCGCTCG ACACGTCGGT CTCGAACGCG
GCCGCCGCAC GTGCGGCGTT GCAGGCGGCC CGGGCGGCTG CCGAACTGGC GCGCAAGGCC
CAGAACGACA CGGTGCTGCG TGCGCCGATC GCCGGCGAGG TGTCGCAGCG CGCTGCGCAG
CCCGGTGAAC GCGTGGCGGT CGACGCGAAG CTGGTCGAGA TCGTCGATCT GTCGCGGATC
GAGCTCGAGG CGGCGGTCGC GCCCGAGGAC GTGGGCGCGG TACGCATCGG AGCGGCCGCC
CGCCTGCAGG TCGACGGCAT CGCGGCGCCG GTGCTGGCGC GCGTGGCCCG CATCAACCCG
GCCGCCCAGG CCGGCACGCG TGCGGTAATG GTCTATCTCG CGGTCGAGCC CCAGCCCGGC
CTGCGCCAGG GCCTGTTCGC CAAGGGGCGC ATCGAGCTGG AGCGCCGCAC CGCGAGCAGC
GTGCCGGCGA CGGCCGTGCG CATCGATCAG GCACGCCCCT ATGTGCTGGC GGTCGAGGAC
GGCCGGGTGG TGCAGCACGG CGTCGAACTG GGCCTGCGGG CGGACGCCGC GGCCCGCGGC
GACGAGGCGC TGGTCGAGGT GAACTCCGGC ATCGCCGACG GCAGCACGGT GCTGCGCGGC
ACCGTGGGTG CGGTGCGCGA GGGCACCCGG GTGCGGCTGC CAGCGCCCGG CGAGGCGCCC
GCGCCGGCCA CCGCGTCGGC CTCTTCGGGC GCCACCACGA CGCGCTGA
 
Protein sequence
MKRRAAWWIG AAVVLLLVGA ALGGGLLARK AEQGRMLARP ADVALELAPL DVARVQPREL 
LRTLEITGAL KAVNSAVVKA KVAGEVQQLS VREGDRVTAG QVLGRIDATE YAWKLRQAQE
QSAQAGAQLD IAERALENNR ALVNQGFISK NALDTSVSNA AAARAALQAA RAAAELARKA
QNDTVLRAPI AGEVSQRAAQ PGERVAVDAK LVEIVDLSRI ELEAAVAPED VGAVRIGAAA
RLQVDGIAAP VLARVARINP AAQAGTRAVM VYLAVEPQPG LRQGLFAKGR IELERRTASS
VPATAVRIDQ ARPYVLAVED GRVVQHGVEL GLRADAAARG DEALVEVNSG IADGSTVLRG
TVGAVREGTR VRLPAPGEAP APATASASSG ATTTR