Gene Mpe_A3400 details

Gene Information

Locus tag: Mpe_A3400
Symbol:
ID: 4786330
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3614285
End bp: 3615466
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 72%
IMG OID: 640091976
Product: hypothetical protein
Protein accession: YP_001022588
Protein GI: 124268584
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
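
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for a CDS whose length includes one terminal stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values taken from the record: Start bp 3614285, End bp 3615466.
length = gene_length_bp(3614285, 3615466)
print(length, protein_length_aa(length))  # prints: 1182 393
```

Both results agree with the Gene Length (1182 bp) and Protein Length (393 aa) fields.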


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0535164
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGACCG CTCCCCGCCG CATTCTGCCG ATCATCGTCG CCGCCCAGTT CGCGAGCACT 
TCGCTATGGT TCGCCGTCAA TGCGGTGATG CCCGATCTTC AGCACGCGCT CGGTCTGCCG
CCTACCGCTG TCGGCTCGTT GACTTCGGCG GTGCAACTGG GCTTCATCGC CGGCACGCTG
GTGTTCGCGC TGGGGTCGGT CGCCGACCGC CATTCGCCGC GCCTCGTGTT CCTGCTGTGC
GCGATCTCCG GTGCACTGTT CAACGCCGCG CCCGTGCTCT GGCCGGCAGG CGCGCTGGAC
CTGCCGACGC TGTGGCTGCT GCGGGCCTTG ACCGGCTTTG CGCTGGCCGG CATCTACCCG
GTCGGCATGA AGATCGCCGC TGGCTGGTAT CCGCAGGGGC TGGGCCGTGC GCTGGGCTGG
CTGATCGGCG CGCTGGTTCT GGGCACCGCG ACGCCGCATG CGCTGCGGGC CTTGGGCGCG
CACTGGCCCT GGCAGCAGGT GATGCTGGTG GTGTCCGGGC TTGCGGTGCT GGGGGGCTGG
ATGGTGTGGC AGTGGGTGCC GGAGAGCACC CATCTGCCCC GTGCCACGCG CCTGCAGTGG
CGTGCCTTCG GCCGCGTGCT GACCGACCGC CGCCTGCGTG CACCCGTGTT CGGCTACTTC
GGCCACATGT GGGAGCTCTA CACCTTGTGG GTGTTGCTGC CGGCGCTGCT GGCCACGCGG
CTCCAGGGGG CAGCCATCTC GGTGGGTGCG TTCGGCGTCA TCGCCGCGGG GGCGGTCGGG
TGCGTCGCGG GGGGCTGGCT GGCGCAGCGC CACGGCAGTG CGCGCGTGGC GGCGGGGCAG
CTGGCGCTGA GCGGCCTGTG CTGCGTGCTC ACGCCGCTCG CGATGGCAGC GCCCGCCGCG
GTGTTCGCGG CGTGGCTGCT GGTGTGGGGC ATCAGCGTGG CCGGCGACTC GCCGCAATTC
TCGGCGCTAA CGGCCACGCA TGCCCCGCGC GAGGCGGTCG GCAGCGTGCT GACGCTGGTC
AACTGCATCG GCTTCTCGAT CTCCATCCTG AGCATCCAGG GCTTCGTGGC GCTGGCACAG
CACCACGATC TGTCGCTGTT GCTGCCGTGG CTCGGTCTCG GGCCGCTGCT CGGCCTGTGG
GGCCTGCGGC CCTTGCTGGT CCGCGAGGGC TCGTCGCCCT GA
 
Protein sequence
MPTAPRRILP IIVAAQFAST SLWFAVNAVM PDLQHALGLP PTAVGSLTSA VQLGFIAGTL 
VFALGSVADR HSPRLVFLLC AISGALFNAA PVLWPAGALD LPTLWLLRAL TGFALAGIYP
VGMKIAAGWY PQGLGRALGW LIGALVLGTA TPHALRALGA HWPWQQVMLV VSGLAVLGGW
MVWQWVPEST HLPRATRLQW RAFGRVLTDR RLRAPVFGYF GHMWELYTLW VLLPALLATR
LQGAAISVGA FGVIAAGAVG CVAGGWLAQR HGSARVAAGQ LALSGLCCVL TPLAMAAPAA
VFAAWLLVWG ISVAGDSPQF SALTATHAPR EAVGSVLTLV NCIGFSISIL SIQGFVALAQ
HHDLSLLLPW LGLGPLLGLW GLRPLLVREG SSP
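
The sequences above are laid out in blocks of ten characters, so they need cleaning before use. The following is a minimal sketch of how that might be done, together with a GC-content calculation and a translation under NCBI translation table 11. The helper names are hypothetical, and the translation handles only the standard amino-acid assignments; it does not convert alternative start codons to methionine, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, codons ordered TTT, TTC, TTA, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First display line of the gene sequence from the record.
first_line = "ATGCCGACCG CTCCCCGCCG CATTCTGCCG ATCATCGTCG CCGCCCAGTT CGCGAGCACT"
dna = clean_sequence(first_line)
print(len(dna), translate(dna))  # prints: 60 MPTAPRRILPIIVAAQFAST
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence through the same functions should reproduce the whole protein and a GC content close to the 72% reported in the record.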