Gene Mpe_A1745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1745 
Symbol 
ID4784203 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1871443 
End bp1872783 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content70% 
IMG OID640090315 
Productputative membrane transport protein 
Protein accessionYP_001020939 
Protein GI124266935 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.27935 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGCT TCAGCCACGC CCAGCCCCAC CTGCCGTCGT CCAGCGCCGC CGAGCGCGAC 
GCGCGGCAGC GCTCCGCCGA TCACCAGAAG GTCGCTCCCG GCGAGATCGC GGTGGGCGTG
GTCATCGGCC GCGCGTCCGA GTACTTCGAC TTCTTCGTCT ACGGCATCGC CTCGGTGCTG
GTGTTCCCGG CGTTGTTCTT CCCCTTCGTC GACCGCCTGC AGGGCACGCT GTACGCCTTC
GTGGTGTTCT CGTTCGCCTT CATCGCGCGG CCGCTCGGCA CCGTGCTGTC GATGGCCATC
CAGCGGCGCT GGAGCCGCGG CACCAAGCTC ACGGTGGCGC TGTTCCTGCT GGGCACCTCG
ACCGCCGGCA TCGCCTTCCT GCCCGGCTAC CAGACGCTGG GCGCGGCCTC GATCGTGCTG
CTGGCGCTGT TCCGCATCGG CCAGGGCGTG GCGCTGGGCG GGTCCTGGGA TGGGCTGCCC
TCGCTGCTGG CGCTCAATGC CCCGCCCGAG CGTCGCGGCT GGTACGCCAT GCTCGGACAG
CTCGGCGCGC CACTGGGCTT CATGCTCGCC AGCGGCCTGT TCGCCTACCT GGTGGCGAGC
CTGTCGACCG CCGACTTCCT GGCCTGGGGC TGGCGCTACC CGTTCTTCGT GGCCTTCGCG
ATCAACGTGG TGGCGCTGTT CGCGCGGCTG CGGCTGGTCG TCACCCACGA ATACGAACGC
CTGCTCGACG AGCGCGAACT CGAGCCCATC GGCGTGCTCG AACTGGTCCG CAGCGAGGGC
CACAACCTGG TGATCGGGGC CTTCGCGGCG CTCGCCAGCT ACGCGCTGTT CCATCTGGTG
ACCGTGTTTC CGCTGTCGTG GGTCACGCTC TACTCGCAGC AGTCGGTGGC CGGCTTCCTG
ACCATCCAGA TCTTCGGCGC CGCACTGGCG GCCGGTGGCA TCGTCGCCTC GGGCTGGATC
GCCGACCGCA TCGGTCGGCG CAGCACGCTG GGCGCTTCGG CGGTGCTGAT CGGCATGTTC
AGCGGCTTCG CGCCCACGCT GCTGGGCGGT GGCCCGCTGG GCCAGGACGT GTTCATGCTG
ATCGGCTTCG CGCTACTGGG GCTGTCCTAC GGCCAGGCGG CCGGCGCGGT GACCTCGAAC
TTCTCGTCGA AGTACCGCTA CACCGGCGCC GCGCTGACCG CCGACCTGGC CTGGCTGATC
GGCGCGGCCT TCGCGCCGCT GGTCGCGCTG GGGCTGTCGG CCCACTTCGG ACTGGCCTAC
GTCAGCATCT ACCTGCTGTC GGGCGCGGCC TGCACGCTGG CCGCATTGAG CCTCAACCGC
GCGCTGGGCC CTCGCGACTG A
 
Protein sequence
MSSFSHAQPH LPSSSAAERD ARQRSADHQK VAPGEIAVGV VIGRASEYFD FFVYGIASVL 
VFPALFFPFV DRLQGTLYAF VVFSFAFIAR PLGTVLSMAI QRRWSRGTKL TVALFLLGTS
TAGIAFLPGY QTLGAASIVL LALFRIGQGV ALGGSWDGLP SLLALNAPPE RRGWYAMLGQ
LGAPLGFMLA SGLFAYLVAS LSTADFLAWG WRYPFFVAFA INVVALFARL RLVVTHEYER
LLDERELEPI GVLELVRSEG HNLVIGAFAA LASYALFHLV TVFPLSWVTL YSQQSVAGFL
TIQIFGAALA AGGIVASGWI ADRIGRRSTL GASAVLIGMF SGFAPTLLGG GPLGQDVFML
IGFALLGLSY GQAAGAVTSN FSSKYRYTGA ALTADLAWLI GAAFAPLVAL GLSAHFGLAY
VSIYLLSGAA CTLAALSLNR ALGPRD