Gene Mpe_A1819 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1819 
Symbol 
ID4786818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1958921 
End bp1960174 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content73% 
IMG OID640090390 
Productputative transporter 
Protein accessionYP_001021013 
Protein GI124267009 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.143968 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATGACG CGCCGGGCGC GAGGCTGAGC GCCGACGCCC CTGCCGCATC GCACGTTGAG 
CCAATCGACG CCCTGCGCGG CGCGCGCTGG GCGTTGCTGT TCGGCAATTT CGTGATCGGC
TGCGGCGTGA TGGTCATCGC CGGTTCGCTG AACGACCTGA CGCGCTCGCT GCAGGTGAGC
GTGGCGCTCG GCGGCCAGTT GATCGCGATC GCCGCCGTGA TGATGTGCCT GGGCGCTCCG
CTGCTGGCGG CGGCAGTCGG CCGCGTGGAC CGCCGCCGGC TGCTGGCCTT CACGCTGGCC
TGGTATGCGG TGGGCCACGC CGTATCGGCG CTGATGCCCA GCTACGCCGC GCTGGCGCCG
GTGCGCGCGT TCTGCATGCT GGGTGCCGCG GTGTTCACGC CCCAGGCCGC GGCGGCGATC
GGCTGGCTGG CGCCGCCGGC GCAGCGCGGC CGAGCCATCA CCTTCGTCTT CCTCGGCTGG
TCGGTGGCCT CGGTGTTCGG CATGCCGCTG CACAGCTACA TCGGCGAGGC ACTCGGCTGG
CGCTGGGCCT TCGGCCTCGT GGCGCTGCTG GCCGCCGGTG GCGCCGTCTG GGTGTGGGTG
GTGATGCCCG ACGGCGTGCG GCCGCCGGCC ATGAACCTGC GCGCCTGGCG CGAGGTGCTG
ACGCACCCGC TGCTGATGGC GATCGTGGCG GTCACGGCCC TGTTCGGCGC GGGACAGTTC
ACCGTGTTCA CCTACTTCGC GCCGTACTAC AAGCAGGTAC TCGGCGCCGG GCCCACGCAG
ATCAGCCTGC TGTTCGCCTG GTTCGGTGCC TTCGGGCTGA TCGGCAACGT GCTGCTGTCG
CGCCACGTCG ACCGCATCGG CGCGGCGCGT GCGGTGACCT TGCTACTCGG TGCCGTGGCC
CTGACGATGC TGATCTGGCC GCTGGGCACC GGCTACCTGT CGATGGCCTG CGTGCTGCTG
CCGTGGGCGC TGGGCATGTT CGCGGCCAAT TCGGCGCAGC AGGCCCGCCT GGGTCAGGCC
GCGCCGGCCC TGGCGCCGGC ACTGATGGCG TTGAACACGT CGGCCATTTA CCTCGGCCAG
GCGGTTGGCG CAGCGGGGGG CGGCGCGCTG GTCGCAGCGC AGGAAGCCGC CGGCGCCAGT
GGGTCCGGAC TCTACGGCGG CCTGCACTGG GTCGGCCTGG GCTGGCTGCT CGCGGCGCTG
GCGCTCAGCC GCTGGGCCGA AAACCGCATG CGGCGCGACA GCCGCCACGC CTGA
 
Protein sequence
MDDAPGARLS ADAPAASHVE PIDALRGARW ALLFGNFVIG CGVMVIAGSL NDLTRSLQVS 
VALGGQLIAI AAVMMCLGAP LLAAAVGRVD RRRLLAFTLA WYAVGHAVSA LMPSYAALAP
VRAFCMLGAA VFTPQAAAAI GWLAPPAQRG RAITFVFLGW SVASVFGMPL HSYIGEALGW
RWAFGLVALL AAGGAVWVWV VMPDGVRPPA MNLRAWREVL THPLLMAIVA VTALFGAGQF
TVFTYFAPYY KQVLGAGPTQ ISLLFAWFGA FGLIGNVLLS RHVDRIGAAR AVTLLLGAVA
LTMLIWPLGT GYLSMACVLL PWALGMFAAN SAQQARLGQA APALAPALMA LNTSAIYLGQ
AVGAAGGGAL VAAQEAAGAS GSGLYGGLHW VGLGWLLAAL ALSRWAENRM RRDSRHA