Gene Mpe_A3350 details

Gene Information

Locus tag: Mpe_A3350
Symbol:
ID: 4786391
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3559957
End bp: 3561633
Gene length: 1677 bp
Protein length: 558 aa
Translation table: 11
GC content: 66%
IMG OID: 640091923
Product: putative sugar transport protein
Protein accession: YP_001022538
Protein GI: 124268534
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
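
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(3559957, 3561633)
print(length_bp)                  # 1677
print(protein_length(length_bp))  # 558
```

Both values agree with the gene length (1677 bp) and protein length (558 aa) given in the record.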


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCTGCTG TTCTACCCAC CTCCGGCGCC GCTGCGGCGC GCCCGATGAC CGCGGAGGAG 
AAGAAGGTCA TCTTCGCGTC CTCGCTCGGG ACCGTGTTCG AGTGGTACGA CTTCTACCTG
TACGGATCGC TGGCGGCCAT CATCGCCAAG CAGTTCTTCG CGGGGCTGGA TGCCGGCTCG
GCCTTCATCT TCGCGCTGCT GGCGTTCGCC GCCGGCTTCA TCGTGCGACC GTTCGGCGCG
ATCTTCTTCG GCCGTCTGGG CGACATGATC GGCCGCAAGT ACACCTTCCT GGTCACGATC
CTGATCATGG GTCTGTCGAC CTTCATCGTC GGCATCCTGC CCAACTACGC CGCGATCGGC
GTGGCCGCGC CGGTCATCCT GATCGGCCTG CGCCTGCTGC AGGGCCTGGC GCTCGGCGGT
GAGTACGGCG GTGCCGCCAC CTACGTGGCC GAGCACGCTC CGCACGGCAA GCGCGGCGCC
TACACCTCGT GGATCCAGAC CACCGCGACG CTGGGCCTGT TCCTGTCGCT GATGGTCATC
CTGGGGACCC GCACGCTGGT CGGCGAAGCG GCGTTCGCCG ACTGGGGCTG GCGCGTGCCT
TTCCTGGTCT CGATCTTCCT GCTCGCGATC AGCGTGTGGA TCCGCCTGAG CATGAACGAA
TCGCCCGCCT TCAAGAAGAT GAAGGAGGAG GGCAAGACCT CCAAGGCGCC GCTGACCGAG
TCGTTCGGCC AGTGGAAGAA CCTGAAGATC GTGATCCTGG CGCTGATCGG CCTGACCGCC
GGCCAGGCCG TGGTCTGGTA CACCGGTCAG TTCTACGCGC TGTTCTTCCT GACGCAGTCG
CTGAAGGTCG ACGGTGCCAC CGCGAACATC ATGATCGCGA TCTCGCTGCT GATCGGCACG
CCGTTCTTCA TCGTCTTCGG CTCGCTGTCG GACAAGATCG GCCGCAAGCC CATCATCCTG
GCCGGCTGCC TGATCGCCGC GCTGACCTTC TTCCCGCTGT TCAAGGCGCT CACCGAGGCG
GCCAACCCCG ACCTCGCCGC CGCGCAGGCG AAGAACAAGG TGCTGGTGCA CGCCGACCCG
GCCGAGTGCT CGTTCCAGTT CAACCCGACC GGCACCGTCA AGTTCACCAG CTCGTGCGAC
ATCGCCAAGC AGGTCCTGGC CGCCGGCTCG GTGAGCTACG ACAACGTGGC GCATGCCGCC
GGCACGCCCG CCACCATCAC CATCGGCGAG ACGGTCATCC AGAGCTACAG CTCCAAGGGC
CTCCCGCCCG ACGAGGCGAA GGCGAAGGAC GCCGAGTTCA AGAAGTCGGT CGCCGAGACC
CTGAAGGCCG CCGGCTACCC CGCCAAGGCC GATCCGGCGA AGATGAACAA GCCGCTGATC
GTCGGCATCC TGGTGATCCT GGTGATCTAC GTCACCATGG TGTACGGGCC GATCGCCGCG
ATGCTGGTCG AGATGTTCCC GACCCGCATC CGCTACACCT CGATGAGCCT GCCGTACCAT
ATCGGCAACG GCTGGTTCGG CGGCCTGCTG CCCACCACCG CCTTCGCGAT CGTGGCCCAG
ACCGGCAACA TGTACAACGG CCTCTGGTAC CCGATCATCA TCGCCGGCAT CACCTTCGTC
GTGGGTCTGA TCTTCGTCCG CGAGACCAAG GACGTCGACA TCTACGCCAA GGACTGA
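
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any analysis. The sketch below shows one way to strip the spacing, compute GC content, and run basic sanity checks on the open reading frame. It uses only the first and last lines of the sequence as sample input; the helper names are hypothetical, and the start-codon check covers only ATG, although translation table 11 permits other initiation codons as well.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text):
    """Remove all whitespace (block spacing, line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq):
    """True if the length is a multiple of 3, the start is ATG, and the end is a stop codon."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# First and last lines of the gene sequence shown above.
first_line = clean_sequence(
    "ATGGCTGCTG TTCTACCCAC CTCCGGCGCC GCTGCGGCGC GCCCGATGAC CGCGGAGGAG"
)
last_line = clean_sequence(
    "GTGGGTCTGA TCTTCGTCCG CGAGACCAAG GACGTCGACA TCTACGCCAA GGACTGA"
)

print(first_line[:3])   # ATG
print(last_line[-3:])   # TGA
```

Passing the full 1677 bp sequence to `gc_fraction` gives a value that can be compared with the 66% GC content listed in the record.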
 
Protein sequence
MAAVLPTSGA AAARPMTAEE KKVIFASSLG TVFEWYDFYL YGSLAAIIAK QFFAGLDAGS 
AFIFALLAFA AGFIVRPFGA IFFGRLGDMI GRKYTFLVTI LIMGLSTFIV GILPNYAAIG
VAAPVILIGL RLLQGLALGG EYGGAATYVA EHAPHGKRGA YTSWIQTTAT LGLFLSLMVI
LGTRTLVGEA AFADWGWRVP FLVSIFLLAI SVWIRLSMNE SPAFKKMKEE GKTSKAPLTE
SFGQWKNLKI VILALIGLTA GQAVVWYTGQ FYALFFLTQS LKVDGATANI MIAISLLIGT
PFFIVFGSLS DKIGRKPIIL AGCLIAALTF FPLFKALTEA ANPDLAAAQA KNKVLVHADP
AECSFQFNPT GTVKFTSSCD IAKQVLAAGS VSYDNVAHAA GTPATITIGE TVIQSYSSKG
LPPDEAKAKD AEFKKSVAET LKAAGYPAKA DPAKMNKPLI VGILVILVIY VTMVYGPIAA
MLVEMFPTRI RYTSMSLPYH IGNGWFGGLL PTTAFAIVAQ TGNMYNGLWY PIIIAGITFV
VGLIFVRETK DVDIYAKD
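
The protein sequence can be checked against the gene sequence by translating the nucleotides. The sketch below builds a codon table by hand instead of relying on an external library. It uses the standard codon assignments, which translation table 11 shares; table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG. Only the first and last lines of the gene sequence are translated, and the function and variable names are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence (bases 1-60).
n_terminal = translate(
    "ATGGCTGCTG TTCTACCCAC CTCCGGCGCC GCTGCGGCGC GCCCGATGAC CGCGGAGGAG"
)
# Last line of the gene sequence (bases 1621-1677, which is in frame).
c_terminal = translate(
    "GTGGGTCTGA TCTTCGTCCG CGAGACCAAG GACGTCGACA TCTACGCCAA GGACTGA"
)

print(n_terminal)  # MAAVLPTSGAAAARPMTAEE
print(c_terminal)  # VGLIFVRETKDVDIYAKD
```

Both fragments match the first 20 and last 18 residues of the protein sequence listed above.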