Gene Mpe_A0995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0995 
Symbol 
ID4787171 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1056335 
End bp1057585 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content72% 
IMG OID640089557 
Productputative membrane transport protein 
Protein accessionYP_001020192 
Protein GI124266188 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCTG CCGCGCCGCC GTCTCCTTCT TCTTCCTCCT CCTCGCCGCC GCTGCCGCGT 
GGAGCGGTCG CCTGCCTGGC GCTGGCGGCC TTCGGCAGCG GCTTGTCGAT GCGGGTGAAC
GATGCCCTGC TGCCGCGCCT GGCCGGCGAG TTCGCCCTCA CGCTGGGCCA GGCTTCGCAG
GTCATCGGGC TGTTTGCGAC GGCCTACGGG CTGGCCCAGC TGTTCTTCGG TCCGGTCGGC
GATCGCTACG GCAAGTACCG CGTCATCGCC TGGGCCACCG CGGCCTGCGC CCTCACGTCG
GTGCTGTGCG GGATGGCACC CGGCTTCGAT GCGCTGCGTC TGGCGCGCGT GCTGGCCGGA
GCCACCGCGG CGGCGGTGAT CCCCTTGTCG ATGGCGTGGA TCGGCGACGT CGTCGACTAC
GAGCGCCGGC AGCCCGTCCT TGCGCGCTTC CTGATCGGCC AGATCTGTGG CCTGTCCGCC
GGCGTCTGGT TGGGAGGCTT CGCGGCCGAT CACCTCGGCT GGCGCGCGCC TTATTTCCTG
CTCGCGGGCT TCTTCGCGCT GGTGAGCGTC GCGCTGTTCG CGCTGAACCG GCGTCTGCCG
GACGCCGCCC GCCCGGTGCG CGCGGCGAGT GACGGGTCGC CGTTGCGCCG CATCGCGACC
GAGTTCGGCG GCGTGCTGGC GCGTCCCTGG GCTCGGGTGG TCCTCGGTCT GGTGTTTCTC
GAGGGCCTGT TCCTGTTCGG GCCGTTCGCC TTCATCGCCT CGCACGTGCA CGAGGCCTTC
CAGCTCTCGC TGTCGGCCGC GGGCGCGCTG GTGATGCTGT TCGGGCTGGG CGGCTTCGCC
TTCGCCGTTT CGTCCGGCCC CCTGGTGCGG CGGCTCGGCG AGGCCGGCCT GGCACGTTGG
GGCTCGCTGA TGATGTGCGG GGCGCTTGTC GCGGTCGGCT TCGGGCCGGG CTGGGGCTGG
GCGCTGGCCG GATGTTTCGT CGCCGGACTG GGCTTCTACA TGGTGCACAA CACGCTGCAG
GTGAATGCCA CGCAGATGGC GCCCGACCGG CGTGGTGCGG CCGTCGCCGC CTTCGCCTCG
TGCTTCTTCC TCGGGCAGTC GGCCGGCGTG GCGCTGGGCG GGTGGCTGGT GGGGGTGATC
GGTCCGCCGG GCTTCCTGGC GATCGGCGCG GTGGGTCTGC TGCTCATCGG ACGGGCCTTC
GTGGCCGGTC TCGCGCTGCG GTCGCGGGCC GCGGCAGCCG TTGCCGTGTA G
 
Protein sequence
MNAAAPPSPS SSSSSPPLPR GAVACLALAA FGSGLSMRVN DALLPRLAGE FALTLGQASQ 
VIGLFATAYG LAQLFFGPVG DRYGKYRVIA WATAACALTS VLCGMAPGFD ALRLARVLAG
ATAAAVIPLS MAWIGDVVDY ERRQPVLARF LIGQICGLSA GVWLGGFAAD HLGWRAPYFL
LAGFFALVSV ALFALNRRLP DAARPVRAAS DGSPLRRIAT EFGGVLARPW ARVVLGLVFL
EGLFLFGPFA FIASHVHEAF QLSLSAAGAL VMLFGLGGFA FAVSSGPLVR RLGEAGLARW
GSLMMCGALV AVGFGPGWGW ALAGCFVAGL GFYMVHNTLQ VNATQMAPDR RGAAVAAFAS
CFFLGQSAGV ALGGWLVGVI GPPGFLAIGA VGLLLIGRAF VAGLALRSRA AAAVAV