Gene Mpe_A0228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0228 
Symbol 
ID4784013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp244812 
End bp245993 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content73% 
IMG OID640088779 
Producthypothetical protein 
Protein accessionYP_001019425 
Protein GI124265421 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.624133 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCCG ATGCTCCCGT GGGCCGCGGG CTCTGGCGCA TCGGGGCGGG CACGACGCTG 
GTGGTGCTGG CCTTCAGCGT TGTCAACCCG GTGCTCGCCG TGACGCTTCA GCGCCGGGGT
GTGAACGCTG GCGCGATCGG CCTGTTCGCG ATGCTGCCCT TCCTGACCGT GGCGACGATG
ATCCCGGTGA TGCCGCGCGT GTTCGCGCGC CTCGGCGTGA TCCGCGCCTA CCGGGGCGGC
CTGGTGCTGG GCGTGCTGTC ACTGGCGGGC TATGCGCTGA CCGACAGCTA TCTCGCTTGG
TGTTGCTGGT CGGTGCTCGG CGCATTGGGC GCGGCGGCCG AATGGAACGG CACCGAGGCG
CTGATCGCCT TCAACGCGCC GCCGGCCCGG CGCGGCCGCT TCACCGGGAT GTACCAGACC
GCGCTGGGCG CAGCTCTCGC GGTCGGTCCC TTGCTGCCCG GTGCGCTGCA ATGGCTGTTG
CCCGCGGGGA AGCCTCTGCA CACGGTGTGG CTGCTGTGGG GCGCCGCCGC CATCTACGCG
CTGGCGCTGG GAGTCACGGC CGGCCCCGCG GTCGGCCGTC TGCGGGCCTC GCACACCGGC
GGCGGCCGCG ACAGCCTGCG GGCCGCGCTG CGGGCGCGGC CGGCACTGGT GTGGATTGCC
TTCGCGGGTG GTGTGTTCGA GGCGGGCCTC GGTGGCATCA CCGCGGCCTA TGGGTCGCAG
CTCGGCATGT CGCTCGGCGT GGCGACGTCG ATCGCCGGCG CGCTGGGCGT CGGCAGCTTC
GTGCTGCAGT ACCCGGCCGG CTGGCTGGCG GACCACGCGC CGGTGCGGCG GGTGTTCGGC
GTCGCCGGTG CCTTGTTGCT GCTGTCGGTG CTGGCCTTCG GCCTGGCACC CCGCGTGGCC
GCGTTGTTCT GGGTGGCGGC TTTCCTGTGG GGCGCGATCG GCGGCGCGCT CTACACCTTG
ACGATGGTCC GCGTGGCGCA CGAGTTCACC GGTCGCTCCA CCATCGCCGG CACCGCAGCG
ATGATCACCG GCTACACCGC CGGCGGCGCC GTCGGGCCGG CGGTCAGCGG CCTGATGCTC
GAACGCTGCG GTGTGCCGGG GCAGTCGCTT TGGCTGGCCG CGCTCGCCGT CAGCGTGATC
GCCGTGGCAC TGCGCATGCG TGCCGGACCC GAGGGTCCCT GA
 
Protein sequence
MSSDAPVGRG LWRIGAGTTL VVLAFSVVNP VLAVTLQRRG VNAGAIGLFA MLPFLTVATM 
IPVMPRVFAR LGVIRAYRGG LVLGVLSLAG YALTDSYLAW CCWSVLGALG AAAEWNGTEA
LIAFNAPPAR RGRFTGMYQT ALGAALAVGP LLPGALQWLL PAGKPLHTVW LLWGAAAIYA
LALGVTAGPA VGRLRASHTG GGRDSLRAAL RARPALVWIA FAGGVFEAGL GGITAAYGSQ
LGMSLGVATS IAGALGVGSF VLQYPAGWLA DHAPVRRVFG VAGALLLLSV LAFGLAPRVA
ALFWVAAFLW GAIGGALYTL TMVRVAHEFT GRSTIAGTAA MITGYTAGGA VGPAVSGLML
ERCGVPGQSL WLAALAVSVI AVALRMRAGP EGP