Gene Mpe_B0049 details


Gene Information

Locus tag: Mpe_B0049
Symbol:
ID: 4787652
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008826
Strand:
Start bp: 40374
End bp: 41579
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 66%
IMG OID: 640092458
Product: hypothetical protein
Protein accession: YP_001023063
Protein GI: 124262593
COG category:
COG ID:
TIGRFAM ID:
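
The length fields above are consistent with the coordinates if Start bp and End bp are read as inclusive, 1-based positions (an assumption about this record's convention, not something the page states). Under that reading the gene spans End - Start + 1 bases, and the protein length is the codon count minus one stop codon. A minimal Python sketch; the function names are illustrative and do not belong to any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive, 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS ending in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(40374, 41579)
print(length)                  # 1206
print(protein_length(length))  # 401
```

Both results match the Gene Length and Protein Length fields of this record.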


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.722506
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00143766
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCGCTGC GGTTCCGGAA GACCTTCAAG CTCGCCCCCG GCGTCCGGTG GACCATTTCC 
GGCTCCGGCA GCAGCTGGAG CTTCGGTCCC CGAGGCGCCT CCATCAGCGT GGGCAAGCGC
GGCGTCTACG CCAACTACAG CATCCCCGGC ACCGGCCTGT CATACCGCGA GCGGCTCGGT
GGCCGCAGCG AGACGCCGTC CGCCCAGCTC GGACCCCCGC AGACGACCAA GGTCAGCCTG
ACCTGCGGCA TCGACGATGA AGGCGTCCTG AGCTTCACCG ATGGCGCTGG CATGCCACTC
AGCGAGGCGG TGGTCGAGGC CGCCAAGAAG CAGAACCGGG ACGCCATCCT GGCGCTCATC
CAGCGCAAGT GCGACGAGCT GAACGACCAG GTCGAGGCGC TCGGCCGGCT TCATCACGAC
ACCCCGGACT CCAGGGTGAA GCCCAAGTTC GAACCGCAGC GCATCAACCT GGTAGCGCCC
GAGCGGCCCA CTCTCCGCGT CCCGACATTT CTTGAAGGAC TCAGGAAGTC CGTCCGCCTG
GCCATCCAGG AGGAAAACGA CCGGGCGCTA GCCCGTTTCG AAGGGGACAC GGAAGAGTTC
GAGCGTCAGC GGCGTGCGTT CTACGCGGCC GAGACCAAGC GTCGAGTTCT GGTCGAGCAG
CTGATCTACC AGGACGTCCA GGCCATGGAG GACTTCCTCG AGGCAAACCT CCAGGACATC
GTCTGGCCGC GGGAGACCCA GGTTGCGGTG GACATCGGGG ATGGAGGTCT CACGGTCCAG
CTGGATGTCG ACCTGCCTGA AATCGAGAAC ATGCCGACCA AGTCGGCTGC CGTGCCGGCG
CGCGGACTGA AGCTCTCCGT GAAGGAGCTG CCGGCTGCCA AGGTCCGCCG GCTCTACGCG
GACCACGTGC ACGGCATCGT GTTTCGCCTG GTCGGCGAGA CCTTCGCCGC CCTGCCCGTC
GCCCGCACGG TCGTCGTCTC AGGCTACTCC CAGCGCAGCA ACAGTGCCAC CGGCCACCTC
GAGGACCAGT ACCTGCTCTC AGTCAAGGTC GCCCGAGAAG CCTGGGAACA GCTTGCCTTC
GACCGGCTTG CCGAGCTCAA CGTGGTGGAC TCGCTTGCCC GCCACGAGCT GCGCCGCGAT
CTGACCCGCA TCGGAGAACT GCGCCCCATC AGACCATTCC AGGAGGAGGA GACATGCGAG
GTTTGA
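
The GC content field can in principle be recomputed from the gene sequence. The sketch below is one way to do that in Python, assuming the sequence is pasted as shown, with spaces and line breaks between 10-base blocks. The helper name `gc_content` is hypothetical, and the example call uses only the first 10-base block rather than the whole gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring layout whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence: 7 of 10 bases are G or C.
print(gc_content("ATGGCGCTGC"))  # 70.0
```

Passing the full gene sequence to the same function gives the gene-wide percentage to compare with the GC content field.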
 
Protein sequence
MALRFRKTFK LAPGVRWTIS GSGSSWSFGP RGASISVGKR GVYANYSIPG TGLSYRERLG 
GRSETPSAQL GPPQTTKVSL TCGIDDEGVL SFTDGAGMPL SEAVVEAAKK QNRDAILALI
QRKCDELNDQ VEALGRLHHD TPDSRVKPKF EPQRINLVAP ERPTLRVPTF LEGLRKSVRL
AIQEENDRAL ARFEGDTEEF ERQRRAFYAA ETKRRVLVEQ LIYQDVQAME DFLEANLQDI
VWPRETQVAV DIGDGGLTVQ LDVDLPEIEN MPTKSAAVPA RGLKLSVKEL PAAKVRRLYA
DHVHGIVFRL VGETFAALPV ARTVVVSGYS QRSNSATGHL EDQYLLSVKV AREAWEQLAF
DRLAELNVVD SLARHELRRD LTRIGELRPI RPFQEEETCE V
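
The protein sequence corresponds to the gene sequence translated codon by codon. The Python sketch below shows this under stated assumptions: translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons, so the standard codon table is used here. Alternative start codons are not handled, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop layout spaces and newlines
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]  # KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGCGCTGC GGTTCCGGAA GACCTTCAAG"))  # MALRFRKTFK
```

The output matches the first block of the protein sequence above. Translating the final 18 bases of the gene (`GAGACATGCGAG GTTTGA`) yields `ETCEV`, the last five residues, with TGA read as the stop codon.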