Gene Mpe_B0103 details

Gene Information

Locus tag: Mpe_B0103
Symbol:
ID: 4787706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008826
Strand:
Start bp: 94925
End bp: 96310
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 58%
IMG OID: 640092512
Product: hypothetical protein
Protein accession: YP_001023117
Protein GI: 124262647
COG category:
COG ID:
TIGRFAM ID:
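
The coordinate and length fields above agree with one another, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(94925, 96310)
print(length)                     # 1386
print(protein_length_aa(length))  # 461
```

Both results match the Gene Length and Protein Length fields reported for this locus.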


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.887161
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000190274
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCCAGT TCGACATCCA CCGAGCATCC GAAGGGTTGG TGTTGGGGCT ACTGCGTGAG 
TTATATGGTT GGCCGAGGCT GCGCAATCTG AACACGGAAG AGCGAACCAA CTTCCCTGGG
ATCGATCTCG CTGACGACGA GGCGCGCGTG GCAGTGCAGG TCACGGGCAC GCCGACGCTG
GACAAGATCA AGGGAACCGT CTCCACCTTC CTGACGCACG GCCTAGACAA GCGGTACGAC
CGACTGGTGA TCTATGTCCT GACTCGGAAG CAGGGCAGCT ATTCGCAGGA CGCGATCGAC
AAGGTGTCTT TGGGACGCGT GAACGTCAGT GCTCGCGACG ACATACTTGA TGTGCGTGAC
GTGTGCGCCA AGGCGTCGAC CGTTGATCCA AAGACTTTGG CGAACGCACT TGAGGTCCTT
CGCTCCTACA TGCGAGGAGG CGTTGCTGCC GGTCTCGCTG AGGAGGACTT CGACCCTCCC
GCATTCCCGG TGGAGCGCGC CATCCTCAAT CTCATTGAGG TCTACTTTCC AGCGCGCATC
TACGTTGCGG ACCTGCGCGA CGATGTGGGT TCGAAGGCCG ACAGGCGTCC GCGCAATGAA
CGCAAGCTGA TCAGGACTAC GCTGGAAGAG TTGAACTTGC GTGTGCCCTC CGGCTACGAG
GTAAGCAGCA GGCAGTTAAT TACCTTTCAC CCACTTGATG ATTCTCAGGG GCCTTTTGCG
AGACTAATAG AGCTTGGGAC AGTCACCCCG CTGGTTCCGT CAGAGTTCTA TGGGAGCAAC
AACGATCAGG AGCGGATCTT CAAGTCGTTG CTACGCTTCA CGCTGCAGCA GAAGTTGCAC
AGGCATCGCG TGCGCTGGTT TCACGACGAT GGGCTCTTTG CGTTTCTGCC GTTTGATGAC
AAGGAGTCAC TTCGCGAGGA GACGTGGACA GGTCACAAGA AGACGTCGCG CCGAGTGTTT
GAGCGAAAGC AGAACAAGAA CGACCCCAGC AAGACCTTCA TCTGCAAGCA CTTCGCATTC
GCCACCGACT TCGTGCTGAA CGACGGTCGT TGGTATATCG CGCTCACCCC CGACTGGTAC
TTCAGCTATG GCGACGACTA TCGGCGCTCG CGGTACGCGG ACGAGTCGTT GAAGTGGTTG
AAGCGGAAGG AAGTGAATCG AACAGTCACC GACCACTTCC GGTTCTTGAC GTCCTGGCTA
GCAGCTCTCG ATCAGGACGA TTTGTTCGCT CTGGCTGCAG GTGGCGCGCC GACGCTAACT
TTCGGTGAGG TACTGGCGTT CGACAATCAC CCATCTCTTG ATGATGAGGC TTGGCTGCCG
CTGCGCGACG CCACAGGCGA CGACGACGAG GCCGCGACGA TCAAGGGCCT ATTCGACTCA
GAATGA
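
The GC content field can in principle be recomputed from the gene sequence. The sketch below shows one way to do so, assuming the sequence is pasted in the 10-base blocks shown here so that whitespace has to be stripped first; `gc_content` is an illustrative helper, not a function of the source database, and the example only runs on the first two blocks of the sequence.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence above.
example = "ATGAGCCAGT TCGACATCCA"
print(f"{gc_content(example):.0%}")  # 50%
```

Passing the full 1386 bp sequence instead of the short example would give the value to compare against the reported GC content.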
 
Protein sequence
MSQFDIHRAS EGLVLGLLRE LYGWPRLRNL NTEERTNFPG IDLADDEARV AVQVTGTPTL 
DKIKGTVSTF LTHGLDKRYD RLVIYVLTRK QGSYSQDAID KVSLGRVNVS ARDDILDVRD
VCAKASTVDP KTLANALEVL RSYMRGGVAA GLAEEDFDPP AFPVERAILN LIEVYFPARI
YVADLRDDVG SKADRRPRNE RKLIRTTLEE LNLRVPSGYE VSSRQLITFH PLDDSQGPFA
RLIELGTVTP LVPSEFYGSN NDQERIFKSL LRFTLQQKLH RHRVRWFHDD GLFAFLPFDD
KESLREETWT GHKKTSRRVF ERKQNKNDPS KTFICKHFAF ATDFVLNDGR WYIALTPDWY
FSYGDDYRRS RYADESLKWL KRKEVNRTVT DHFRFLTSWL AALDQDDLFA LAAGGAPTLT
FGEVLAFDNH PSLDDEAWLP LRDATGDDDE AATIKGLFDS E
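
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that step, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons; because this gene begins with ATG, no special start-codon handling is included. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAGCCAGT TCGACATCCA CCGAGCATCC"))  # MSQFDIHRAS
```

The output matches the first ten residues of the protein sequence listed above.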