Gene Mpe_A2086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2086 
Symbol 
ID4783665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2234419 
End bp2235714 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content71% 
IMG OID640090654 
Productsulfite oxidase SoxC, putative 
Protein accessionYP_001021277 
Protein GI124267273 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.29998 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.615224 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGGACA CGCCCGGCCC GCCCGGCGAC GCACTGCAGC CGGTGGCCGG CGGCGGCCTG 
CTGGGCCGGC GCGCACTGCT GACGCGCGGG CTGGTGCTGG CCACGGCGGG CAGTGCGGCC
GCGCCGCTGC AGGCGGCCGC GCCGGCGGTC AGCGCCGGCG ATCCTTCGCC CCCATGGATG
CACGCGCCGG GCCGCCCGTT CACGGTCTAT GGCCAGCCCT CGAAGCACGA GCAGCAGGTG
ATCCGCCGCA TCAGCGGCAA CCGCCTGCTG CCGGGCAACG GCGTGTCGCT CACGCCGCTG
GAGGAGCTCG AAGGCATCAT CACGCCCACC GGCCTGCATT TCGAGCGCCA TCACAACGGC
GTACCCGACA TCGATCCGGC GCAGCACCGC CTGCTGATCC ACGGCCGTGT GAAGCGTGCG
CTGAGTTTCT GCGTCGACGA CCTGCTGCGC TACCCGATGC GCTCGCAGCT GCTTGTGCTG
GAGTGCGGCG GCAACAGCAA CGCAGGCTGG CATCCCGAGC CGATCCAGCG GCCGGTCGGC
TCCTTCCACG GCCTGGTTTC GTGCAGCGAG TGGACCGGCG TGCCGCTCTC GGTGCTGCTC
GACGAGGCCG GGATCGAGCC ACGCACGACC TGGGCCGTCG CTGAGGGCGC CGACGCCTTC
GCGATGAACG TGAGCCTGCC GGTCGCCAAG CTGATGGACG ACGCGATCGT CGCGCTCTAC
CAGAACGGCG AGCGCCTGCG GCCCGAGCAC GGCTATCCGC TGCGGCTCAT CGTCCCCGGC
TGGGAGGGTG TCCTGAATGT CAAGTGGCTC AGGCGGCTGC AGTTGAGCGA ACAGCCGCTG
ATGGCGCGCA ACGAGACGGC CAAGTACACC GAGCTGCTGC CCGACGGCAA GGCGCGCATG
TTCACCTTCG TGATGGAGGC CAAGTCGCTG ATCACCTCGC CCTCGCACGG CCAGCACCTG
CGCGGGCCCG ACGTCTACGC GATCAGCGGC CTGGCGTGGA GCGGCCGGGG TCGCATCCGG
CGCGTGGAGG TGTCGGCCGA CGGTGGCCGC AGCTGGGCCG ACGCCACGCT GCAGGACCCG
GTGCTGCCGC GCTGCATGAC GCGCTTTCGC GCCGCATGGA AATGGGATGG TCGACCGACC
GTGCTGAAGA GCCGCGCGAC CGACGAAACC GGCGATGTGC AGCCCGAGCG CAGCGCCCTG
GTTGCGCAGC GCGGCACCAA CGGCTACTTC CACTACAACG CCATCGTGTC CTGGGCCGTC
GACGAGGAAG GCAACGTGCG CCATGTCTAT GCGTGA
 
Protein sequence
MADTPGPPGD ALQPVAGGGL LGRRALLTRG LVLATAGSAA APLQAAAPAV SAGDPSPPWM 
HAPGRPFTVY GQPSKHEQQV IRRISGNRLL PGNGVSLTPL EELEGIITPT GLHFERHHNG
VPDIDPAQHR LLIHGRVKRA LSFCVDDLLR YPMRSQLLVL ECGGNSNAGW HPEPIQRPVG
SFHGLVSCSE WTGVPLSVLL DEAGIEPRTT WAVAEGADAF AMNVSLPVAK LMDDAIVALY
QNGERLRPEH GYPLRLIVPG WEGVLNVKWL RRLQLSEQPL MARNETAKYT ELLPDGKARM
FTFVMEAKSL ITSPSHGQHL RGPDVYAISG LAWSGRGRIR RVEVSADGGR SWADATLQDP
VLPRCMTRFR AAWKWDGRPT VLKSRATDET GDVQPERSAL VAQRGTNGYF HYNAIVSWAV
DEEGNVRHVY A