Gene Mpe_A0436 details


Gene Information

Locus tag: Mpe_A0436
Symbol:
ID: 4785426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 475726
End bp: 476787
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 69%
IMG OID: 640088994
Product: putative sulfite oxidase subunit YedY
Protein accession: YP_001019633
Protein GI: 124265629
COG category: [R] General function prediction only
COG ID: [COG2041] Sulfite oxidase and related enzymes
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
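
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The helper names gene_length_bp and protein_length_aa are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
START_BP, END_BP = 475726, 476787

length = gene_length_bp(START_BP, END_BP)
print(length, "bp")                     # 1062 bp
print(protein_length_aa(length), "aa")  # 353 aa
```

Under these assumptions the computed values agree with the recorded Gene Length (1062 bp) and Protein Length (353 aa).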


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.464961
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGGAAC TCCCGCCCCG CGCCGGCCTT CGAACCTCCG CGTTCCTCAC CAGGCGACCG 
ACCATGCTGA TTCCCCGACA GGCCCGCGCG GGCTACCTCC ATCCCGTGGC CAGCGAGATC
ACGCCGCGTG CGGCCTACGA GCAGCGGCGC GAGTTCCTGC GCCTGCTGGC CGCCGGCGGC
GCCGGCGCCG CGCTGGCCGG CTGGGCGCAG CGCGACGCGC TGGCGCAGGC TCCCCGCGCC
GGCAAGCTCG CGGCCCTGCC CGGCGCGCGC AGCGGCGTAG CCGGCGCCTC GACAGTCGAG
AAGCAGACCG CCTACGCGGA CGCCACCAGC TACAACAACT TCTACGAGTT CGGCACCGGC
AAGGAAGACC CGGCGCGCAA TGCCGGCAAG CTGCAGACGC GGCCCTGGAC GGTGGCGATC
GAGGGCGAGG TGAAGAAGCC GCAGACCCTC GGCATCGAGG ACCTGCTCAA GCTCGCCCCG
ATGGAGGAGC GCATCTACCG ACTGCGCTGC GTCGAGGGCT GGTCGATGGT GATTCCCTGG
GTCGGCTACT CGCTGGCGGA ACTGATCAAG CGCGTCGAGC CGACCGGCAA CGCGAAGTTC
ATCGAGTTCG TGACCCTGGC CGACCCGAAG CAGATGCCCT TCGTCGGCTC GCGCGTGCTC
GAGTGGCCCT ACGTCGAAGG CCTGCGCCTC GATGAGGCCC TGCACCCGCT GACCCTGCTG
GCCTTCGGCA TGTACGGCGA GGTGCTGCCC AACCAGAACG GCGCGCCGGT GCGGCTGGTC
GTGCCGTGGA AGTACGGGTT CAAGAGCGCC AAGTCCCTCG TCAAGATCCG CTTCGTCGAG
CAGCAGCCGA AGACCGCCTG GTTCAAGGCG GCCTCGCACG AGTACGGCTT CTACTCGAAC
GTGAACCCCA AGGTCGACCA CCCGCGCTGG AGCCAGGCCA CCGAGCGCCG CATCGGTGAG
GACGGCATCT TCCAGAAGAA GCGCCCGACG CAGATGTTCA ACGGCTACGA GGCGCAGGTC
GGTCAGCTCT ACGCGGGCCT CGATCTCGCC AAGAACTTCT GA
 
Protein sequence
MTELPPRAGL RTSAFLTRRP TMLIPRQARA GYLHPVASEI TPRAAYEQRR EFLRLLAAGG 
AGAALAGWAQ RDALAQAPRA GKLAALPGAR SGVAGASTVE KQTAYADATS YNNFYEFGTG
KEDPARNAGK LQTRPWTVAI EGEVKKPQTL GIEDLLKLAP MEERIYRLRC VEGWSMVIPW
VGYSLAELIK RVEPTGNAKF IEFVTLADPK QMPFVGSRVL EWPYVEGLRL DEALHPLTLL
AFGMYGEVLP NQNGAPVRLV VPWKYGFKSA KSLVKIRFVE QQPKTAWFKA ASHEYGFYSN
VNPKVDHPRW SQATERRIGE DGIFQKKRPT QMFNGYEAQV GQLYAGLDLA KNF
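
The relationship between the gene sequence, the translation table and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record: it assumes the sequence is given in coding orientation, that whitespace between the 10-character blocks can simply be stripped, and that translation table 11 uses the standard codon assignments with alternative start codons (such as GTG) read as methionine at the first position. The function names clean, gc_content and translate_cds are hypothetical.

```python
BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order (shared by tables 1 and 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed start codons for translation table 11 (bacterial, archaeal, plant plastid).
TABLE_11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display sequence blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS up to the first stop codon; an initial start codon becomes M."""
    seq = clean(seq)
    residues = []
    for index in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[index:index + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if index == 0 and codon in TABLE_11_STARTS:
            residue = "M"
        residues.append(residue)
    return "".join(residues)


# First line of the gene sequence shown above.
FIRST_LINE = "GTGACGGAAC TCCCGCCCCG CGCCGGCCTT CGAACCTCCG CGTTCCTCAC CAGGCGACCG"
print(translate_cds(FIRST_LINE))  # MTELPPRAGLRTSAFLTRRP
```

The first 60 bases translate to MTELPPRAGLRTSAFLTRRP, matching the first 20 residues of the protein sequence; the leading GTG codon is read as M. Passing the full gene sequence to gc_content gives a value that can be compared with the GC content field of the record.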