Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0436 |
Symbol | |
ID | 4785426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 475726 |
End bp | 476787 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640088994 |
Product | putative sulfite oxidase subunit YedY |
Protein accession | YP_001019633 |
Protein GI | 124265629 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.464961 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGAAC TCCCGCCCCG CGCCGGCCTT CGAACCTCCG CGTTCCTCAC CAGGCGACCG ACCATGCTGA TTCCCCGACA GGCCCGCGCG GGCTACCTCC ATCCCGTGGC CAGCGAGATC ACGCCGCGTG CGGCCTACGA GCAGCGGCGC GAGTTCCTGC GCCTGCTGGC CGCCGGCGGC GCCGGCGCCG CGCTGGCCGG CTGGGCGCAG CGCGACGCGC TGGCGCAGGC TCCCCGCGCC GGCAAGCTCG CGGCCCTGCC CGGCGCGCGC AGCGGCGTAG CCGGCGCCTC GACAGTCGAG AAGCAGACCG CCTACGCGGA CGCCACCAGC TACAACAACT TCTACGAGTT CGGCACCGGC AAGGAAGACC CGGCGCGCAA TGCCGGCAAG CTGCAGACGC GGCCCTGGAC GGTGGCGATC GAGGGCGAGG TGAAGAAGCC GCAGACCCTC GGCATCGAGG ACCTGCTCAA GCTCGCCCCG ATGGAGGAGC GCATCTACCG ACTGCGCTGC GTCGAGGGCT GGTCGATGGT GATTCCCTGG GTCGGCTACT CGCTGGCGGA ACTGATCAAG CGCGTCGAGC CGACCGGCAA CGCGAAGTTC ATCGAGTTCG TGACCCTGGC CGACCCGAAG CAGATGCCCT TCGTCGGCTC GCGCGTGCTC GAGTGGCCCT ACGTCGAAGG CCTGCGCCTC GATGAGGCCC TGCACCCGCT GACCCTGCTG GCCTTCGGCA TGTACGGCGA GGTGCTGCCC AACCAGAACG GCGCGCCGGT GCGGCTGGTC GTGCCGTGGA AGTACGGGTT CAAGAGCGCC AAGTCCCTCG TCAAGATCCG CTTCGTCGAG CAGCAGCCGA AGACCGCCTG GTTCAAGGCG GCCTCGCACG AGTACGGCTT CTACTCGAAC GTGAACCCCA AGGTCGACCA CCCGCGCTGG AGCCAGGCCA CCGAGCGCCG CATCGGTGAG GACGGCATCT TCCAGAAGAA GCGCCCGACG CAGATGTTCA ACGGCTACGA GGCGCAGGTC GGTCAGCTCT ACGCGGGCCT CGATCTCGCC AAGAACTTCT GA
|
Protein sequence | MTELPPRAGL RTSAFLTRRP TMLIPRQARA GYLHPVASEI TPRAAYEQRR EFLRLLAAGG AGAALAGWAQ RDALAQAPRA GKLAALPGAR SGVAGASTVE KQTAYADATS YNNFYEFGTG KEDPARNAGK LQTRPWTVAI EGEVKKPQTL GIEDLLKLAP MEERIYRLRC VEGWSMVIPW VGYSLAELIK RVEPTGNAKF IEFVTLADPK QMPFVGSRVL EWPYVEGLRL DEALHPLTLL AFGMYGEVLP NQNGAPVRLV VPWKYGFKSA KSLVKIRFVE QQPKTAWFKA ASHEYGFYSN VNPKVDHPRW SQATERRIGE DGIFQKKRPT QMFNGYEAQV GQLYAGLDLA KNF
|
| |