Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1489 |
Symbol | |
ID | 4784087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 1603865 |
End bp | 1606042 |
Gene Length | 2178 bp |
Protein Length | 725 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640090056 |
Product | putative ThiO:disulfide interchange protein |
Protein accession | YP_001020686 |
Protein GI | 124266682 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein [COG4233] Uncharacterized protein predicted to be involved in C-type cytochrome biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.65412 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCCAAC CGTTCGGTCC CCTGTTCGTC GCGGCACTCG CCTCGGCCGC CCTCCTCCTC GGCGGCGCGG CGCACGCGGC TGCGGTGCGC ACCGACCACG TCACGGCCGA ACTGGTGGCC GAGCGCGGTG CGGTGGCAGC CGGTCAGACG CTCAGGATCG GACTGAAGCT CCAGCACATC CCGCATTGGC ACACCTACTG GCGCAACCCG GGCGATTCGG GCCTGCCCAC GACGCTGAGC TGGACGCTGC CGCCCGGCAG CCGGATGGGC GAGATCGAAT GGCCAGCGCC CGAACGCCTG CCGATCGGCC CGCTGGTCAA CTACGGCTAC GAAGGCGAGG TGCTGCTCCC GCTGCAGTAC ACGGCACCGC CCGATGCCAG GCCGGGCGAC ACGCTGAGGC TGCAGGCCCA GGCACGCTGG CTGGTGTGCA ACGACGTCTG CATCCCCGAA CAGGCGACGC TCGATCTGCG GTTGCCGGTG GCCGAGGCAG CGGCCGCCGA CAACGCCGCA CCCGCCGCGC ACGCGGCACT GTTCGCGCAG GCAGCGGCGG CGCAGGCGGG GCCCCTCTCG GCGTGGACCG CGGAGGTTCA GCAGGCCGGA CGAGACCTGC TGCTGACGCT CGAACCGGCA GGCGGCGACT TGCCCGCCGA TGCACCGGAG GTTCATGTCT TCCCGTATGC GGAACAGTTG TTGGAACCCG CAAGCCATGC GCTCTATCGC GGCCCGCGCG GCTATGCCTT GAAGCTGAAG CTGCTGGAGG GCGCAACGGT GCCGGCCCGA CTCGACGGCA TCGCCGTTGC GCAGGCCGCC CCCGGCGCCT CGGGGACCGC CGTCTGGGGC GGCCCGCAAC GCTCGGTCGA GTTCAGCGCA CCGTTGCGTC CCGTCGCGAC GATCACCGTC CCCGCCGGCG CGCGCGCCGC GGCCGAGGAT CGGGGCCCCG CCTCGCTGCG CGGCAGCGCC CCTGTGGGCC TGCTGGCGGC ATTGGGCCTG GCCTTCCTGG GCGGCATGCT GCTGAACCTG ATGCCCTGCG TGTTCCCGGT GCTGTCGATC AAGCTGCTCG GCCTCGCGCG GCAGGAGGGA GACGCCCGCC GCCTGCGAAT GCACGCGCTG GCGTACGGCG TGGGCGTGGT GTGCAGCTTC GTCGCGCTCG CTGCAGCGCT GCTCGCACTG CGTGCGGCCG GCAGCGCCGT GGGCTGGGGC TTCCAGCTGC AGGAGCCCGG CGTGGTGTTC GCGCTGGCGC TGCTGTTCTT CTTGCTCGGG CTCAACCTGG CCGGCCAGTT CGAATTCGGC CTTCTGATGC CGCAAGGGCT GGCGCAATGG CGCGCACAGC GACCAGCGGT GGACGCCTTC GGATCCGGCG TGCTCGCGGT CGTGGCGGCG AGCCCGTGCA CGGCGCCGTT CATGGGCGCG GCCCTCGGCT ACGCCATCGC GCAACCGCCG GCCCAGGCAT TGGGCGTGTT CGCGGCGCTC GGACTCGGCA TGGCCTGGCC CTATGTGCTG CTGGTGCTGC GGCCGGGCTG GCGCGCGCGG TTGCCGCGGC CCGGGCCCTG GATGCTGAGG CTCAAGCAAG GCTTGGCGTT CCCGATGTTC GCCACCGTCG TGTGGCTGCT GTGGGTGCTG GGCCAGCAAG CCGGCATCGA CGGCAGCACG CGCGCGCTGG TGGCGCTGGT CGGCCTGGCT TTCGGGCTGT GGCTGGCGAG TGTGTGGCGC GGCGTGGCGG CGCGCGCGGG AGTGACCGTG CTGCTGGTCG CCGTGCTGGC CTGGGGGTGG CCGGTATCCG AGCGGGCGGC CTCACCGCAG GCCGGTAACG GCACGGCGTC GAACGCCGGC TCGCCCCACA CGGCCTGGCA AGCCTATGAC GAGGCCGCCA TCGACGCGCA TCTGGCGCAG GGCCGGGCCG TCTTCGTCGA CTTCACCGCC GCCTGGTGCG TGAGCTGCCA GGTCAACAAG CGGCTCGTGC TGCACACCGA TGAAACCCTG CAGGCCTTCA CCCGCTCCAA CGTGGCGCTG ATGCGCGCCG ACTGGACCCA CCGCGACGAG CGCATCACCG CCGCGCTCGG CCGGCTCGGT CGCAACGGCG TGCCCGTCTA TGTGCTGATG CGCCCCGGCC GCGAGCCGCT GCTGCTGCCG GAGATCCTCA CCGGAGGTCT CGTGCGCGAG GCCTTGTCGA CCCTGTAG
|
Protein sequence | MPQPFGPLFV AALASAALLL GGAAHAAAVR TDHVTAELVA ERGAVAAGQT LRIGLKLQHI PHWHTYWRNP GDSGLPTTLS WTLPPGSRMG EIEWPAPERL PIGPLVNYGY EGEVLLPLQY TAPPDARPGD TLRLQAQARW LVCNDVCIPE QATLDLRLPV AEAAAADNAA PAAHAALFAQ AAAAQAGPLS AWTAEVQQAG RDLLLTLEPA GGDLPADAPE VHVFPYAEQL LEPASHALYR GPRGYALKLK LLEGATVPAR LDGIAVAQAA PGASGTAVWG GPQRSVEFSA PLRPVATITV PAGARAAAED RGPASLRGSA PVGLLAALGL AFLGGMLLNL MPCVFPVLSI KLLGLARQEG DARRLRMHAL AYGVGVVCSF VALAAALLAL RAAGSAVGWG FQLQEPGVVF ALALLFFLLG LNLAGQFEFG LLMPQGLAQW RAQRPAVDAF GSGVLAVVAA SPCTAPFMGA ALGYAIAQPP AQALGVFAAL GLGMAWPYVL LVLRPGWRAR LPRPGPWMLR LKQGLAFPMF ATVVWLLWVL GQQAGIDGST RALVALVGLA FGLWLASVWR GVAARAGVTV LLVAVLAWGW PVSERAASPQ AGNGTASNAG SPHTAWQAYD EAAIDAHLAQ GRAVFVDFTA AWCVSCQVNK RLVLHTDETL QAFTRSNVAL MRADWTHRDE RITAALGRLG RNGVPVYVLM RPGREPLLLP EILTGGLVRE ALSTL
|
| |