Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1170 |
Symbol | |
ID | 4785569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 1254608 |
End bp | 1257574 |
Gene Length | 2967 bp |
Protein Length | 988 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640089733 |
Product | formate dehydrogenase large subunit precursor |
Protein accession | YP_001020366 |
Protein GI | 124266362 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.970148 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGCTCA CGCGCAAGTC GCCTGATGCG GGCTCCGGCT CGCCTGCATC GTCGCTGGTC TCGAGCCTCT CGCGCGGCCT CTCGCGCGCC ATCCCGACGA TGGACCGTCG GTCCTTCCTG CGCCGCTCCG GCCTGGGCGT GGGGGCCGGT CTCGCCGCCG GGCAGCTGAC GTTGGTGCGC AAGGCGCAGG CCTCGGATGC GTCCAAGGCC GAGGCGGGCC CGGGCAAGGT GGAGGTCAGG CGCACGGTGT GCGGTCACTG CTCGGTGGGC TGCGCGGTCG ACGCGGTGGT GCACAACGGC GTGTGGGTGC GCCAGGAGCC GGTGTTCGAC TCGCCGATCA ACCTGGGCGC GCACTGCGCC AAGGGCGCTG CGCTGCGCGA GCACGGCCGC GGCGAATACC GCCTGCGCTA CCCGATGAAG CTGGTGAACG GCAAGTACGA GCGCATCAGC TGGGACACCG CACTCGACGA GATCAGCGCG CGCATGCTCG AGCTGCGCAA GCAGAGCGGC CCGGACTCGG TGTTCGTGGT CGGGTCGAGC AAGCACAACA ACGAGCAGGC CTACCTGCTG CGCAAGTGGA TGAGCCTGTG GGGCAGCAAC AACTGCGATC ACCAGGCACG CATCTGCCAC AGCACCACGG TCGCGGGCGT CGCGAACACC TGGGGCTACG GCGCGATGAC CAACTCCTAC AACGACATGC AGAACGCCCG CATGGCGCTC TACATCGGCT CCAACGCCGC CGAGGCGCAC CCGGTGTCGA TGCTGCACAT GCTGCATGCC AAGGAGACCG GCTGCAAGAT GATCGTCGTG GACCCGCGCT TCACGCGCAC CGCGGCGCGT GCCGACGAAC ACGTGCGCAT CCGTTCGGGC TCGGACATCC CCTTCCTGTT CGGCGTGCTG CACCACATCT TCAAGAACGG CTGGGAAGAC AAGCAGTACA TCAACGACCG CGTCTACGGC ATGGACAAGG TGCGCGAGGA CGTGCTCGCG AACTGGACGC CGGACAAGGT GCAGGAGGCC TGTGGCGTCG ACGAGGCCAC CGTGTTCCGC GTGGCCAAGA CCATGGCCGA CAACCGCCCC GGCACCATCG TCTGGTGCAT GGGCCAGACC CAGCACACCA TCGGCAACGC GATGGTGCGC GCCTCGTGCA TTCTGCAGCT CGCGCTCGGC AACATCGGCA AGAGCGGCGG CGGCGCCAAC ATCTTCCGTG GCCACGACAA CGTGCAGGGC GCCACGGACG TGGGCCCCAA CCCCGATTCG CTGCCTGGCT ACTACGGCCT GGCCGAGGGC TCGTTCAAGC ATTTCGCCAA GACCTGGGGC GTGGACTTCG AGTGGATCAA GAAGCAGTAC GCGCCGGGCA TGATGACCAA ACCCGGCATG ACGGTGTCGC GCTGGGTCGA CGGCGTGCTC GAAAAGAACG AGCTGATCGA CCAGGACAGC AACCTGCGCG GCCTGTTCTT CTGGGGCCAT GCGCCGAACT CGCAGACGCG CGGCCTGGAA ATGAAGAAGG CGATGGACAA GCTCGACCTG CTGGTGGTCG TCGATCCCTT CCCGTCGGCC ACCGCGGCGA TGGCGGCGAT GCCCGGCAAG CCCGAGGACC AGAACCCCAA CCGCGCCGTC TACCTGCTGC CGGCGACGAC GCAGTTCGAG ACCAGTGGCT CGTGCACCGC GTCGAACCGC TCGATCCAGT GGCGCGAGCA GGTCATCGAG CCGCTGTGGG AGAGCCGCAA CGACCACATG ATCATGTACC AGCTCGCGCA GAAGCTGGGC TTCGACAAGG AGCTGACGAA GAACTACAAG CTGGTCGCCG GCAAGGGCGG CATGATGGAG CCCGAGCCCG AATCGATCCT GCGCGAGATC AACAAGAGCG TCTGGACCAT CGGCTACACC GGCCAGAGCC CCGAGCGCCT GAAGGTGCAC ATGCGCAACA TGCACGTCTT TGACGTGAAG ACGCTGCGCG CCAAGGGCGG CACCGACAAG GAGACCGGGT ACTCCCTCGA CGGCGATTAC TTCGGCCTGC CTTGGCCGTG CTACGGCACG CCGGAAATGA AGCACCCGGG CTCGCCCAAC CTCTACGACA CCTCCAAGCA CGTGATGGAC GGCGGCGGCA ACTTCCGCGC CAACTTCGGT GTCGAGAAGG ACGGCGTCAA CCTGCTGGCC GAGGATGGCT CGTTCTCCAA GGGTGCGGAC CTCACCACCG GCTATCCGGA GTTCGACCAC GTGCTGCTGA AGAAGCTCGG CTGGTGGGAC GAACTGAGCG AGCCCGAGAA GGCCGCGGCC GAGGGCAAGA ACTGGAAGAC CGATTCGTCG GGCGGGATCA TCCGCGTCGC GATGAAGAAC CACGGCTGCC ATCCGTTCGG CAACGCGAAG GCGCGGGCCG TGGTGTGGAA CTTCCCCGAT GCGATCCCGA AGCACCGCGA GCCGCTGTAC TCGACGCGTC CGGACCTGGT GGCCAAGTAC CCGACGCACG ACGACAAGAA GGCGTTCTGG CGCCTGCCCA CCTTGTTCAA GACCGTGCAG GACAAGAACA AGGACATCGG CAAGGACTTC CCGCTGATCA TGAGCAGCGG GCGGCTGGTG GAGTACGAGG GCGGCGGCGA GGAGACCCGC TCCAACCCCT ACCTGGCCGA GCTGCAGCAG GAGATGTTCG TCGAGATCAA CCCAGCCACC GCCAACGACC GCGGCATCCG CAACGGCGAG ACGGTCTGGG TCCGCACGCC GACCGGCGCG CGCCTCACGG TCAAGGCGCT GGTCACCGAG CGAGTGGACC GCGAGACGGT GTGGCTGCCC TTCCACTTCT CGGGCCGCTG GCAGGGCGTC GATCTCGCGC CCTACTACCC GCAGGGGGCG ATGCCGGTCA TCCGCGGCGA GGCGATCAAC ACCGCCACCA CCTACGGCTA CGACAGCGTC ACGATGATGC AGGAGACCAA GACCACGGTC TGCCAGGTCG AGCGCGCCTC TGTGTGA
|
Protein sequence | MLLTRKSPDA GSGSPASSLV SSLSRGLSRA IPTMDRRSFL RRSGLGVGAG LAAGQLTLVR KAQASDASKA EAGPGKVEVR RTVCGHCSVG CAVDAVVHNG VWVRQEPVFD SPINLGAHCA KGAALREHGR GEYRLRYPMK LVNGKYERIS WDTALDEISA RMLELRKQSG PDSVFVVGSS KHNNEQAYLL RKWMSLWGSN NCDHQARICH STTVAGVANT WGYGAMTNSY NDMQNARMAL YIGSNAAEAH PVSMLHMLHA KETGCKMIVV DPRFTRTAAR ADEHVRIRSG SDIPFLFGVL HHIFKNGWED KQYINDRVYG MDKVREDVLA NWTPDKVQEA CGVDEATVFR VAKTMADNRP GTIVWCMGQT QHTIGNAMVR ASCILQLALG NIGKSGGGAN IFRGHDNVQG ATDVGPNPDS LPGYYGLAEG SFKHFAKTWG VDFEWIKKQY APGMMTKPGM TVSRWVDGVL EKNELIDQDS NLRGLFFWGH APNSQTRGLE MKKAMDKLDL LVVVDPFPSA TAAMAAMPGK PEDQNPNRAV YLLPATTQFE TSGSCTASNR SIQWREQVIE PLWESRNDHM IMYQLAQKLG FDKELTKNYK LVAGKGGMME PEPESILREI NKSVWTIGYT GQSPERLKVH MRNMHVFDVK TLRAKGGTDK ETGYSLDGDY FGLPWPCYGT PEMKHPGSPN LYDTSKHVMD GGGNFRANFG VEKDGVNLLA EDGSFSKGAD LTTGYPEFDH VLLKKLGWWD ELSEPEKAAA EGKNWKTDSS GGIIRVAMKN HGCHPFGNAK ARAVVWNFPD AIPKHREPLY STRPDLVAKY PTHDDKKAFW RLPTLFKTVQ DKNKDIGKDF PLIMSSGRLV EYEGGGEETR SNPYLAELQQ EMFVEINPAT ANDRGIRNGE TVWVRTPTGA RLTVKALVTE RVDRETVWLP FHFSGRWQGV DLAPYYPQGA MPVIRGEAIN TATTYGYDSV TMMQETKTTV CQVERASV
|
| |