Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3710 |
Symbol | |
ID | 4786059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 3921181 |
End bp | 3924114 |
Gene Length | 2934 bp |
Protein Length | 977 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640092293 |
Product | NAD-dependent formate dehydrogenase alpha subunit |
Protein accession | YP_001022898 |
Protein GI | 124268894 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.749683 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.699855 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGACC CCCACGCCGA GATCGACTAC GGCACCCCGG CCAGCGCGTC GACGCAGGCC GTCACGCTCG AGATCGACGG CCAGGCCGTC ACCGTGCCGG CCGGCACCTC GCTGATGCGC GCGGCGATCG ACGCCGGCGT GCAGGTGCCC AAGCTGTGCG CGACCGACAG CCTGGAGCCC TTCGGCTCGT GCCGCCTGTG CCTGGTCGAG ATCGAGGGCC GCCGCGGCTA CCCGGCCTCG TGCACCACGC CGGCCGAGGC CGGCATGAAG GTGCGCACGC AGAGCCCGAA GCTGCAGGAG CTGCGCAAGG GGGTGATGGA GCTCTACATC TCCGACCACC CGCTCGACTG CCTGACCTGC GCCGCCAACG GCGACTGCGA GCTGCAGGAC ATGGCCGGCG TGACCGGCCT GCGCGAGGTC CGCTACGGCG TCGGTGACCA GTACGGCGGC GCCCACCACC TGAAGAGCGC GAAGGACGAG TCGAACCCGT ACTTCACCTA CGACCCGAGC AAGTGCATCG TCTGCAACCG CTGCGTGCGC GCCTGCGAGG AGACCCAGGG CACCTTCGCG CTGACGATCA GTGGCCGCGG CTTCGAGTCG CGCGTGTCGG CGGGCCAGGA CCAGCCCTTC ATGGAAAGCG AGTGCGTCAG CTGCGGCGCC TGCGTGCAGG CCTGCCCGAC CGCCACGCTG CAGGAGAAGT CGGTCATCTG GCTGGGCCAG GCCGAGCACA GCGTCACCAC CACCTGTGCC TACTGCGGCG TCGGCTGCGG CTTCAAGGCC GAGATGAAGG GCAACGAGGT CGTGCGCATG GTGCCGTGGA AGAACGGCCA GGCCAACGAA GGCCACTCCT GCGTCAAGGG CCGCTTCGCC TGGGGCTACG CGACCCACAA GGACCGCATT ACCACGCCGA TGATCCGCAA GCGCATCACC GACCCGTGGC AGGAGGTGAG CTGGGACGAG GCGATCGGCT ATGCCGCCAG CGAGTTCAAG CGCATCCAGG CGAAGTACGG GCGCGACGCG ATCGGCGGCA TCGTCTCCTC GCGCTGCACC AACGAGGAAG GCTATCTGGT GCAGAAGCTG GTACGCGCGG CCTTCGGCAA CAACAACGTC GACACCTGCG CGCGCGTGTG CCACTCACCG ACCGGCTATG GCCTGAAGCA GACGCTGGGC GAATCGGCCG GCACGCAGAC CTTCAAGTCG GTGGAGAAGT CGGACGTGAT CATGGTCATC GGTGCCAACC CGACCGACGG CCACCCGGTG TTCGCCTCGC GCATGAAGAA GCGACTGCGC GAAGGCGCGA AGCTGATCGT CGTCGACCCG CGCAGGATCG ACCTGGTGAA GTCGCCGCAC GTGAAGGCCG ACCACCATCT TCAGCTGCGT CCGGGCACCA ACGTCGCGGT GATCACCGCG CTGGCGCACG TGATCGTCAC CGAGGGCTTG CTCGACGAGG CTTACATCGC CGAACGCTGC GAGGACAAGG CCTTCCGCGA GTGGCGCGAG TTCGTGTCGC GCGATGCCAA CTCCCCTGAG GCGACCGCCG TCGTCACCGG CGTGCCGGCC ACCGAGCTGC GCGCCGCCGC CCGACTGTTC ACCACCGGCA GATCCGACGG CACGGCTGCC ACGCTGCAGG CCCGCGGGGC GCGGCCCAAC GCCGCCATCT ACTACGGCCT GGGCGTGACC GAGCACAGCC AGGGTTCGAC GATGGTGATG GGCATCGCCA ACCTCGCGAT GGCCACCGGC AACGTCGGCC GCGAGGGGGT GGGCGTGAAC CCGCTGCGCG GCCAGAACAA CGTGCAGGGT TCGTGCGACA TCGGCTCCTT CCCGCACGAG TTGCCGGGCT ACCGCCACGT GTCGGACAGC AGCACGCGGG CGCTGTTCGA GAACGCCTGG AACGTCGAGC TGCAGCCCGA ACCCGGCCTG CGCATCCCCA ACATGTTCGA CGCCGCGCTG TCGGGCAGCT TCATGGGCCT GTACTGCGAG GGCGAGGACA TCGTGCAGTC CGACCCGGAC ACGCAGCACG TCGCGCATGC GTTGTCCTCC ATGGAATGCA TCGTCGTGCA GGACCTGTTC CTGAACGAGA CCGCGAAGTA CGCGCACGTC TTCCTGCCGG GCTCGTCCTT CCTGGAGAAG GACGGCACCT TCACCAACGC CGAGCGCCGC ATCTCGCGCG TGCGCAAGGT GATGCCGCCC AAGGCAGGCC TCGCCGACTG GGAGGTGACG GTGAAGCTGT CGAACGCGCT CGGCTACCCG ATGGACTACA CCCACCCCGA GCAGATCATG GCCGAGATCG CCGCGCTGAC GCCCACCTTC TCCGGCGTCA GCTACGAGAA GCTCGACCGT CTGGGCAGCA TCCAGTGGCC GTGCAACGAT GAGGCCCCCG AGGGCACGCC GACGATGCAC ATCGACCGCT TCGTGCGCGG CAAGGGCAAG TTCTTCATCA CGCAGTACGT CGCCTCCGAC GAGAAGGTCA CGCGCCGGTT CCCGCTGCTG CTGACGACAG GGCGCATCCT GTCGCAGTAC AACGTCGGCG CGCAGACGCG CCGCACCGAG AACAACCAGT GGCACAGCGA GGACCGGCTG GAGCTGCACC CGCACGACGC CGAGGAGCGC GGCATCCGGG ACGGCGACTG GGTCGGCATC GAGAGCCGCT CGGGCCAGAC CGTGCTGCGC GCGCAGGTCA GCGACCGCAT GCAGGCCGGC GTGGTCTACA CGACCTTCCA CTTCCCCGAG TCGGGCGCCA ACGTCATCAC CACCGACAGC TCCGACTGGG CCACCAACTG CCCCGAGTAC AAGGTGACCG CGGTGCAGGT GCTGCCGGTG ATGCAGCCTT CGGGCTGGCA GAAGGCCTAC AGCCGCTTCA CCGAGGTGCA GGAGCGCCTG CTCGCCGAAC GGCACAAGGC CGAGCCGGCC CTCGCCGGCG CGAAGAAGCC ATGA
|
Protein sequence | MLDPHAEIDY GTPASASTQA VTLEIDGQAV TVPAGTSLMR AAIDAGVQVP KLCATDSLEP FGSCRLCLVE IEGRRGYPAS CTTPAEAGMK VRTQSPKLQE LRKGVMELYI SDHPLDCLTC AANGDCELQD MAGVTGLREV RYGVGDQYGG AHHLKSAKDE SNPYFTYDPS KCIVCNRCVR ACEETQGTFA LTISGRGFES RVSAGQDQPF MESECVSCGA CVQACPTATL QEKSVIWLGQ AEHSVTTTCA YCGVGCGFKA EMKGNEVVRM VPWKNGQANE GHSCVKGRFA WGYATHKDRI TTPMIRKRIT DPWQEVSWDE AIGYAASEFK RIQAKYGRDA IGGIVSSRCT NEEGYLVQKL VRAAFGNNNV DTCARVCHSP TGYGLKQTLG ESAGTQTFKS VEKSDVIMVI GANPTDGHPV FASRMKKRLR EGAKLIVVDP RRIDLVKSPH VKADHHLQLR PGTNVAVITA LAHVIVTEGL LDEAYIAERC EDKAFREWRE FVSRDANSPE ATAVVTGVPA TELRAAARLF TTGRSDGTAA TLQARGARPN AAIYYGLGVT EHSQGSTMVM GIANLAMATG NVGREGVGVN PLRGQNNVQG SCDIGSFPHE LPGYRHVSDS STRALFENAW NVELQPEPGL RIPNMFDAAL SGSFMGLYCE GEDIVQSDPD TQHVAHALSS MECIVVQDLF LNETAKYAHV FLPGSSFLEK DGTFTNAERR ISRVRKVMPP KAGLADWEVT VKLSNALGYP MDYTHPEQIM AEIAALTPTF SGVSYEKLDR LGSIQWPCND EAPEGTPTMH IDRFVRGKGK FFITQYVASD EKVTRRFPLL LTTGRILSQY NVGAQTRRTE NNQWHSEDRL ELHPHDAEER GIRDGDWVGI ESRSGQTVLR AQVSDRMQAG VVYTTFHFPE SGANVITTDS SDWATNCPEY KVTAVQVLPV MQPSGWQKAY SRFTEVQERL LAERHKAEPA LAGAKKP
|
| |