Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0337 |
Symbol | |
ID | 4786887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 367484 |
End bp | 370351 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640088892 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001019534 |
Protein GI | 124265530 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCAC CGCTGACTTC GAAGCTGCTC GCGCCACAGG CCGCGATGGT GGAGTTCTCG CTCGACGGCC GCACGGTCAG CGCCGGCGCT GACGAGACGA TCTGGACCGT CGCGAAGCGC GAAGGCACGA CCATCCCCCA CCTGTGCCAC AAGGAGGGCC TGACGCCAGC CGGCAACTGC CGCGCCTGCG TGGTCGAGGT GGAGGGCGAA CGGGCACTGG CCGCCTCGTG CTGCCGCAAC GTCGCCGCCG GCATGAAGGT GCAGACGCAG AGCCCGCGCG CCGCGTCGGC CCGCAAGATG GTGGTCGAGC TGCTGGTCAG CGACGCCGGT GCCACCACCG ACACCTACAC GAAGGCCTCG GAACTCAGCC AATGGGCCGA ACAGCTGGCC GTGCCGCGCG GGCGCCTGCC GCAGCGCGAC GCGCCGCATT ACGCGGCGGC CGACCCTTCG CACCCGGCGA TCGCGGTGAA CCTGGACGCC TGCATCCAGT GCACGCGCTG CCTGCGCGCC TGCCGCGACG AGCAGGGCAA CGACGTGATC GGCCTGGCCT TCCGCGGCGC GCATGCGCAG ATCACCTTCG ATGCCGGTGC GACGCTCGGC GAGTCGAGCT GCGTGGCCTG CGGCGAATGC GTGCAGGCCT GCCCGACCGG CGCACTGATG CCGGCCCGCG GCGCCGGGCT GCTGGAAGTG ACGAAGCAGG TGGATTCGGT CTGCCCCTAC TGCGGCGTAG GCTGCCAGCT CACCTGGAAC GTCGGTCCGA ACGCACAGGG CGAGGAGCGC ATCCATTTCG TGACCGGCCG CGACGGCCCG GCCAACCACG GGCGGCTGTG CGTGAAGGGT CGCTATGGCT TCGACTACAT CCACAACCCG CGCCGCCTGA CCACGCCGCT GATCCGACGC GAAGGCGTGG CCAAGGACCC GGCCGACATC GAGCGCCTCA AGCAGGGCCA GCTGAAGCCG ACGGACATCT TCCGCACGGC GACCTGGGAC GAGGCGATGG AGCTCGCGGC CGGCGGCCTG GCCCGGCTGC GCGACGAGGC GCTGGCGGCC GGCCTGCGCG GCAACGACAT CCCGCTCGCG GGCTTCGGCT CGGCCAAGGG CAGCAACGAG GAGGCCTACC TGTTCCAGAA GCTGGTGCGG CAGGGCTTCC GGACCAACAA CGTCGATCAC TGCACGCGGC TGTGCCATGC CAGCTCGGTG GCGGCGCTGC TGGAGGGCAT CGGCTCGGGG GCGGTCAGCA ACCCGGTGGA GGATGTCGCC CACGCCGACC TGATCTTCCT GATCGGCGCG AACCCGGCGG TGAACCACCC GGTCGCGGCG AGCTGGATCA AGAACGCGGT CGACCGCGGC GCCCGGCTGG TGATCTGCGA TCCGCGTCAC ACCGCGCTGA CGCGTCGCGC CACCTGGCAC CTGCAGTTCC GTCCCGACAC CGACGTCGCG CTGCTCAACG GGCTGCTGCA CGTGATCGTC GCCGAGGGAC TGGTCGATGA GGCCTTCGTC GCGGCGCGCG TCAACGGCTA CGAGGCACTG AAGGCCTCGG TGGCCGAGGC GACGCCGGAG CGCATGAGCG AGATCTGCGG CATCGACGCG CAGACGATCC GCGACGTGGC CCGCGCCTAT GCCACCAGCA AGGGCTCGAT GATTCTCTGG GGCATGGGCG TGAGCCAGCA TGTGCACGGC ACCGACAACG CGCGCGGGCT GATCGCGTTG GCGATGCTGA CTGGCCAGAT CGGACGTGTC GGCACCGGCC TGCATCCGCT GCGCGGCCAG AACAACGTGC AGGGCGCCAG CGACGCCGGG CTGATCCCGA TGATGCTGCC CAACTACCAG CGCGTCATCA ACCCGACGGT GCGCCAGGCC TTCGAGCGCC TGTGGGCCAC GCCCGAGCCA CTGGATGCGA CGCCCGGCCT GACCGTCGTC GAGATCATGC ATGCGGCCAG CGAAGGCCGC ATCCGCGGCA TCTACGTCGA GGGCGAGAAC CCGGCGATGT CGGACCCCGA CCTCAGCCAT GCCCGCCGGG CACTGGCAGG CCTGGAGCAT CTGGTCGTGC AGGACATCTT CCTCACCGAG ACCGCGATGC TGGCCGACGT GGTGCTGCCG GCCTCGGCCC ATGCCGAGAA GTGGGGCAGC TACACGAACA CCGACCGGCT GATCCAGATC GGCCGCCCCG CGCTCGATCC GCCCGAGCTC GCGATGCAGG ATCTGTGGAT CATCGAGCGC GTCGGCCGGC GTCTGGGCCT GGCCTGGAAC TACTGGCGCG ACGAAGACGG CGGCGGCAAG CGCGCCTCGC AGGCGGCGGT GGCGCGCGTC TACGAGGAGA TGCGCGTCAG CATGCCGCCG CTGGCCGGCG TCCCCTGGAG CCGCCTGGTC AAGGCCGACG CGGTGATGAC GCCCGCGGCG AGCGAGGACG ACCCCGGCGC TGCGGTGGTC TTCATCGATC GCTTCCCGAC GGCCGACGGC CGAGCGACCG TGGTACCGAC CGTGTTCCGC CCCGGCGCCG AGCAGATCGA CGCCGAGTAC CCCTTCGTCC TGACCACCGG CCGCGTGCTC GAGCATTGGC ACACCGGCGC GATGACACGG CACGCCAGCA TGCTGGACGC CATCGCGCCC GAGGCGCTGG TGTCGCTGCA TCCGGCGGAT GCGCTGACGG TCGGCGTGCG CGACGGCCAG GCGGTGCTGA TGTCGACGCG GCACGGTGCG GTGCAGGCGC GCGTGCGCGT CAGCACCGAG GTGCAGCCCG GCCAGGTGTT TCTGCCGTTC GCCTTCTGGG AGGCGGCGGC GAACAAGCTG ACCGGCGACG CCCTGGACGA CGTGGCGAAG ATCCCTGGCT TCAAGGTCAC GGCCGCCAAG CTCAGCGTGA TCGCCTGA
|
Protein sequence | MNAPLTSKLL APQAAMVEFS LDGRTVSAGA DETIWTVAKR EGTTIPHLCH KEGLTPAGNC RACVVEVEGE RALAASCCRN VAAGMKVQTQ SPRAASARKM VVELLVSDAG ATTDTYTKAS ELSQWAEQLA VPRGRLPQRD APHYAAADPS HPAIAVNLDA CIQCTRCLRA CRDEQGNDVI GLAFRGAHAQ ITFDAGATLG ESSCVACGEC VQACPTGALM PARGAGLLEV TKQVDSVCPY CGVGCQLTWN VGPNAQGEER IHFVTGRDGP ANHGRLCVKG RYGFDYIHNP RRLTTPLIRR EGVAKDPADI ERLKQGQLKP TDIFRTATWD EAMELAAGGL ARLRDEALAA GLRGNDIPLA GFGSAKGSNE EAYLFQKLVR QGFRTNNVDH CTRLCHASSV AALLEGIGSG AVSNPVEDVA HADLIFLIGA NPAVNHPVAA SWIKNAVDRG ARLVICDPRH TALTRRATWH LQFRPDTDVA LLNGLLHVIV AEGLVDEAFV AARVNGYEAL KASVAEATPE RMSEICGIDA QTIRDVARAY ATSKGSMILW GMGVSQHVHG TDNARGLIAL AMLTGQIGRV GTGLHPLRGQ NNVQGASDAG LIPMMLPNYQ RVINPTVRQA FERLWATPEP LDATPGLTVV EIMHAASEGR IRGIYVEGEN PAMSDPDLSH ARRALAGLEH LVVQDIFLTE TAMLADVVLP ASAHAEKWGS YTNTDRLIQI GRPALDPPEL AMQDLWIIER VGRRLGLAWN YWRDEDGGGK RASQAAVARV YEEMRVSMPP LAGVPWSRLV KADAVMTPAA SEDDPGAAVV FIDRFPTADG RATVVPTVFR PGAEQIDAEY PFVLTTGRVL EHWHTGAMTR HASMLDAIAP EALVSLHPAD ALTVGVRDGQ AVLMSTRHGA VQARVRVSTE VQPGQVFLPF AFWEAAANKL TGDALDDVAK IPGFKVTAAK LSVIA
|
| |