Gene Information |
Locus tag | Mext_4406 |
Symbol | |
ID | 5834743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4902926 |
End bp | 4905778 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641370199 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001641845 |
Protein GI | 163853802 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
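The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4902926
end_bp = 4905778

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with "Gene Length"
print(protein_length, "aa")  # compare with "Protein Length"
```

Under these assumptions the computed values agree with the reported Gene Length (2853 bp) and Protein Length (950 aa).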
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTCA TCAAGGAAAT CGACTACGGC ACGCCGATCC GGCTCAATGA GCAGACGGTG ACGCTGACCA TCGACGGCGA GAGCGTGACG GTGCCCGCGG GCACCTCGGT CATGGCCGCC GCGATGCATA TGGGCACCAA GATCCCGAAG CTCTGCGCCA CGGATTCGCT GGAGCCGTTC GGCTCCTGCC GGATGTGCCT CGTCGAGATC GACGGCCGCC GCGGCACGCC CGCCTCCTGC ACCACCCCGG CCGAGAATGG CATGGTCGTG CACACGCAGA CCGACAAGCT GCATCGGCTG CGCAAGGGCG TGATGGAGCT CTACATCTCC GACCACCCGC TCGACTGCCT GACCTGCGCC GCCAACGGCG ATTGCGAGCT GCAAGATACT GCAGGTCAGG TCGGCCTGCG CGAGGTCCGC TACGGCTATG ACGGCGACAA CCACGTCAAG CCGGCCTCCG ACCGCTACCT GCCCAAGGAC GAGTCGAACC CCTACTTCAC CTACGACCCG TCGAAGTGCA TCGTCTGCAA CCGCTGCGTC CGCGCCTGCG AGGAGACGCA GGGCACCTTC GCGCTGACCA TCGAGGGCCG CGGCTTCGAC AGTCGCGTGG CGGCGGGCCC GACCAACTTC ATGCAGTCCG AATGCGTGTC CTGCGGCGCC TGCGTCCAGG CCTGCCCGAC CGCGACGCTG CAGGAGAAGA CGATCCACCA ATACGGCCAG CCGGACCATT CCGAGGTCAC GACCTGCGCC TATTGCGGCG TCGGCTGCGC CTTCAAGGCC GAGATGCAGG GCGACAAGGT CGTCCGCATG GTGCCCTACA AGGGCGGCAA GGCGAACGAG GGACATAGCT GCGTCAAGGG CCGCTTCGCC TACGGCTACG CCACCCACAA GGACCGCATC ACCAAGCCGA TGATCCGCGC CAAGATCACG GATCCGTGGC GCGAGGTGTC GTGGGAGGAG GCGATCAATC ACGCCGCCTC CGAGTTCAAG CGCATCCAGG CGACCTACGG CCGCGACTCG GTCGGCGGCA TCACCTCGTC GCGCTGCACC AACGAGGAAG CCTACCTCGT CCAGAAGCTG GTGCGCGCCG CCTTCGGCAA CAACAACGTC GATACCTGCG CCCGCGTCTG CCACTCGCCG ACCGGCTACG GCCTGATGTC CACGCTCGGC ACCTCCGCCG GCACGCAGGA CTTCAAGTCG GTCGAGGAAT CCGACGTGAT CCTCGTCATC GGCGCCAACC CGACCGACGG CCACCCCGTC TTCGGCTCGC GGATGAAGAA GCGGCTGCGT GAGGGCGCCC GCCTCATCGT CGCCGACCCG CGCAAGATCG ACCTCGTGAA GTCGCCCCAC ATCCGGGCCG AGCATCACCT GCCGCTCAAG CCCGGCTCCA ACGTCGCCTT CATCAACGCC TTCGCCCACG TCATCGTCAC GGAAGGGCTG ATCGCCGAGG ACTACGTCCG CGAGCGCTGC GATCTGGCCG AGTTCGAGTC CTGGGCCCGC TTCATTGCCG AGGAGCGCAA CTCGCCGGAA GCCGCGCAGG CCATCACCGG CGTCGATCCA CAGGAGATCC GCGCCGCGGC CCGGCTCTAC GCCACCGGCG GCAAGGCGGC GATCTACTAC GGGCTCGGCG TGACCGAGCA CAGCCAGGGC TCGACCATGG TGATGGGCAT GGCCAACATC GCCATGGCCA CCGGCAATAT CGGCATGGTG GGCGCCGGCG TGAACCCGCT GCGCGGCCAG AACAACGTGC AGGGCTCCTG CGACATGGGC TCGTTCCCGC ACGAGCTGCC GGGCTACCGC 
CACGTCTCGG ACGACGCCAC CCGCGAGAGC TTCGAGGCGA TCTGGGGGGC CAAGCTCGAC AACGCGCCGG GCCTGCGCAT CACCAACATG CTGGACGAGG CCGTCGGCGG CAGCTTCAAG GGCATGTACA TCCAGGGCGA GGACATCGCG CAGTCCGACC CCGACACCCA CCACGTCACC TCCGGCCTCA AGGCCATGGA GTGCATCGTG ATCCAGGACC TGTTCCTGAA CGAGACCGCC AAATACGCCC ACGTCTTCCT GCCGGGCGCT TCCTTCCTCG AGAAGGACGG CACCTTCACC AATGCCGAGC GCCGCATCTC CCGCGTGCGC AAGGTGATGG CCCCGATGGG CGGCTACGGC GATTGGGAGG GCACGGTGCT GCTCGCCAAC GCCCTGGGCT ACAAGATGGA GTACACCCAC CCGTCCGAGA TCATGGACGA GATCGCGGCG CTCACCCCGA GCTTCGCCGG CGTCTCCTAT GACAAGCTGG AGGAGCTGGG CTCGATCCAG TGGCCGTGCA ACGAGAAGGC GCCGCTCGGC ACGCCGATGA TGCACGTCGA CCGGTTCGTG CGCGGCAAGG GCCGCTTCAT GATCACGGAA TACGTGCCCA CCGACGAGCG GACCACGGGC AAGTTCCCGC TGATCCTCAC CACGGGCCGC ATCCTCTCGC AGTACAATGT CGGCGCGCAG ACGCGGCGCA CCGAGAACTC CCGCTGGCAC GAGGAAGACG TGCTGGAGAT CCACCCCTTC GACGCGGAGA TGCGCGGCAT CGTCGATGGC GACCTCGTCG CCCTGGAGAG CCGCTCGGGC GACATCGCGC TGAAGGCCAA GGTGACCGAG CGGATGCAGC CGGGGATCGT CTATACGACG TTCCACCACG CCAAGACCGG CGCCAACGTC ATCACCACGG ACTATTCGGA CTGGGCGACC AACTGCCCCG AATACAAGGT GACGGCGGTG CAGGTGCGGC GCACGAACCG GCCCTCGGAC TGGCAGGCGA AGTTCTACGA GGAGGACTTC TCGCTCACCC GCATCGCCGA AGCGGCGGAA TAG
|
Protein sequence | MTLIKEIDYG TPIRLNEQTV TLTIDGESVT VPAGTSVMAA AMHMGTKIPK LCATDSLEPF GSCRMCLVEI DGRRGTPASC TTPAENGMVV HTQTDKLHRL RKGVMELYIS DHPLDCLTCA ANGDCELQDT AGQVGLREVR YGYDGDNHVK PASDRYLPKD ESNPYFTYDP SKCIVCNRCV RACEETQGTF ALTIEGRGFD SRVAAGPTNF MQSECVSCGA CVQACPTATL QEKTIHQYGQ PDHSEVTTCA YCGVGCAFKA EMQGDKVVRM VPYKGGKANE GHSCVKGRFA YGYATHKDRI TKPMIRAKIT DPWREVSWEE AINHAASEFK RIQATYGRDS VGGITSSRCT NEEAYLVQKL VRAAFGNNNV DTCARVCHSP TGYGLMSTLG TSAGTQDFKS VEESDVILVI GANPTDGHPV FGSRMKKRLR EGARLIVADP RKIDLVKSPH IRAEHHLPLK PGSNVAFINA FAHVIVTEGL IAEDYVRERC DLAEFESWAR FIAEERNSPE AAQAITGVDP QEIRAAARLY ATGGKAAIYY GLGVTEHSQG STMVMGMANI AMATGNIGMV GAGVNPLRGQ NNVQGSCDMG SFPHELPGYR HVSDDATRES FEAIWGAKLD NAPGLRITNM LDEAVGGSFK GMYIQGEDIA QSDPDTHHVT SGLKAMECIV IQDLFLNETA KYAHVFLPGA SFLEKDGTFT NAERRISRVR KVMAPMGGYG DWEGTVLLAN ALGYKMEYTH PSEIMDEIAA LTPSFAGVSY DKLEELGSIQ WPCNEKAPLG TPMMHVDRFV RGKGRFMITE YVPTDERTTG KFPLILTTGR ILSQYNVGAQ TRRTENSRWH EEDVLEIHPF DAEMRGIVDG DLVALESRSG DIALKAKVTE RMQPGIVYTT FHHAKTGANV ITTDYSDWAT NCPEYKVTAV QVRRTNRPSD WQAKFYEEDF SLTRIAEAAE
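The relationship between the gene sequence and the protein sequence can be illustrated with a short, dependency-free sketch. It assumes the codon assignments of translation table 11, which for elongation codons are the same as the standard genetic code (table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG). The helper names clean, gc_fraction and translate are hypothetical and exist only for this example. Only the first three and the last two 10-base blocks of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, as printed in the record.
print(translate("ATGACCCTCA TCAAGGAAAT CGACTACGGC"))  # MTLIKEIDYG

# Last two blocks (in frame), ending in the TAG stop codon.
print(translate("GCGGCGGAA TAG"))  # AAE
```

The two outputs match the first ten and the last three residues of the protein sequence. Passing the full gene sequence to gc_fraction should give a value comparable to the GC content field, although the rounding used by the database is not stated in the record.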