Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4582 |
Symbol | |
ID | 5835114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 5117029 |
End bp | 5119998 |
Gene Length | 2970 bp |
Protein Length | 989 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641370376 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001642021 |
Protein GI | 163853978 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.656955 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACG GCCCCGAACC GCACGGCAAC AAGATCGAAC AGCCCGAGAT CCGCGCCGAC GAACGTCAGG ATGCCGGCGG GCCGGCAAAT GGCGCACCAT CGACCTCCGG CGGCGCCTAC TCTCAGGGCG CCAAGTCGGG TGGCCAGGCC GCGCCCGATC CTTCGGGCAC CTACGGCATC AAGGACGCCC CGGTCGCGCC CGCGACCATC GCCTTCGAGT TCGATGGCCA ACAGGTCGAG GCCCAGCCCG GCGAGACGAT TTGGGCGGTT GCCAAGCGCC TCGGCACGCA TATCCCGCAT CTCTGCCACA AGCCCGATCC CGGCTACCGC CCGGACGGCA ATTGCCGCGC CTGCATGGTC GAGATCGAGG GCGAGCGCGT GCTTGCCGCC TCGTGCAAGC GCACGCCCGC CATCGGCATG AAGGTGAAGT CGGCCACCGA GCGCGCCACC AAGGCCCGCG CCATGGTGCT CGAACTGCTC GTGGCCGATC AGCCCGAGCG CGCAACCTCG CATGACCCGT CGTCGCATTT CTGGGTGCAG GCCGACGTTC TCGACGTGAC CGAGAGCCGC TTCCCGGCGG CCGAGCGCTG GACCAGCGAC GTCAGCCACC CGGCGATGAG CGTCAATCTC GACGCCTGCA TCCAGTGCAA TCTGTGTGTG CGCGCCTGCC GCGAGGTTCA GGTCAACGAC GTGATCGGCA TGGCCTACCG CGCCGCGGGC TCCAAGGTCG TGTTCGACTT CGACGATCCG ATGGGTGGCT CCACCTGCGT CGCCTGCGGT GAGTGCGTCC AGGCCTGCCC GACCGGCGCG CTGATGCCGG CCGCCTATCT CGACGCCAAC CAGACCCGGA CGGTCTATCC CGACCGCGAG GTGAAGTCGC TCTGCCCCTA TTGCGGCGTC GGCTGCCAAG TCTCCTACAA GGTCAAGGAC GAGCGCATCG TCTACGCCGA GGGCGTGAAC GGACCGGCCA ACCAGAACCG GCTCTGCGTG AAGGGCCGCT TCGGCTTCGA CTACGTCCAC CACCCCCACC GCCTGACGGT GCCGCTGATC CGCCTGGAGA ACGTGCCCAA GGACGCCAAC GATCAGGTCG ATCCGGCGAA CCCCTGGACG CATTTCCGCG AGGCGACCTG GGACGAGGCG CTCGACCGCG CGGCGGGCGG CCTGAAGGCG ATCCGTGACA CCAACGGGCG CAAGGCGCTG GCGGGCTTCG GCTCGGCCAA GGGTTCGAAC GAGGAGGCCT ACCTCTTCCA GAAGCTCGTC CGCCTCGGCT TCGGCACCAA CAACGTCGAT CACTGCACGC GCCTGTGCCA CGCCTCGTCG GTCGCTGCGC TGATGGAGGG CTTGAATTCC GGCGCCGTCA CCGCCCCCTT CTCGGCAGCG CTCGACGCCG AGGTCATCGT CGTCATCGGC GCCAACCCGA CCGTGAACCA TCCGGTCGCG GCGACCTTCC TCAAGAACGC GGTGAAGCAG CGCGGCGCCA AGCTGATCAT CATGGACCCG CGGCGCCAGA CGCTCTCGCG CCACGCCTAT CGGCACCTCG CCTTCCGCCC CGGCTCGGAC GTGGCGATGC TCAACGCGAT GCTCAACGTG ATCGTCACGG AGGGCCTCTA CGACGAGCAG TACATCGCCG GCTACACCGA GAACTTCGAG GCTCTGCGCG AGAAGATCGT CGACTTCACG CCGGAGAAGA TGGCTTCGGT CTGCGGCATC GATGCCGAGA CCCTGCGTGA GGTCGCCCGG CTCTATGCCC GGGCCAAGTC GTCGCTCATC TTCTGGGGCA TGGGCGTCAG CCAGCACGTC CACGGCACCG ACAACTCGCG CTGCCTGATC GCGCTCGCCC TCATCACCGG CCAGATCGGC CGGCCCGGCA CCGGCCTGCA CCCGTTGCGC GGCCAGAACA ACGTCCAGGG CGCGTCCGAT GCCGGCCTGA TCCCGATGGT CTACCCGGAC TATCAGTCGG TCGAGAAGGA TGCGGTGCGC GAGCTGTTCG AGGAGTTCTG GGGTCAGTCC CTCGATCCGC AGAAGGGCCT CACCGTGGTC GAGATCATGC GCGCGATCCA CGCGGGCGAG ATCCGGGGCA TGTTCGTCGA GGGCGAGAAC CCGGCGATGT CCGACCCCGA CCTCAACCAC GCCCGCCACG CGCTGGCGAT GCTCGACCAT CTCGTGGTGC AGGACCTGTT CCTGACGGAG ACGGCCTTCC ACGCCGACGT GGTGCTGCCG GCCTCGGCCT TTGCCGAGAA GGCCGGGACC TTCACCAACA CCGACCGGCG TGTGCAGATC GCCCAGCCCG TCGTCGCCCC TCCGGGCGAT GCGCGCCAGG ATTGGTGGAT CATCCAGGAA CTGGCCCGAC GCCTCGACCT CGACTGGAAC TACGGCGGCC CGGCCGACAT CTTCGCCGAG ATGGCGCAGG TGATGCCGTC CTTGAACAAC ATCACCTGGG AGCGGCTGGA GCGCGAGGGG GCGGTGACCT ATCCGGTCGA TGCTCCGGAC CAGCCCGGCA ACGAGATCAT CTTCTATGCC GGCTTCCCGA CCGAGAGCGG GCGCGCCAAG ATCGTGCCCG CGGCGATCGT GCCGCCGGAC GAGGTGCCGG ACGACGAGTT CCCGATGGTG CTCTCGACCG GCCGCGTGCT CGAACACTGG CACACGGGCT CGATGACCCG GCGCGCGGGC GTGCTCGACG CGCTGGAGCC GGAGGCGGTG GCCTTCATGG CACCCAAGGA GCTCTACCGG CTCGGTCTCC GGCCCGGCGG GTCGATGCGG TTGGAAACAC GGCGCGGCGC CGTCGTGTTG AAGGTGCGCT CCGACCGGGA CGTGCCGATC GGCATGATCT TCATGCCCTT CTGCTACGCG GAAGCCGCCG CCAACCTTCT GACCAACCCC GCCCTCGACC CCCTTGGAAA GATTCCCGAG TTCAAATTCT GCGCAGCCCG CGTCGTCCCC GCGGAGGCTG CGCCGATGGC CGCCGAGTAA
|
Protein sequence | MSNGPEPHGN KIEQPEIRAD ERQDAGGPAN GAPSTSGGAY SQGAKSGGQA APDPSGTYGI KDAPVAPATI AFEFDGQQVE AQPGETIWAV AKRLGTHIPH LCHKPDPGYR PDGNCRACMV EIEGERVLAA SCKRTPAIGM KVKSATERAT KARAMVLELL VADQPERATS HDPSSHFWVQ ADVLDVTESR FPAAERWTSD VSHPAMSVNL DACIQCNLCV RACREVQVND VIGMAYRAAG SKVVFDFDDP MGGSTCVACG ECVQACPTGA LMPAAYLDAN QTRTVYPDRE VKSLCPYCGV GCQVSYKVKD ERIVYAEGVN GPANQNRLCV KGRFGFDYVH HPHRLTVPLI RLENVPKDAN DQVDPANPWT HFREATWDEA LDRAAGGLKA IRDTNGRKAL AGFGSAKGSN EEAYLFQKLV RLGFGTNNVD HCTRLCHASS VAALMEGLNS GAVTAPFSAA LDAEVIVVIG ANPTVNHPVA ATFLKNAVKQ RGAKLIIMDP RRQTLSRHAY RHLAFRPGSD VAMLNAMLNV IVTEGLYDEQ YIAGYTENFE ALREKIVDFT PEKMASVCGI DAETLREVAR LYARAKSSLI FWGMGVSQHV HGTDNSRCLI ALALITGQIG RPGTGLHPLR GQNNVQGASD AGLIPMVYPD YQSVEKDAVR ELFEEFWGQS LDPQKGLTVV EIMRAIHAGE IRGMFVEGEN PAMSDPDLNH ARHALAMLDH LVVQDLFLTE TAFHADVVLP ASAFAEKAGT FTNTDRRVQI AQPVVAPPGD ARQDWWIIQE LARRLDLDWN YGGPADIFAE MAQVMPSLNN ITWERLEREG AVTYPVDAPD QPGNEIIFYA GFPTESGRAK IVPAAIVPPD EVPDDEFPMV LSTGRVLEHW HTGSMTRRAG VLDALEPEAV AFMAPKELYR LGLRPGGSMR LETRRGAVVL KVRSDRDVPI GMIFMPFCYA EAAANLLTNP ALDPLGKIPE FKFCAARVVP AEAAPMAAE
|
| |