Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sfum_3510 |
Symbol | |
ID | 4458177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | + |
Start bp | 4292004 |
End bp | 4295051 |
Gene Length | 3048 bp |
Protein Length | 1015 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639704282 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_847616 |
Protein GI | 116750929 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.722583 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.530799 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAGTCA CACGGCGTGA GTTCCTTTTC ATCTCGGGAG CCATGGGAGC CGGTCTGGCA CTGTCGTCTC TGGGGGTCGA CATGCTTCCG GTGGTGGCCT ATGCGGAGGG GCTCAGTAAA ATCGACAAAG TGAAAAGCGC GAAGGAGATG TATTCCCTCT GCTACCACTG CGCGGTAACT TGCGGGCTGA TATGCAGCAC CGACACAAAA ACCGGCAAAA TCATAAACAT CGAGGGGGAT CCGGACAATC CCATCAACGA GGGATCGCTT TGCGCGAAAG GTGCGGCGTC GCTCCAGATG TCGGCCAGAA ATGAAAACCG TCTGACCAGG GTTCTCTATC GCAAACCGGG CGGCAACCAA TGGGAGGTCA AGACGTGGGA CTGGGCCTTG ACCAGGATCG CCAAGAACAT CAAGGCGGTA CGGGACAAGG AATTCATTGT CAAGAACGCC AAAGGGCAGG TGGTCAACCG TGTCGAAGCC CTGGCTCACA TGGGCAGCTC CAAGCTGGAC AATGAAGAGT GCTGGATGAT CACCACCGCC ATGAGGGCAC TTGGTCTCGT CTATTTGGAT CACCAGGCTC GGGTCTGTCA CGCTCCCAGC GTGGCCTCGC TGGCTGAGTC GTTGGGACGT GGTTCGATGA CCAATCATCC CATCGACATC GGGAACAGCG ATTGTGTCCT CGTCATGGGG GGCAACGCGG CGGAAGCTCA CCCCATCACC TTCAGGTGGG CCGTGCGTGC CAAGGATAAG GGGGCGAAAA TCATCCACGT CGATCCGCGC TTCACAAGGA CCTCGGCCAT AGCGGATCAT CATGCGTTCC TTCGTGCAGG CACGGACATC GCCTTCCTGG GTGGCATGAT CAAGTACATT CTCGACAACA ATAAGTACTT CGAGGAGTAC GTCAGGAACT ACACCAATGC CTCGTGCAGC ATAGACGAGA AGTTCAGCTT CAAGGACGGA CTTTTCTCGG GTTACGACCC CGCTGCCAGG AAATACGACA AGTCGACCTG GGTGTTCAGG CTCGATGCCA ACGGCGTCCC GGAAAAGGAC CCGACGCTCA AGAATCCACG GTGCGTCCTG CAGATGATGA AGGCCCATTA CGCGCGCTAC GATCTTGAGA AGGTGTCGTC CATTACGGGC ACCCCTGTGG AAGATCTCAA GACCATTTAT GAACTGTATT CCTCGACCGG GGTGAGAGAT AAAGCGGGGA CGATCCTCTA CGCACTCGGA TGGACCCAGC ACACGGTGGG GGTGCAGAAC ATCAGAACGA TGACCATCGT TCAGCTCCTG CTCGGGAATA TCGGCATCGC GGGGGGAGGT GTCAATGCAT TGCGCGGCCA GCCGAACGTC CAGGGATCGA CGGATCAGGC GATCCTGGCC CACATTCTTC CAGGCTATCT CAAGACGCCG GCGGCTTCAC TGACCACGCT GGACGAGTAC CTGGCTAGGA ACACTCCCAA GACCAGGGAG CCCCAGTCGG CCAACTGGTA TCAAAATACC CCGAAGTACA TGGTGAGCCT CCTCAAATCA TGGTACGGCG ACAAGGCCAT CAAAGAGAAC GGTTTTGCCT ATTCATACCT TCCTAAGATC GATGACGGGC AGGATGCGAC GCTTCTGGAC ATGATCGACA AGATGTATGC AGGGAAGATC AAAGGCTTTA CGTGCGTCGG CCAGAATCCG GCGTGCAGCA ACCCCAATGC CGGCAAAACC CGGAAGGCCC TGGCCAACCT TGACTGGATG GTGCACATCA ACATTTTCGA CAATGAGACG GCCTCTTTCT GGAAGGGGCC GGGGATGGAT CCGAAAAAGA TCAAGACGGA GGTCTTTCTC CTGCCGGCGG CGGCTCAAAT GGAAAAAGCG GGCAGCATGA CCAATACGGG ACGTTGGCTC CAATGGAAAT ACACGGCGGA GAAACCGCCG GGAGACGCCC TCTCGATGGG CGATATCCTG TACCGGCTGG TAATGAAACT CAAAGACCTC TACAAGAAGG AAAAGGGGAC CTTCCCCGAT CCCATTCTCG ATCTGCAATG GAATTATGCG GATGTCAAGG GGATGTATGA TTCCACGGCG GTCGCCAAGG CGATGAACGG ATATTTCCTG GATGACGTCA CCGTTGGAGA CAAGTCTTTC AAAAAGGGGG AGTGTGTTCC CGGCTTCAGT TACCTGCAGG CCGATGGAAA GACCTCCTGC GGCATATGGA TCCATTGCGG GAGCTTCACA CAGGATGGAA CCAACCTGAT CGCCCGAAGG AAGAAAGACG ATCCGACGGG GCTCGGCCTC TATCCCGAAT GGGCGTGGGC ATGGCCGATG AACCGGCGCA TCCTCTACAA TCGGGCGTCG GTCGATGAGA AGGGGCAGCC CTGGGATCCC AGGCGGGCGG TCCTCAAGTG GGCCGACGGA AAGTGGGTGG GCGACGTTCC CGACGGGCCA TGGCCGCCCC TGAGCGACAA GGAAAAAGGA AAGTTGCCTT TCATTATGAA GCCCGACGGA GTCGCCTCCC TGTTCGGTCC GGGGCTGGCG GACGGACCGT TCCCCGAGCA CTACGAGCCC CTCGAGAGTC CGCTTTCCAA GAGCCTGATA TCCGCACAGT TGAGCAGTCC CGTCATCAGG ATATTCAATA GTGATTGCGA TAAGATTGCC GGGTGCGATC CGAGGTTCCC TCTGGTCTGT ACCACCTACT CCTGCACGGA GCACTGGTGC ACGGGCGCGG ATACGAGATG GCAGTCGTGG CTCACGGAGA CCATGCCCCA GGCCTACGTC GAGATCAGCA GGGAATTCGC GGATCTCCGT GGCATCGAGA ACGGTGAAAA GGTCAAGGTT GAATCGGTTC GCGGCAAAGT CGAATGCGTG GCGATGGTCA CTACGCGGCT GCGTCCTTTG AAAGTGGGCG GACAGACCCT GCATCAGGTG GGGATCACCT ACAACTACGG CTGGCTTTTC CCCAAGGATT GCGGCGATTC CGCCAACCTG CTGACCCCCA CCGCCGGCGA CCCCAACACC GGGACCCCGG AATACAGAGC TTTCATGGTT AATGTGACAA AGGTGTAG
|
Protein sequence | MGVTRREFLF ISGAMGAGLA LSSLGVDMLP VVAYAEGLSK IDKVKSAKEM YSLCYHCAVT CGLICSTDTK TGKIINIEGD PDNPINEGSL CAKGAASLQM SARNENRLTR VLYRKPGGNQ WEVKTWDWAL TRIAKNIKAV RDKEFIVKNA KGQVVNRVEA LAHMGSSKLD NEECWMITTA MRALGLVYLD HQARVCHAPS VASLAESLGR GSMTNHPIDI GNSDCVLVMG GNAAEAHPIT FRWAVRAKDK GAKIIHVDPR FTRTSAIADH HAFLRAGTDI AFLGGMIKYI LDNNKYFEEY VRNYTNASCS IDEKFSFKDG LFSGYDPAAR KYDKSTWVFR LDANGVPEKD PTLKNPRCVL QMMKAHYARY DLEKVSSITG TPVEDLKTIY ELYSSTGVRD KAGTILYALG WTQHTVGVQN IRTMTIVQLL LGNIGIAGGG VNALRGQPNV QGSTDQAILA HILPGYLKTP AASLTTLDEY LARNTPKTRE PQSANWYQNT PKYMVSLLKS WYGDKAIKEN GFAYSYLPKI DDGQDATLLD MIDKMYAGKI KGFTCVGQNP ACSNPNAGKT RKALANLDWM VHINIFDNET ASFWKGPGMD PKKIKTEVFL LPAAAQMEKA GSMTNTGRWL QWKYTAEKPP GDALSMGDIL YRLVMKLKDL YKKEKGTFPD PILDLQWNYA DVKGMYDSTA VAKAMNGYFL DDVTVGDKSF KKGECVPGFS YLQADGKTSC GIWIHCGSFT QDGTNLIARR KKDDPTGLGL YPEWAWAWPM NRRILYNRAS VDEKGQPWDP RRAVLKWADG KWVGDVPDGP WPPLSDKEKG KLPFIMKPDG VASLFGPGLA DGPFPEHYEP LESPLSKSLI SAQLSSPVIR IFNSDCDKIA GCDPRFPLVC TTYSCTEHWC TGADTRWQSW LTETMPQAYV EISREFADLR GIENGEKVKV ESVRGKVECV AMVTTRLRPL KVGGQTLHQV GITYNYGWLF PKDCGDSANL LTPTAGDPNT GTPEYRAFMV NVTKV
|
| |