Gene Information |
Locus tag | Gmet_1059 |
Symbol | |
ID | 3739506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter metallireducens GS-15 |
Kingdom | Bacteria |
Replicon accession | NC_007517 |
Strand | + |
Start bp | 1177284 |
End bp | 1180316 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637778337 |
Product | formate dehydrogenase alpha subunit |
Protein accession | YP_384024 |
Protein GI | 78222277 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence; [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
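The coordinate and length fields above are related by simple arithmetic, which gives a quick consistency check on the record. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record for Gmet_1059.
length = gene_length_bp(1177284, 1180316)
print(length, protein_length_aa(length))
```

With the Start bp and End bp values listed here, this reproduces the stated Gene Length (3033 bp) and Protein Length (1010 aa).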
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.080154 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAATTT CACGCAGGCA GTTTCTGCAG CGGGGGGCAC TGGCCGGCGC GGCCATCGCG CTCTCCGGCA AGCCGGGGGA GGCGAGCGTC GATGCTCCCG AGCTCCGGAC CAAGGGGCTG AAGACCACGA CCACCATCTG TCCGTTCTGC TCGGTTGGGT GCGGCCTGAT CGTCCACACC AAGGACGGCA GGGTGGTGAA CGCCGAAGGT GACCCGCAGC ACCCGATCAA CCAGGGGGCG CTCTGTCCCA AGGGGGGCGC GCTCTTCCAG ATCGCCAACA ACGAGAACCG GCTCCAGAAG GTGAAGTACC GGGCGCCCGG CTCCGACACA TGGGAAGAGA AGAGCTGGGA CTGGGCCCTT GACCGGATCG CCCAGCGGAT GAAGGAAACC CGCGACAAGT CCTTCAAGAA GACTGAGCTC AACAAGAAGG ACAACAAAGA GTACGTGGTG AACCGCACCG ATGGCATGGC CTTCTTCGGC GGCGCCGGTC TTGATAACGA AGAGTGCTAC CTCTGGACCA AATTCGCCCG CGCCATGGGG GTGGGGCAGC TCGAACACCA GGCCCGACTT TGACACTCGT CAACGGTCGC CGGTCTGGCG GCTTCATTCG GTCGTGGTGC CATGACCAAC CATTGGATTG ACCTGAAGAA CTCCGACGTA ATCCTGGCCA TCGGCTGCAA CCCGGCCGAG AACCACCCTG TTTCGTTCAA GTGGATCGAG CAGGCGCTCG ACAGCGGCGC CAAGCTCATT GCCGTCGATC CGCGCTATAC ACGCACGGCC TCCAAGTCGG ACATCTACGC CCAGATCCGT CCGGGGACCG ACATCGCCTT CCTGGGGGGG ATAATCAACT TCGCCCTCCA GAACAACATG ATCCACGAGG AGTACGTCCG CGAGTACACC AACGCCACGT TCATCGTCTC GGAGCAGTTC GACTTCCAGG ACGGCATGTT CTGCGCCTTC GACGATCAGG AGAAGACCTA CGACCTCAAG TCCTGGGCCT ACGCCACCAC CGCCGACGGC AAGCCCAAGC GGGACATGGG CATGAAGGAT CCCAAGTCCG TCTACCAACT CCTGAAGAAC CACTACCGGC GCTATGATGT GGACACCGTC TGCGCCATCA CCGGCACGAA AAAGGAAGAC TACCTCAAGG TGGCCAAGGC GTTCTGCGGC ACCGGTCGCG CCGACAAGGC GGGGACCATC CTCTACGCCA TGGGGATCAC CCAGTCGACC CACGGCAGCC AGAACGTCCG GGCCGTGGCG CTCCTCCAGA TGCTGCTGGG GAACATCGGC ATCGCCGGCG GCGGCGTGAA CGCCTTGCGG GGCGAGTCCA ACGTCCAGGG CTCCACCGAC TACGGGCTCC TCTTCCACAT CCTCCCCGGC TACCTCAAGT CGCCGGAGTT CGACAACGTG GACCTGAAGG CCTACAACGA GAAGTGGACC CCCAAGACCA AGGACCCCAA GAGCGCCAAC TGGTGGGGGA ACACCCCCAA GTACATCACG AGCCTCCTCA AGGCCTGGTA CGGCGAGAAC GCCACGGCCG AGAACGACTT CTGCTACGAC TACCTCCCCA AGCGGTCGGG GAACTACTCC TACGTGAAGA TCATGGAGAA GATGGGGAAG GGAGAACTGG AAGGGCTCGT CTGCATGGGG ATGAACCCGG CCGTGGGGGG CCCCGATTCC GGCAAGGCCC GGGAGGCCCT TGGCAAGCTC AAGTGGCTCG TCACCGCCGA CCTCTGGGAG ACCGAGAGCT CCATCTTCTG GAAGCGTCCC GGCGTCGACC CGAAGTCGAT CCAGACCGAG 
GTCTTCATGT TGCCGGCCGC CTCCTCCATC GAGAAGGAAG GCTCCATCTC CAACTCGGGC CGCTGGGCCC AGTGGCGCTA CGCCGCCGTC CACCCGCTGG GTGATTCCCG CAGCGACCTC CACATCATCG ACGAGTTCTA CAAGCGGGTG AAGGCCCTCT ACCTGAAGCA GGGGGGCGCG TTCCCCGAGC CGATCACCAA GCTCTCCTGG AGCTACGGCA CCGGCCACGA GCCCGACGTC CATATGGTGG CCAGGGAGAT CAACGGCTAC TTCACCAAGG ACATGACCAT CGTCGACAAG GACAAGACCC TGGAGTTCAA GAAGGGGGAT CAGGTCCCGA TGTTCAAGTA CCTCCAGGAC GACGGCTCCA CCGTCTCCGG CTGCTGGATC TACTGCGGCT CCTACACCAA GGACGGCAAC CAGATGGCCC GCCGGGACGC CTCCGACCCC ACCGGCCTCG GCCTCTTCCC CAAGTGGTCC TGGTGCTGGC CGGTGAACCG CCGGATCATC TACAACCGCG CCTCCGTCAA CCCGGCCGGT GAGCCGTTTA ACCCCAAGCG CCCGGTCATC GCCTGGGACG CGCTGGAGAA GAAGTGGAAG GGGGATGTCC CCGACGGTCC CTGGCCCCCC ATGAAGGACG ACAAGGAAGG GAAGTACCCC TTCATCATGC TCCCCGAGGG ACATGGCCGC CTCTACGCCC TCGACATGAA GGACGGGCCG TTCCCCGAGC ACTACGAGCC AGTGGAGAGT CCGGCCAGGA ACCTCCTCTC CAAGGTGCAG AGCAACCCGG TCGTCAAGGT GCCGAAGAAC GTGTCGAGCG ATACCGCCAA GTTCCCCTAC GTCGGCACCA CCTACCGGGT GACCGAGCAC TGGCAGGCCG GTGCCATGAC CAGGAGCCTC CCGTGGCTCG TGGAACTGGT TCCCGACATG TTCGTGGAAC TCTCCGAGAC CCTGGCCCAG CGGAAGGGGA TCAAGCAGGG GGACACGGTG AAGGTCACCA CCGAGCGGGG CTCCATCGAA GCGGTGGCCC TTGTCACCAG CCGCCTCAAG CCCTTCAACG TCCAGGGGCG GATGATCGAG CAGGTGGGGA TGCCGTGGCA CTTCGGCTAC GCGGGACTCG CCAAGGGGGA CAGCGGTAAC GTCCTCACCC CCACCGTTGG GTGCGCCAAC ACCGGCATTC CGGAGTTCAA GGCGTTCCTC TGCAACATCG AGAAAGGGGG TAAGCGGGCA TGA
|
Protein sequence | MGISRRQFLQ RGALAGAAIA LSGKPGEASV DAPELRTKGL KTTTTICPFC SVGCGLIVHT KDGRVVNAEG DPQHPINQGA LCPKGGALFQ IANNENRLQK VKYRAPGSDT WEEKSWDWAL DRIAQRMKET RDKSFKKTEL NKKDNKEYVV NRTDGMAFFG GAGLDNEECY LWTKFARAMG VGQLEHQARL UHSSTVAGLA ASFGRGAMTN HWIDLKNSDV ILAIGCNPAE NHPVSFKWIE QALDSGAKLI AVDPRYTRTA SKSDIYAQIR PGTDIAFLGG IINFALQNNM IHEEYVREYT NATFIVSEQF DFQDGMFCAF DDQEKTYDLK SWAYATTADG KPKRDMGMKD PKSVYQLLKN HYRRYDVDTV CAITGTKKED YLKVAKAFCG TGRADKAGTI LYAMGITQST HGSQNVRAVA LLQMLLGNIG IAGGGVNALR GESNVQGSTD YGLLFHILPG YLKSPEFDNV DLKAYNEKWT PKTKDPKSAN WWGNTPKYIT SLLKAWYGEN ATAENDFCYD YLPKRSGNYS YVKIMEKMGK GELEGLVCMG MNPAVGGPDS GKAREALGKL KWLVTADLWE TESSIFWKRP GVDPKSIQTE VFMLPAASSI EKEGSISNSG RWAQWRYAAV HPLGDSRSDL HIIDEFYKRV KALYLKQGGA FPEPITKLSW SYGTGHEPDV HMVAREINGY FTKDMTIVDK DKTLEFKKGD QVPMFKYLQD DGSTVSGCWI YCGSYTKDGN QMARRDASDP TGLGLFPKWS WCWPVNRRII YNRASVNPAG EPFNPKRPVI AWDALEKKWK GDVPDGPWPP MKDDKEGKYP FIMLPEGHGR LYALDMKDGP FPEHYEPVES PARNLLSKVQ SNPVVKVPKN VSSDTAKFPY VGTTYRVTEH WQAGAMTRSL PWLVELVPDM FVELSETLAQ RKGIKQGDTV KVTTERGSIE AVALVTSRLK PFNVQGRMIE QVGMPWHFGY AGLAKGDSGN VLTPTVGCAN TGIPEFKAFL CNIEKGGKRA
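The protein sequence contains a U (selenocysteine) residue, which corresponds to an in-frame TGA codon in the gene sequence. This fits the COG description "typically selenocysteine-containing". The sketch below shows one way the two sequences could be spot-checked against each other. It assumes the standard amino acid assignments (which translation table 11 shares for elongation codons) and that the caller supplies which TGA codons are recoded. The `translate` function and its `sec_codons` parameter are hypothetical, and real selenocysteine recoding depends on sequence context that is not modeled here.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, sec_codons: frozenset = frozenset()) -> str:
    """Translate in frame 1; TGA at the given 0-based codon indices reads as U."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for n in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[n:n + 3]
        if codon == "TGA" and n // 3 in sec_codons:
            protein.append("U")  # caller-declared selenocysteine codon
        else:
            protein.append(CODON_TABLE[codon])
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGGAATTT CACGCAGGCA GTTTCTGCAG"))
# Short fragment from the record spanning the in-frame TGA.
print(translate("CGACTTTGACACTCG", sec_codons=frozenset({2})))
```

The first call yields `MGISRRQFLQ`, matching the start of the protein sequence. The second yields `RLUHS`, matching the residues around the U. Without the `sec_codons` argument the same fragment translates to `RL*HS`, which is how a naive translation would truncate this protein.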
|
| |