Gene Information |
Locus tag | GM21_2638 |
Symbol | |
ID | 8137980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 3075312 |
End bp | 3078491 |
Gene Length | 3180 bp |
Protein Length | 1059 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644870242 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_003022432 |
Protein GI | 253701243 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence; [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
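The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both assumptions are consistent with the values listed here but are not stated by the record itself. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above (GM21_2638).
gene_bp, protein_aa = cds_lengths(3075312, 3078491)
print(gene_bp, protein_aa)
```

With the coordinates from this record, the result agrees with the listed Gene Length (3180 bp) and Protein Length (1059 aa).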
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 0.409853 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGTGT CACGGAGGAA CTTTTTAAAG ATCTCAGGTG TCGGCGTGGC GGCGACCACC CTGGGCCTGA ACCTCGATCC GGTCGAGGCG AAAGCTCAGG ATCTCGCCAT CCAGCATGCC AAGGAAACCA CGACCATCTG CCCCTACTGT TCCGTCGGGT GCGGCATGAT CGTGCACACC CAGGGGGACA AGGTGATCAA CGTCGAAGGG GACCCGGACC ATCCCATCAA CGAAGGGGCC CTCTGCCCCA AGGGTTCCTC CGTGTACCAA CTGCGCGACA ACAAGGCGCG CATCACCAAG CCGATGTATC GGGCCGCCGG CGCCGCAACC TGGCAGGAGG TGACCTGGGA ATGGGCGCTC GACCAGATAG CCAGAAAGAC CAAGGCGACC CGCGACGCCT CCTTCGTCCC CACCAGCAAG ATAAAGGTCA AGGAGAAGGT CGCCGGGGTT GAGGTCGAGA AGGAGATCGA GGCCGTGGTG AACCGCACCA TGGGGATCGC CTCGGTCGGC AGCGCCGCGC TGGACAACGA GGAATGCTAC CTGTACCAAA AATTTTTAAG GGGGTTGGGC CTGGTGTACA TCGAACATCA GGCACGCATT TGACACAGCG CCACTGTAGC GGCTCTGGCA GAGTCGTTTG GACGCGGTGC GATGACGAAC CACTGGATAG ATTTCAAGAA CGCCGACGTA ATCCTCATCA TGGGCGCGAA CCCCGCCGAG AACCACCCGG TCGCCTTCCG CTGGATCTTG AAGGCGAAGG AGTCGGGGGC GAAGGTGATC TGCGTCGATC CCCGCTTCAC CCGCAGCGCC GCCAAGTCGG ACCTGTACGC GCAGCTTCGC TCGGGGACCG ATATCGCCTT TCTGGGCGGA ATGATCAACC ACATCATGCA GAACAAGCTC TACTTCGAGC AGTACGTCGC CGAGTACACC AACGCCTCCT ATCTGGTGAA CGGCGACTTC AAGCTCCCGG GAGAACTGGA CGGCCTCTTC TCCGGCTACG ACCCGCAAAA GCGGAGCTAC GACGTTAAAA GCTGGTCCTT CCAGAAGGAG GCCGACGACA GCATCAGGAA GGACCCTACC CTGCAGGACC CGAACTGCGT GTTCCAGCTC CTGAAGAAGC ACTACAGCCG CTACACCCCG GAGCTGGTCT CGAAGACCAC GGGGACGCCC AAGGAAAAGC TCCTGGAGGT GTACCAGCTC TACGCCTCCA CCGGGAGGCC CGACCGCGCC GGCACCTCGC TTTACGCCAT GGGGTGGACC CAGCACACGG TCGGCACCCA GAACATACGC ACCATGTCCA TCATCCAGCT CCTGTTGGGG AACATCGGCG TCGCCGGAGG CGGGGTCAAC GCCCTTCGCG GCGAATCGAA CGTGCAGGGA TCTACCGACC AGGGGCTCTT GTTCCACATC CTCCCCGGTT ACATGCCGGT CCCCTCGGCC GAGCTCCCCA CCACGGCCGC CTACATCGAG AAGCACACCC CGAAGAGCAA GGACCCGCAG AGCGCCAACT GGTGGGGGAA CCGCAACAAG TACCTGGTCA GCTACCTGAA GGCGATCTAC GGCAACAACG CCACCAAGGA GAACGACTTC GGCTACAACT GGCTCCCGAA ACTGGACCCG GGGATGAACG GCTCCTGGCT GATGATCTTC GACAACATGC TCAAGGGTAA GTTGAAGGGG TTCTACGCCT GGGGGCAGAA TCCGGCCTGT TCGGGGGCTA ACTCCAACAA GGTGAGAAAC GCCCTGGCCA AGCTCGACTG GATGGTGGCG GTCAACCTCT TCGACAACGA GACCGCCTCC 
TTCTGGAAAG GACCGGGGAT GGACCCGGCC AAGGTGAAGA CCGAGGTCTT CTTCCTTCCG GCGGCGGCAT CCTTCGAGAA GGAAGGCTCC ATCACCAACT CCGGGCGCTG GGCCCAGTGG CGCTATCAGG CGGTGAAGCC CCTCGGGCAG AGCAAGCCCG ACGCCGAGAT CATGAACGAC CTGTACCAGG CGATCAAGGG GCTCTACGCG AAAGAAGGAG GCGCGCTCCC CGAACAGCTC TTGAAGCTGA CCTGGAACTA CGGCTTCAAA AGGGCCGACG GCAGCATCCG CTCCATCGAC ATCCACCAGG TGGCCAAGGA GATCAACGGC TACTTCCTCC AGGACGTATC CGAGCCCGTG AAGCCGCTAA AGCCCGGGCA AGCGCCGCCG AAACCGCGGG AGCCCTGGGA GGCGAAGAAG CTTTTAGGCA AGAAGGGAGA ACTGGTGGGA GGTTTCGCCC AGCTTCAGGC CGACGGGAGC ACCTCCTGCG GCTCGTGGAT CTTCAGCCAG AGCTACAACG AAAAGGGGAA CATGATGGCA CGCCGCGGCA AGAAGGACCC GACCGGCCTC GGCATGTTCC CCGAGTGGAG CTGGTGCTGG CCGGTGAACC GCCGCATCAT CTACAACCGC GCCTCGTGCG ACCCAAGCGG CAAACCGTTC AACGTGAAAA AGGCGGTGGT CTACTGGAAC CCCCTGGCGG TGCAGCCCGG CGGCAAAATG GGGGCCTGGG TGGGCGATGT CCCCGACGGC CCCTGGCCGC CTCTCGCCGC CGGAGCGGAG GGGCGCAAAC CCTTCATCAT GAGGGCGGAC GGGGTGGCCG CGATCTTCGG CCCGGGGCTT AAGGACGGCC CCTTCCCCGA GCACTACGAG CCGATGGAGT CGCCGCTGGC CCAGAACCTG ATGTCCAAGC AGCTCAACAA CCCCGCCGTG AAGATCTTCA AGGACCCCGC GGTGAAGGAG GACGTCTTGG CCAGCGCCGA CCCGCGCTTT CCGCTGGTCG CCACCACCTA CCGGGTCACC GAGCACTGGC AGACCGGCGT CATGACCCGC AACACCCCTT GGCTATTGGA ACTGCAACCG CGCCAGTTCT GCGAGATCAG CCAGGAGCTT GCCAAGGAGA AGGGAATCGC CAACGGCGAC CTGGTGGAGG TAGCCTCGGC CCGCGGCAAG GTCGAGGCGG TGGCCATGGT GACCGTGCGC ATGAAGCCGA TGAAGATAGG GGACAAGGTC GTGCACCAGA TCGGCCTCCC CTGGTGCTTT GGCTGGCACA CCCCCGGCGT GGGAGACGCC GCCAACCTGC TGACCCCCAC CGCGGGGGAC GCCAATACGA TGATTCCCGA GACCAAGGCG TTCATGGCCG GGATCGCGAG AAAGGGGTGA
|
Protein sequence | MEVSRRNFLK ISGVGVAATT LGLNLDPVEA KAQDLAIQHA KETTTICPYC SVGCGMIVHT QGDKVINVEG DPDHPINEGA LCPKGSSVYQ LRDNKARITK PMYRAAGAAT WQEVTWEWAL DQIARKTKAT RDASFVPTSK IKVKEKVAGV EVEKEIEAVV NRTMGIASVG SAALDNEECY LYQKFLRGLG LVYIEHQARI UHSATVAALA ESFGRGAMTN HWIDFKNADV ILIMGANPAE NHPVAFRWIL KAKESGAKVI CVDPRFTRSA AKSDLYAQLR SGTDIAFLGG MINHIMQNKL YFEQYVAEYT NASYLVNGDF KLPGELDGLF SGYDPQKRSY DVKSWSFQKE ADDSIRKDPT LQDPNCVFQL LKKHYSRYTP ELVSKTTGTP KEKLLEVYQL YASTGRPDRA GTSLYAMGWT QHTVGTQNIR TMSIIQLLLG NIGVAGGGVN ALRGESNVQG STDQGLLFHI LPGYMPVPSA ELPTTAAYIE KHTPKSKDPQ SANWWGNRNK YLVSYLKAIY GNNATKENDF GYNWLPKLDP GMNGSWLMIF DNMLKGKLKG FYAWGQNPAC SGANSNKVRN ALAKLDWMVA VNLFDNETAS FWKGPGMDPA KVKTEVFFLP AAASFEKEGS ITNSGRWAQW RYQAVKPLGQ SKPDAEIMND LYQAIKGLYA KEGGALPEQL LKLTWNYGFK RADGSIRSID IHQVAKEING YFLQDVSEPV KPLKPGQAPP KPREPWEAKK LLGKKGELVG GFAQLQADGS TSCGSWIFSQ SYNEKGNMMA RRGKKDPTGL GMFPEWSWCW PVNRRIIYNR ASCDPSGKPF NVKKAVVYWN PLAVQPGGKM GAWVGDVPDG PWPPLAAGAE GRKPFIMRAD GVAAIFGPGL KDGPFPEHYE PMESPLAQNL MSKQLNNPAV KIFKDPAVKE DVLASADPRF PLVATTYRVT EHWQTGVMTR NTPWLLELQP RQFCEISQEL AKEKGIANGD LVEVASARGK VEAVAMVTVR MKPMKIGDKV VHQIGLPWCF GWHTPGVGDA ANLLTPTAGD ANTMIPETKA FMAGIARKG
|
| |
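The protein sequence contains a U (selenocysteine) at residue 201, which lines up with an in-frame TGA codon at bases 601 to 603 of the gene sequence. A plain translation with table 11 would read that codon as a stop. The sketch below illustrates the difference on a 36-base excerpt (bases 583 to 618 of the gene sequence above). The function `translate` and its `sec_codons` parameter are hypothetical names introduced for illustration, and recoding by explicit codon index is a simplification, not how the database derived its translation.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments as the standard code;
# it differs only in its permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, sec_codons: frozenset[int] = frozenset()) -> str:
    """Translate DNA in frame 0; codon indices in sec_codons (0-based)
    are emitted as 'U' (selenocysteine) instead of being looked up."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        if i // 3 in sec_codons:
            protein.append("U")
        else:
            protein.append(CODON_TABLE[dna[i:i + 3]])
    return "".join(protein)


# Excerpt copied from the gene sequence above, bases 583-618.
excerpt = "GAACATCAGG CACGCATTTG ACACAGCGCC ACTGTA"

print(translate(excerpt))                             # TGA read as stop
print(translate(excerpt, sec_codons=frozenset({6})))  # TGA recoded as U
```

The second call reproduces the corresponding stretch of the listed protein sequence (EHQARIUHSATV), while the first shows a stop symbol at the same position.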