Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2249 |
Symbol | |
ID | 8137588 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2621278 |
End bp | 2622549 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644869864 |
Product | nickel-dependent hydrogenase large subunit |
Protein accession | YP_003022056 |
Protein GI | 253700867 |
COG category | [C] Energy production and conversion |
COG ID | [COG3259] Coenzyme F420-reducing hydrogenase, alpha subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 80 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAGCG TCGTTGAGTT GAACCTGACC CGTGTGGAGG GTCACGGCAG CGTCAAGGTC TACCGCGAGG GGTCGCGCGT GGAACGGGTT GAGCTGTGTC TCGCCGATTC TCCGCGCCTC TTCGAGGCGC TCCTTATCGG GAAAAGCTAT CTGGAAGTTC CGGAGATAGT CTGCCGCATC TGCTCCCTCT GCTCGACGGT GCATAAGGTG ACCGCGCTTT TGGCCGTCGA GAACGCTTTC GGCATCGAGG TCTCCGAGAC CACCGCCCTG ACCCGCGAGC TCATAATGCA GGGGGGGATG ATCCAGGACC ACGCGCTGCA CCTTTACTGC CTGCTCCTTC CCGACCTCCT CGGCGTGCCG GGGGTGACCG GGCTGGCCCA GAAGGCGCCC GAACTACTGA AGACGGGGCT TGGCATCAAG AGGGTCGGCA ACATGATCCA GGAGACGGTC GGCGGCCGCC TGATCCACCC GGTCAACATC CGGCTGGGGG GACTGGGGCA GAGGGTGGGC AAAAAGGAAC TGCTACGCCT GCGCCATGAG CTTGAGTCGG TCCTTCCCGC CTGCCGCGAC GCGTATCGAT TTTTTCGCAC CCCCTTCCCT TTCCCGGAAC TCCCCTCCGC GAACGCGCTG GCAGTGGAGC CTCGCGGCGC CGGCCGCCCC GCCGCCATCC GGTGCCGCAT GGCCGGAGGG GAGTCGTTCG CCGTGTCCGG GTACCGCGAG GCGGTCAAGG AAAGCGTCCT CCCCCATTCC AACGCCAAAT ATTCGAAGGT GATGGGAAAA GAAGCCACGG TGGGCGCCCT CGCCAGGCTC GCCCTCGGAG TCCGACTCAG CGCGAAGGCT CAGGGCGTCT TCGACGGGGT AAAGCACGAG ATCCTCGGCA GGGACATACG TGGCAACAGC CTGGCCCAGG CGATTGAGCT TTGTGACGCA GCGGAACGCG CGATAGAGCT CATCGACCGG CTCCTCGACG AAAATCCCGG CGCACCGGGC GACGTCGAGC CGGTTCCGCG CGCGGGGAGC GGGAGCGCAG CCTGCGAGGC GCCACGCGGC CTGCTGATCC ACAGCTACGG TTTCGACTCG GACGGCATCT GCACCGGAGC CGATGTCGTC ACCCCCACCG CCTTGAACCA GGGGGCCATG GCGCGCGACC TGCTGGCGCT GGCGCGGGGA ATGGAGGGGG AAGAGACGAA AAAGATGACC ACGGCGCTGG AGCGCCTGAT CAGGTGCTAC GACCCCTGCA TCTCGTGTTC GGTGCACATG CTGAAGCTCT GA
|
Protein sequence | MGSVVELNLT RVEGHGSVKV YREGSRVERV ELCLADSPRL FEALLIGKSY LEVPEIVCRI CSLCSTVHKV TALLAVENAF GIEVSETTAL TRELIMQGGM IQDHALHLYC LLLPDLLGVP GVTGLAQKAP ELLKTGLGIK RVGNMIQETV GGRLIHPVNI RLGGLGQRVG KKELLRLRHE LESVLPACRD AYRFFRTPFP FPELPSANAL AVEPRGAGRP AAIRCRMAGG ESFAVSGYRE AVKESVLPHS NAKYSKVMGK EATVGALARL ALGVRLSAKA QGVFDGVKHE ILGRDIRGNS LAQAIELCDA AERAIELIDR LLDENPGAPG DVEPVPRAGS GSAACEAPRG LLIHSYGFDS DGICTGADVV TPTALNQGAM ARDLLALARG MEGEETKKMT TALERLIRCY DPCISCSVHM LKL
|
| |