Gene GM21_2249 details

Gene Information

Locus tag: GM21_2249
Symbol:
ID: 8137588
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2621278
End bp: 2622549
Gene length: 1272 bp
Protein length: 423 aa
Translation table: 11
GC content: 66%
IMG OID: 644869864
Product: nickel-dependent hydrogenase large subunit
Protein accession: YP_003022056
Protein GI: 253700867
COG category: [C] Energy production and conversion
COG ID: [COG3259] Coenzyme F420-reducing hydrogenase, alpha subunit
TIGRFAM ID:
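
The start and end coordinates, gene length, and protein length in this record agree with one another, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2621278, 2622549)
print(length)                  # 1272
print(protein_length(length))  # 423
```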


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 80
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAGCG TCGTTGAGTT GAACCTGACC CGTGTGGAGG GTCACGGCAG CGTCAAGGTC 
TACCGCGAGG GGTCGCGCGT GGAACGGGTT GAGCTGTGTC TCGCCGATTC TCCGCGCCTC
TTCGAGGCGC TCCTTATCGG GAAAAGCTAT CTGGAAGTTC CGGAGATAGT CTGCCGCATC
TGCTCCCTCT GCTCGACGGT GCATAAGGTG ACCGCGCTTT TGGCCGTCGA GAACGCTTTC
GGCATCGAGG TCTCCGAGAC CACCGCCCTG ACCCGCGAGC TCATAATGCA GGGGGGGATG
ATCCAGGACC ACGCGCTGCA CCTTTACTGC CTGCTCCTTC CCGACCTCCT CGGCGTGCCG
GGGGTGACCG GGCTGGCCCA GAAGGCGCCC GAACTACTGA AGACGGGGCT TGGCATCAAG
AGGGTCGGCA ACATGATCCA GGAGACGGTC GGCGGCCGCC TGATCCACCC GGTCAACATC
CGGCTGGGGG GACTGGGGCA GAGGGTGGGC AAAAAGGAAC TGCTACGCCT GCGCCATGAG
CTTGAGTCGG TCCTTCCCGC CTGCCGCGAC GCGTATCGAT TTTTTCGCAC CCCCTTCCCT
TTCCCGGAAC TCCCCTCCGC GAACGCGCTG GCAGTGGAGC CTCGCGGCGC CGGCCGCCCC
GCCGCCATCC GGTGCCGCAT GGCCGGAGGG GAGTCGTTCG CCGTGTCCGG GTACCGCGAG
GCGGTCAAGG AAAGCGTCCT CCCCCATTCC AACGCCAAAT ATTCGAAGGT GATGGGAAAA
GAAGCCACGG TGGGCGCCCT CGCCAGGCTC GCCCTCGGAG TCCGACTCAG CGCGAAGGCT
CAGGGCGTCT TCGACGGGGT AAAGCACGAG ATCCTCGGCA GGGACATACG TGGCAACAGC
CTGGCCCAGG CGATTGAGCT TTGTGACGCA GCGGAACGCG CGATAGAGCT CATCGACCGG
CTCCTCGACG AAAATCCCGG CGCACCGGGC GACGTCGAGC CGGTTCCGCG CGCGGGGAGC
GGGAGCGCAG CCTGCGAGGC GCCACGCGGC CTGCTGATCC ACAGCTACGG TTTCGACTCG
GACGGCATCT GCACCGGAGC CGATGTCGTC ACCCCCACCG CCTTGAACCA GGGGGCCATG
GCGCGCGACC TGCTGGCGCT GGCGCGGGGA ATGGAGGGGG AAGAGACGAA AAAGATGACC
ACGGCGCTGG AGCGCCTGAT CAGGTGCTAC GACCCCTGCA TCTCGTGTTC GGTGCACATG
CTGAAGCTCT GA
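
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before its length or GC content can be checked against the record. The following is a minimal sketch; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


first_line = "ATGGGCAGCG TCGTTGAGTT GAACCTGACC CGTGTGGAGG GTCACGGCAG CGTCAAGGTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Running the same two helpers on the full sequence should give a length matching the listed gene length and a GC fraction that rounds to the listed GC content, if the record is internally consistent.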
 
Protein sequence
MGSVVELNLT RVEGHGSVKV YREGSRVERV ELCLADSPRL FEALLIGKSY LEVPEIVCRI 
CSLCSTVHKV TALLAVENAF GIEVSETTAL TRELIMQGGM IQDHALHLYC LLLPDLLGVP
GVTGLAQKAP ELLKTGLGIK RVGNMIQETV GGRLIHPVNI RLGGLGQRVG KKELLRLRHE
LESVLPACRD AYRFFRTPFP FPELPSANAL AVEPRGAGRP AAIRCRMAGG ESFAVSGYRE
AVKESVLPHS NAKYSKVMGK EATVGALARL ALGVRLSAKA QGVFDGVKHE ILGRDIRGNS
LAQAIELCDA AERAIELIDR LLDENPGAPG DVEPVPRAGS GSAACEAPRG LLIHSYGFDS
DGICTGADVV TPTALNQGAM ARDLLALARG MEGEETKKMT TALERLIRCY DPCISCSVHM
LKL
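
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below uses the amino acid assignments of NCBI translation table 11, the table named in the record; these assignments are the same as in the standard code, and the two tables differ only in which start codons they allow. It is a simplification: it does not treat alternative start codons specially, which does not matter here because the gene begins with ATG. The function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and final line of the gene sequence in this record.
print(translate("ATGGGCAGCG TCGTTGAGTT GAACCTGACC"))  # MGSVVELNLT
print(translate("CTGAAGCTCT GA"))                     # LKL
```

The two outputs match the first ten and last three residues of the listed protein sequence.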