Gene GM21_3972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3972 
Symbol 
ID8139346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4555684 
End bp4557105 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content67% 
IMG OID644871588 
Productnickel-dependent hydrogenase large subunit 
Protein accessionYP_003023746 
Protein GI253702557 
COG category[C] Energy production and conversion 
COG ID[COG3259] Coenzyme F420-reducing hydrogenase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.00000529156 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCGATGA AGCGAACTTT GAAGATTGAC CCAGTGACCC GGATCGAGGG GCACGCCAAG 
GTGTTCATCA ACCTGGACGA GGCCGGGGCG CTGGAAAGCG CGGGGCTCGT GGTGAACGAG
CTGCGCGGCT TCGAGAAGAT CCTGATCGGG ATGGAGGCGG ACCGGATGCC GCACGTGACG
GCGCGCATCT GCGGGGTCTG CCCCACCGCG CACCATATCG CCGCCTGCAA CGCACTGGAC
CACGCGGCAG GGGTCACGCC CCCCCCCGCC GCGCTTCTTC TGCGGGAGCT GATGTATCTC
GGGCACATCA TCCACTCGCA CTCGCTTTCC ATCTTCGTGC TGCAGGGGCC GGACCTGGTG
CTGGGACTCG ACGCGGACCC GGCGATCCGG AACATCGTGG GGATCGTGCA GGCGAACCCG
GAGCTCGCCA AACTCGCCCT GCGCCTGAGG AGCATCGGCC AGAAGATCAA CGAGATGGTG
GGGGGGAGGG GGACGCACCC GGTGACCTCG GTGGCCGGCG GCATCGCCTT CGTGCTCGAC
AAGGAGAAGC TAAAGGCGCT CAAGGAGTGG GTGGACGAGG CGCGGGGGGT GCTGCCGCAG
GTGGTCCCGG CCGTCAAGGG GCTCCTGATG CGGGCCCTGG AAGCGCACCC GGAGATGGGA
GAAAAATGGA TCGTCCCGAG TTTCGGGATG GGTACCGTGC AGGATGGAGC GGTCTCCCTG
ATCTCGGGGG AGCTTCGCGT CATCGACGAC ACCGGCGCCA CCGTTTTGGA GTTCGGGATC
GAGGAGTACG ACCGGTACCT GCGCGAGTCT GTCGTCGAGT GGTCCTACAT GAAGAAGGTG
CAGGTAGAGC TGGACGGCGA GCTGCACGAC TACCGGGTGG GGCCCATGGC GCGGATGAAC
GTGGCGCGCC GTTTCGGGAC CGAAATGGCC GACGCCGAGT ACGCCGAGTT CGCCAGGTTG
GGGGGAGCCC CCTGCCACAC CACCGTGTTC CAGACCTACG CCAAGCTGAT CGAGATCGTC
TGGGCCATCG AGCGGGCGGG GGAGATCCTG CGCGACAAGG CGATCCGCGG GGAGACCCGG
GTCCCGGTCC GCTTCCAGGG GGGGAGGGGG GTGGGGCACG TCGAGGCGCC GCGCGGCACG
CTGATCCACG ACTACCAGAT CGACGAGCGC GGGATCGTGC GGGCGGCGAA CCTGATCGTC
GCCACCCAGC AGAACTACTC GCTCATCAAC CGCTCCATCG AGCAGTCCGC CCAGTCCCAC
GTGATCGACC GCCCCGACGA CCGGGCGCTG ATGAACGCCG TCGAGTTCAG CATCCGCTGC
TACGACCCCT GCCTCTCCTG CGCCACCCAC GCTCTCGGGC GGATGCCGCT GGAGGTAGCG
GTCAGGCGGG GCGCGGAGAC GGTCAAGACC CTTTGGAGGT AA
 
Protein sequence
MPMKRTLKID PVTRIEGHAK VFINLDEAGA LESAGLVVNE LRGFEKILIG MEADRMPHVT 
ARICGVCPTA HHIAACNALD HAAGVTPPPA ALLLRELMYL GHIIHSHSLS IFVLQGPDLV
LGLDADPAIR NIVGIVQANP ELAKLALRLR SIGQKINEMV GGRGTHPVTS VAGGIAFVLD
KEKLKALKEW VDEARGVLPQ VVPAVKGLLM RALEAHPEMG EKWIVPSFGM GTVQDGAVSL
ISGELRVIDD TGATVLEFGI EEYDRYLRES VVEWSYMKKV QVELDGELHD YRVGPMARMN
VARRFGTEMA DAEYAEFARL GGAPCHTTVF QTYAKLIEIV WAIERAGEIL RDKAIRGETR
VPVRFQGGRG VGHVEAPRGT LIHDYQIDER GIVRAANLIV ATQQNYSLIN RSIEQSAQSH
VIDRPDDRAL MNAVEFSIRC YDPCLSCATH ALGRMPLEVA VRRGAETVKT LWR