Gene GM21_1126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1126 
Symbol 
ID8136448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1318281 
End bp1319960 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content63% 
IMG OID644868737 
ProductCytochrome-c3 hydrogenase 
Protein accessionYP_003020945 
Protein GI253699756 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones150 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAGAA TCACGCTTGA TCCCATCACC CGCATCGAAG GGCACCTCAG GATAGACGTC 
GAAGCCAACG GCGGCAAGGT AACCAACGCC TGGTCCTCGG CCCAAATGTG GCGCGGCATA
GAGGTCATCC TCAAGGGGCG CGCCCCCGAG GACGCCTGGA GCTTCGTGCA ACGCTTTTGC
GGCGTCTGCA CCACGGTGCA CGCCATCTCC TCGATCCGCG CCGTCGAGCA CGCCCTGAAC
GTCGAGGTGC CGTTAAACGC CCAGTACATC CGCAACATCA TGATCGCCCA GCACTCGGTG
CAGGACCACA TCGTCCACTT CTACCACCTC TCGGCGCTGG ACTGGGTCGA CATCGTCTCG
GCTCTCTCCG CAGACCCCAA GAAGACGGCC CAGATCGCCC AGAGCATCTC CGACTGGCCG
GGGAATTCCG AGAAGGAGTT CAAGGCGGTC CAGGACAAGC TGAAGGCGTT CGTCGCCGGC
GGCAAGCTAG GCATCTTCGC CTCCGGCTAC TGGGGGCATC CGGCCATGAA GCTCTCCCCC
GAGGTGAACC TGATGGCGGT GGCCCACTAC CTGAAGGCGC TCGACTACCA GAGAAAGGCG
TCCCAGGCGA CCGCCATCCT GGGGGGGAAG AACCCGCACA TCCAGAACCT GGTGGTGGGG
GGCGTCGCCA CCGCGATCAA CACCGAGAAC ATCGCCACCC TCAACATGGA GCGCCTCATC
TACTTGAGGA CCCTGATGGA GGAGACCCGA GACTTCGTGC AGAAGGTGTA CTACCCGGAC
CTCCTGGCCA TCGCCGGCGC CTACAAGGAG TGGTTCAAGT ACGGGGCGGG GGTCACCAAC
TACATGGCGG TCCCGGAGTT CCCGGAGGAC ACCAAGAACG GCAAGTTCTC GCTCCCGGGC
GGGGTGGTCA TGGGCGGGAA CATCGCCGGG ATCAGGCCGG TCAAGGACCA CCGCGACGAG
TACCTGATCA AGAACATCAA GGAGAACGTC ACCCACGCCT GGTACGAGGG GAACGGCACG
CTGCACCCCT GGGAAGGGGA GACCAAGCCC GACTACACCG ACTTCAAGGA AAACGGCAAG
TACTCCTGGT GCAAGGCGCC GACCCTGGAC GGAAAGCCGG TGCAGGTAGG TCCTCCGGCC
CAGATCATGG CCGCCTACGC CGCCGGAAAC CCGAAGGTGA AAAAGCTCGT CGACGGAGCC
GTCTCCACCC TCGGCCTTTC CATGAAGGAC ATCCACTCCA CCATGGGGCG CCTGTACTGC
CGCGGCGCGC GCGCCCACAT CATGGCCGAC ATCGCGCTGG AAAACCTGGA CAAGCTGATC
GCCAACATCG CCACCGGCGA TCAAACTTAC GCGAACCACA CCGAGATCCC CAAAGGGGAG
TACCGCGGGG TGGGCTTCCA CGAGGCGCCG CGCGGCACCT TGAGCCACTG GATCGTCATC
AAGGACAAGA AGATCGAGAA CTATCAGGCC GTGGTCCCCT CCACCTGGAA CGCCGCTCCC
AGAAACGACA AGGGAGAGCT CGGCCCCTAC GAGGCGTCCT TGGTGGGGAA CCCCATCGCG
GACAGCAGCA AGCCGCTGGA GGTCTTGAGG ACCATCCACT CCTTCGACCC CTGCATTGCC
TGCGCCGTGC ACACGATCGA CCCCGAAGGG AAGGAAATCA CCAAGGTGAA GGTGCTGTAA
 
Protein sequence
MSRITLDPIT RIEGHLRIDV EANGGKVTNA WSSAQMWRGI EVILKGRAPE DAWSFVQRFC 
GVCTTVHAIS SIRAVEHALN VEVPLNAQYI RNIMIAQHSV QDHIVHFYHL SALDWVDIVS
ALSADPKKTA QIAQSISDWP GNSEKEFKAV QDKLKAFVAG GKLGIFASGY WGHPAMKLSP
EVNLMAVAHY LKALDYQRKA SQATAILGGK NPHIQNLVVG GVATAINTEN IATLNMERLI
YLRTLMEETR DFVQKVYYPD LLAIAGAYKE WFKYGAGVTN YMAVPEFPED TKNGKFSLPG
GVVMGGNIAG IRPVKDHRDE YLIKNIKENV THAWYEGNGT LHPWEGETKP DYTDFKENGK
YSWCKAPTLD GKPVQVGPPA QIMAAYAAGN PKVKKLVDGA VSTLGLSMKD IHSTMGRLYC
RGARAHIMAD IALENLDKLI ANIATGDQTY ANHTEIPKGE YRGVGFHEAP RGTLSHWIVI
KDKKIENYQA VVPSTWNAAP RNDKGELGPY EASLVGNPIA DSSKPLEVLR TIHSFDPCIA
CAVHTIDPEG KEITKVKVL