Gene GSU0785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0785 
Symbol 
ID2687322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp845690 
End bp847372 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content64% 
IMG OID637125457 
Productnickel-dependent hydrogenase, large subunit 
Protein accessionNP_951842 
Protein GI39995891 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.433799 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAAAC GTATCACCAT AGACCCCATC ACCCGGATCG AGGGTCACCT GAGAATCGAT 
GTGGAAGTGA ATGGCGGCCA GGTTGCCAAG GCCTGGTCCT CGGCCCAGAT GTGGCGGGGC
ATCGAGACCA TCCTCAAGGG GCGCGATCCC CAGGACGCCT GGTCCTACGC CCAGCGTTTC
TGCGGCGTCT GCACTACGGT GCACGCCATC TCGTCCATCC GTTCCGTGGA GAACGCCCTG
AACGTGGAAG TTCCCCTCAA CGCCCAGTAC ATCCGCAATA TCATGATCGC CCAGCACTCG
GTGCAGGATC ACATCGTCCA CTTCTACCAC CTGTCCGCCC TGGACTGGGT CGACATCGTG
TCCGCCCTGA AGGCCGACCC GAAGAAGGCC TCCTCCATCG CCCAGAGCCT GTCCGACTGG
CCCGGCAACA GCGAGAAGGA GTTCAAGGCG GTCCAGGACA AGCTCAAGGC GTTCGTGGCC
AGCGGCCAGC TCGGCATCTT CGCCTCCGGC TACTGGGGCC ATCCGGCCAT GAAGCTGCCG
CCCGAGGTGA ACCTGATCGC CGTGGCCCAC TACCTCAAGG CTCTCGACTA CCAGCGCCGG
GCTTCACAGG CTGTGGCCAT TCTCGGCGGC AAGAACCCCC ACGTTCAGAA CCTGGTGGTG
GGCGGGGTAG CCACCGCTGT GAACATGGAG AACATCGCCA CCCTCAACAT GGAGCGGATC
GCCTTCCTCC GCACCCTTAT GGAAGAGACC CGCGAGTTCG TCCAGAAGGT CTACTACCCG
GACCTGGTGG CCATCGCTTC CTTCTACAAG GAATGGTTCA AGTACGGCGC CGGCGTCACC
AACTACCTGG CGGTCCCCGA GTTCGCCGAA GACACCCGCA ACACCAAGTT CGGCCTCCCC
GGCGGCACCA TCTACGGCGG CGATCTGGGT ACCTTCAAGG CGATCACCAC CCACCAGGAT
GCGGCGCTCA TTCAGGGGGT CACCGAAGGG GTGGCTCACG CCTGGTACGA AGGCTCCGAT
TCGCTCCACC CCTGGGAAGG GGAGACCAAG CCCCAGTACA CCGACTTCCA GGAGAACGGC
AAGTACACCT GGTGCAAGTC GCCGCGCTAC AACGGCAAGC CCATGCAGGT TGGACCGCCC
GCCCAGGTCA TGGCCGCCTA CGCCACCGGC CATCCCAAGG TCAAGAAGCT GGTGGACGAC
GCCGCAGCCA AGCTCGGCAT CGGTCCCAAG GAGCTCCACT CCACCATGGG CCGGCTCTTC
TGCCGGGGCG TGCGCGCCCA CGTCATGGCC GACTACTCCC TGGAGTACCT GGACAAGCTC
GTGGCCAACA TCGGTAAGGG TGACTCCACC TACGCCAACC ACACGGAAAT CCCCGACGGC
GAGTACAAGG GGGTCGGCTT CCACGAGGCG CCCCGTGGTG CCCTGTCCCA CTGGATAGTC
ATCGAGAAGA AGAAGATCAA GAACTACCAG GCCGTGGTTC CGTCCACCTG GAACGCCTCG
CCGCGGGACG AGAACGGCGT TGCCGGTCCC TACGAAGCCT GCCTCGTGGG CAACCCCGTG
GCTCAGCCGG ACAAGCCGCT GGAAGTGCTG CGCACCATCC ACTCCTTCGA CCCCTGCATT
GCGTGCGCCG TCCACACCAT CGACCCGGCC GGCAAGGAGA TCACCAAGGT CAAGGTCCTG
TAA
 
Protein sequence
MSKRITIDPI TRIEGHLRID VEVNGGQVAK AWSSAQMWRG IETILKGRDP QDAWSYAQRF 
CGVCTTVHAI SSIRSVENAL NVEVPLNAQY IRNIMIAQHS VQDHIVHFYH LSALDWVDIV
SALKADPKKA SSIAQSLSDW PGNSEKEFKA VQDKLKAFVA SGQLGIFASG YWGHPAMKLP
PEVNLIAVAH YLKALDYQRR ASQAVAILGG KNPHVQNLVV GGVATAVNME NIATLNMERI
AFLRTLMEET REFVQKVYYP DLVAIASFYK EWFKYGAGVT NYLAVPEFAE DTRNTKFGLP
GGTIYGGDLG TFKAITTHQD AALIQGVTEG VAHAWYEGSD SLHPWEGETK PQYTDFQENG
KYTWCKSPRY NGKPMQVGPP AQVMAAYATG HPKVKKLVDD AAAKLGIGPK ELHSTMGRLF
CRGVRAHVMA DYSLEYLDKL VANIGKGDST YANHTEIPDG EYKGVGFHEA PRGALSHWIV
IEKKKIKNYQ AVVPSTWNAS PRDENGVAGP YEACLVGNPV AQPDKPLEVL RTIHSFDPCI
ACAVHTIDPA GKEITKVKVL