Gene Gura_0544 details


Gene Information

Locus tag: Gura_0544
Symbol:
ID: 5163161
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 636557
End bp: 638257
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 60%
IMG OID: 640548046
Product: nickel-dependent hydrogenase, large subunit
Protein accession: YP_001229331
Protein GI: 148262625
COG category: [C] Energy production and conversion
COG ID: [COG0374] Ni,Fe-hydrogenase I large subunit
TIGRFAM ID:
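
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming 1-based inclusive coordinates and a single untranslated stop codon at the end of the CDS (neither convention is stated in this record; the variable names are illustrative):

```python
# Values copied from the gene information fields.
start_bp = 636557
end_bp = 638257

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1701 0 566
```

Under those assumptions the result matches the reported gene length of 1701 bp and protein length of 566 aa.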


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCAAGA TCGTTATCGA CCCAATTACC AGGATAGAAG GACATCTCCG TATCGAAGCA 
GAGGTGAATA ACGGAAAGGT TTCCGACGCC TGGAGCAGCA GTACCATGTT CCGCGGCATT
GAAAAAATCC TCAAGACCAG GGACCCACGC GACGCCTGGT TCTTTACCCA GCGGTTCTGC
GGCGTCTGCA CCACGGTGCA TTCCATTGCC TCGATCCGCG CCGTGGAGAA CGCCCTCAAC
ATCAAGATCC CTGCCAATGC CGAGCTGATC AGAAACATCA TCATCGGCAT CCAGAACGTC
CAGGACCACG TGATCCACTT CTACCACCTC CACGCCCTGG ACTGGGTGGA CATCACCTCG
GGGCTCAAGG CCGATCCGGC CAAGACTGCC GCCCTGGCAG CCTCCATTTC CGACTGGCCC
CTAAACTCCG CCACCTACTT CAGAGGGGTG CAGGAGAAGC TGAAGGCCTT CGTCGCCAAG
GGGCGTCTCG GCCCCTTCGC CAACGCCTAC TGGGGGCACC CGGCCTACAG ACTTCCCCCC
GAGGCGAACC TGATGGCCAC CGCCCACTAC CTGGAGGCAC TGGAGTGGCA AAAGGACGTC
ATCAAGATAC ACGCCATCCT GGGAAGCAAG AATCCCCATC CTCAGACTTT CCTGGTAGGC
GGCATGGCGA TCGCCATCGA TCCCGATTCC CAGAACGCCC TCAACGCCGA CAAGCTGATG
GAGATCAAGC GGCTCCTCGC CAAGGCCAGG GAGTTCGTGG AAAAGGTCTA CATCCCGGAT
CTCCTGGCGG TGGCATCCTT CTACAAGGAG TGGGCAGGCA TCGGCGGCGG AGTCGGCAAT
TTCCTCTCTT ACGGCGACTT CCCCCAGCCC GGCAGTGGCC AGCTCTGGAT GCCTTCAGGA
GCCATCATGG GTAAGGACCT GTCCAAAGTC ATCCCTGTCA GCCACGAAAA GGTCACCGAG
TATGTGGACC ACTCCTGGTA TGAATATTCC GGCGGAGCCG GCAAGGGTCT TCATCCATGG
GAAGGGGAGT CGGAGCACAA GTATAGCGGC CCGAAACCTC CCTTCAAGAA CCTGGACACC
GACGGCAAAT ACTCCTGGGT CAAGGCGCCG CGGTATGAGG ACCAGCCGAT GGAAGTCGGT
CCGCTGGCCC GGCTGCTGGT CGCCTACGCG TCAGGACACA AAGAGGTCAA AGGCGCGGTG
GACGGCGTCC TGCATAAACT GGGAGTCGGA CCCGAGGCAC TCTTCTCCAC GTTGGGGCGT
ACCGCGGCAC GTGGCATCGA CTGCCTCCTG ATCGCCCAGG AAACGCCGAA ATGGCTCGAC
CAGCTCATCG ACAATATCGG CAAAGGAGAT TACAAGGTCC ACAACAACGA GAAGTGGGAT
CCCGCCACCT GGCCCGCCGA AGCGGCAGGT TACGGCTGGC ATGAGGCACC GCGCGGAGCG
CTGGGGCACT GGATCAAGAT CAAGGACCAA AAAATACTCA ACTACCAGGC GGTGGTACCC
TCCACCTGGA ACGCCTCCCC CCGTGATGCG AAAGGGCTCA GGGGCCCCTA CGAGGCGGCG
CTGGTCGGCA CTCCGCTGGC AGACCCCAAT AAGCCGCTGG AGATCCTCAG GACCATTCAC
TCATTCGACC CCTGCCTTGC CTGCGCTGTC CATGTTTTCG ATGCAAGCGG CAACGAAATG
GTGAAAGTCA ACGTGCTCTA G
 
Protein sequence
MAKIVIDPIT RIEGHLRIEA EVNNGKVSDA WSSSTMFRGI EKILKTRDPR DAWFFTQRFC 
GVCTTVHSIA SIRAVENALN IKIPANAELI RNIIIGIQNV QDHVIHFYHL HALDWVDITS
GLKADPAKTA ALAASISDWP LNSATYFRGV QEKLKAFVAK GRLGPFANAY WGHPAYRLPP
EANLMATAHY LEALEWQKDV IKIHAILGSK NPHPQTFLVG GMAIAIDPDS QNALNADKLM
EIKRLLAKAR EFVEKVYIPD LLAVASFYKE WAGIGGGVGN FLSYGDFPQP GSGQLWMPSG
AIMGKDLSKV IPVSHEKVTE YVDHSWYEYS GGAGKGLHPW EGESEHKYSG PKPPFKNLDT
DGKYSWVKAP RYEDQPMEVG PLARLLVAYA SGHKEVKGAV DGVLHKLGVG PEALFSTLGR
TAARGIDCLL IAQETPKWLD QLIDNIGKGD YKVHNNEKWD PATWPAEAAG YGWHEAPRGA
LGHWIKIKDQ KILNYQAVVP STWNASPRDA KGLRGPYEAA LVGTPLADPN KPLEILRTIH
SFDPCLACAV HVFDASGNEM VKVNVL
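
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one possible way to do that in plain Python, and it also includes a helper for computing GC content. It assumes that the sequence is read in frame from its first base and that internal codons follow the standard codon assignments, which NCBI translation table 11 shares with the standard code (the two tables differ only in permitted start codons). The function names `clean`, `translate`, and `gc_percent` are illustrative and not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence.
print(translate("ATGGCCAAGA TCGTTATCGA CCCAATTACC"))  # MAKIVIDPIT
# Final blocks of the gene sequence, ending in the TAG stop codon.
print(translate("GTGAAAGTCA ACGTGCTCTA G"))  # VKVNVL
```

Passing the full gene sequence to `translate` should yield the full protein sequence under the same assumptions, and `gc_percent` can be compared against the GC content reported in the gene information.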