Gene Gura_1122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_1122 
Symbol 
ID5166337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp1324027 
End bp1325304 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content58% 
IMG OID640548627 
Productnickel-dependent hydrogenase, large subunit 
Protein accessionYP_001229900 
Protein GI148263194 
COG category[C] Energy production and conversion 
COG ID[COG3259] Coenzyme F420-reducing hydrogenase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAACG TTATTGAGAT AAAGCCCCTC ACCCGGGTGG AGGGACACGG CATCGCCCGG 
GTTTACATGG ACGGCAAGCG GGCCGAACGG GTCGAACTTG CGCTGACTGA GCCGCCGCGG
CTGTTTGAGG CGCTCCTCTT GGGGAAAAGC TTCGAGGAGG TGCCGGAGAT CATCTGCCGC
ATCTGCTCCC TCTGCTCCAC AATCCATCGG GTAACGTCCC TGCTGGCCGT TGAAAATGCC
CTGGGGATCG ATGTCTCCGA GGAGACCCGC CTCTACCGGG AACTGATCGT TAACGGCGGC
CATATCCAGA GCCACGCCCT CCATCTTTTC TGCCTTGCCC TGCCCGATTA CTATAATGCC
GCAAGCTTTG CCGACCTGGC AACACATGCG CCGGAACTGT TAAAACTGGG GCTGCGGATA
AAAGGAGTTG GCAATTTGAT ACAGGAAACC GTAGGGGGGC GGCTTATTCA CCCGGTGAAC
ATCGTCCCCG GCGGCATGGG AAAACGGGTC AACCACGCGG GATTACGGGA ACTACAAGTG
GCACTCGAAA CCATCCTCCC CGACACCATC GAAGCTTGCA ATCTGTTCGC TTCATTCGCC
GTTCCCGGCC CCCCTCTGCC GCGTTCAATA TTCATGGCCG TGCAAGGGGA AACATCATCT
CCGCTGTTCG GCGACAGGCT GGATTTAAGC AACGGTCGAT CCTTCATGGC CTCGAACTAC
CGGGAAGCCT TACCGGAAAA GGTCATCGGC CATTCCTATG CCAAGCAAAG CCGCTTCGAA
GGAGAAGCCG TCATCGTCGG CGCACTAGCC CGGCTCAATG TGGGAATGGG GCTGACAACC
ATGGCTAATC AGGCTTTTCT TGATGCGCGG GAAAAACTGA TTGACACGGA CATCCGGGGC
AACAATCTGG CCCAGGCCAT AGAGCTGATT CTGGCCGTAG AATGCTCCCT GGAGATCATA
GCCACCCTTT TGAGCTTCGA GACAAAACCT GACAGACCGG TTTCTATCGC GCCGCGAAAA
GGGAGCGGCA GCGCCGCCAC CGAGGCGCCA AGGGGAGTTC TCATCCACAG CTACAGCTTC
GACAACAGGG GGTTCTGCAC GGCCGCAGAC ATCATCACCC CCACCGCCAT CAACCAGGCA
GCCATGGAAC GTGATCTGCT GGCCCTTGCC AGGGAAATGG AAGGTGCAGA CGAGGCTGAG
TTGAAACTGA AGCTCGAAAT GCTGGTACGC GCCTACGATC CCTGCATCTC CTGCGCAGTC
CACCTGGTCA GGCTCTAG
 
Protein sequence
MSNVIEIKPL TRVEGHGIAR VYMDGKRAER VELALTEPPR LFEALLLGKS FEEVPEIICR 
ICSLCSTIHR VTSLLAVENA LGIDVSEETR LYRELIVNGG HIQSHALHLF CLALPDYYNA
ASFADLATHA PELLKLGLRI KGVGNLIQET VGGRLIHPVN IVPGGMGKRV NHAGLRELQV
ALETILPDTI EACNLFASFA VPGPPLPRSI FMAVQGETSS PLFGDRLDLS NGRSFMASNY
REALPEKVIG HSYAKQSRFE GEAVIVGALA RLNVGMGLTT MANQAFLDAR EKLIDTDIRG
NNLAQAIELI LAVECSLEII ATLLSFETKP DRPVSIAPRK GSGSAATEAP RGVLIHSYSF
DNRGFCTAAD IITPTAINQA AMERDLLALA REMEGADEAE LKLKLEMLVR AYDPCISCAV
HLVRL