Gene Gura_1943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_1943 
Symbol 
ID5164615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp2251128 
End bp2252264 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content54% 
IMG OID640549437 
Producthydrogenase (NiFe) small subunit HydA 
Protein accessionYP_001230706 
Protein GI148264000 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA)
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.162149 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGGAG GTGTTATGAA GAAAGAAGAA GATTTTTTGT GTGCAGGGGT ATCCCGAAGA 
AGTTTTATGA AAACCTGCAT AACTGCCACG GCCATGATGG GGCTGCCGTT CAGCATGCAT
ACCAAGGTTG CCGAAGCGAT GGAGAAAAAC GGCAACCCTT CGGTAATCTG GCTGCATTTC
CAGGAATGTA CCGGTTGTTC AGAGTCGCTC CTCAGGTCTA CCCATCCGAC AATTTCGACT
CTGATCCTGG ATATGATATC CCTCGACTAT CACGAGACGT TGATGGCCGG ATCAGGCGCC
CAGGCTGAAA AGTCGCTGCA CGATTCGATG CTCGCCAACA AAGGCAAGTA CTTGCTGGTT
GTCGAAGGAG CGATTCCGAC CAAGGAGAGC GGCATTTATT GCAAGGTCGG CGGCAAGACT
GCTCTCGAAT CCTTGCAGCA TGCGGCTTCA AATGCGGCTG CCATCATCTC CATCGGCACC
TGCGCATCTT ACGGCGGAAT CCAGTCTGTC GGCCCGAATC CCACCGGCGC CGTAGGGGTG
CGGGATATCG TCAAGGACAA GCCGATCATC AACATTCCCG GCTGCCCTCC CAGTCCCTAT
AATCTGCTTT CCACCGTGAT GTATTACCTG ACGTTCAAAA AAATACCCGA GCTGGATGCA
CTCGGACGGC CGAAATTCGC TTACGGCAGA AAGATCCACG AGCATTGCGA GCGGCGGCCC
CATTTCGATG CCGGCCGGTT TGCCAAGGCG TACGGTGATG ATACCCATGC CCAGGGATAC
TGCTTGTTCA AGCTCGGCTG CAAGGGACCT GCAACCTATG CCAACTGTTC CGTACAGCGC
TTTAATGAAG TTGGCGTCTG GCCGGTATCT GTCGGCCATC CCTGTATCGG CTGTACCGAG
CCGGATGTGC TCTTTAAAAT GGCGATTGCC GACAAGGTGC AGATACACGA ACCTACTCCG
TTTGACAGTT ATGCACCGGT AGATTTGAAG GAAAAAGGTA AGGGTCCGGA ACCGTTGACC
ACGGGCTTTG TCGGACTTGC TGCGGGCGCT GCCCTTGGGG CCGGAGCAAT GCTGGCCAAA
AAGCTGCCGA AAGATGATGG CCACAAGGAG GACGACCACC ATGAAGAACA AGAGTAG
 
Protein sequence
MSGGVMKKEE DFLCAGVSRR SFMKTCITAT AMMGLPFSMH TKVAEAMEKN GNPSVIWLHF 
QECTGCSESL LRSTHPTIST LILDMISLDY HETLMAGSGA QAEKSLHDSM LANKGKYLLV
VEGAIPTKES GIYCKVGGKT ALESLQHAAS NAAAIISIGT CASYGGIQSV GPNPTGAVGV
RDIVKDKPII NIPGCPPSPY NLLSTVMYYL TFKKIPELDA LGRPKFAYGR KIHEHCERRP
HFDAGRFAKA YGDDTHAQGY CLFKLGCKGP ATYANCSVQR FNEVGVWPVS VGHPCIGCTE
PDVLFKMAIA DKVQIHEPTP FDSYAPVDLK EKGKGPEPLT TGFVGLAAGA ALGAGAMLAK
KLPKDDGHKE DDHHEEQE