Gene Gmet_3332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGmet_3332 
Symbol 
ID3740725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter metallireducens GS-15 
KingdomBacteria 
Replicon accessionNC_007517 
Strand
Start bp3746145 
End bp3747284 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content67% 
IMG OID637780622 
ProductNi-Fe hydrogenase, small subunit:twin-arginine translocation pathway signal 
Protein accessionYP_386270 
Protein GI78224523 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA)
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones57 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGA ATGGTGATGA GCATGACGAG ATGAAGCAGC ATTGTTCCGG CACGTCGGGG 
CCCTGGGAGG AGCGGGGGGT ATCCCGGCGC GACTTCCTGA AGTTCTGCAC CGCCATGTCG
GCGGCCTTGG CCCTGCCGGT CTCCCTGGCG CCACGCATTG CCGAGGCGCT GGAGAGCGAC
AGTCGGCCGT CGGTGATCTG GCTCGAATTC CAGAGTTGCA CCGGCGACAC CGAGGCGCTC
CTGCGGGCCG CTAACCCCAC GGTGGGTGAG ATCGTCCTCG ACGTCCTCTC CATTGATTAT
GCCGAGACCA TCATGGCCGC CGCCGGCCAC CAGGCCGAGG AGGCGCGGCT GAAGACCCTG
AAAGAGCGAA GTGGCAAGTA CATCGCCGTC GTCGAGGGGG CGATCCCCAT GAAGGACAAC
GGCGTCTACT GCTGCGTCGG CGGGAGATCG GCCGTGGATA TCGCCCGGGA GGTCTGCGGC
GGCGCCATGG CCACCATCAC CGTCGGCACC TGCGCCTCCT ACGGCGGCAT CCCGGCTGCA
TCCCCCAACC CCACCGGCGC CGTGGGGGTC AAGGACGCGG TCCCCGGCGC CACGGTCATC
AACCTTCCCG GCTGCCCCGT CAACACCGAC AATCTGGTGG CCACCGTGGT CCATATCCTC
ACCTTCGGCA AGCTCCCGGC CACCGACAGC AAGGGACGCC CCCTTTTCGC CTACGGCAAG
CGGATTCACG ACAACTGCGA ACGCCGCCCC CACTTCGACG CCGGCCAGTA CGTGGAGCAA
TGGGGCGACC AGGCCCACCG TGCCGGCCAC TGCCTCTACA AGATGGGGTG CAAGGGGCCC
GAAACCTTCC ACAACTGCCC GACCCAGCGC TACAACGAGA AGACGAGCTG GCCAGTGGGA
TCAGGCCACG GCTGTGCCGG CTGCTCCGAG CCCCACTTCT GGGACACCAT GACCCCCTTC
TACCGGCGGC TTCCCAGCGT TCCCGGTTTC GGGATCGAGG CCACGGCCGA CAAGATCGGC
CTTGGCGTCG CTGCGGCCAC GGCGGCGGTC TTCGGCATCC ACGGCGTGGT GAGCGCGCTG
CGCAAGGGAG ATGAATCCGA CGGGGAAGGA GGGGTAGACC ATGGCCAGGA TCGTCGTTGA
 
Protein sequence
MAKNGDEHDE MKQHCSGTSG PWEERGVSRR DFLKFCTAMS AALALPVSLA PRIAEALESD 
SRPSVIWLEF QSCTGDTEAL LRAANPTVGE IVLDVLSIDY AETIMAAAGH QAEEARLKTL
KERSGKYIAV VEGAIPMKDN GVYCCVGGRS AVDIAREVCG GAMATITVGT CASYGGIPAA
SPNPTGAVGV KDAVPGATVI NLPGCPVNTD NLVATVVHIL TFGKLPATDS KGRPLFAYGK
RIHDNCERRP HFDAGQYVEQ WGDQAHRAGH CLYKMGCKGP ETFHNCPTQR YNEKTSWPVG
SGHGCAGCSE PHFWDTMTPF YRRLPSVPGF GIEATADKIG LGVAAATAAV FGIHGVVSAL
RKGDESDGEG GVDHGQDRR