Gene SeD_A3498 details


Gene Information

Locus tag: SeD_A3498
Symbol: hybO
ID: 6875318
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3359695
End bp: 3360813
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 54%
IMG OID: 642786490
Product: hydrogenase 2 small subunit
Protein accession: YP_002217127
Protein GI: 198244463
COG category: [C] Energy production and conversion
COG ID: [COG1740] Ni,Fe-hydrogenase I small subunit
TIGRFAM ID: [TIGR00391] hydrogenase (NiFe) small subunit (hydA)
            [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.271513
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 71
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGGAG ATAATACTCT CATCACTTCT CACGGCATTA ACCGTCGTGA TTTCATGAAG 
CTTTGTGCAG CACTGGCCGC TACTATGGGG CTCAGTAGCA AAGCCGCCGC AGAAATGGCA
GAATCGGTAT CCAATCCACA GCGTCCGCCC GTTATCTGGA TTGGCGCTCA GGAGTGTACC
GGTTGTACCG AATCACTGCT TCGTGCTACA CACCCAACCG TTGAAAACCT CGTTCTGGAG
ACTATCTCTC TGGAATACCA CGAGGTACTT TCCGCCGCAT TCGGTCACCA GGTCGAAGAA
AACAAACATA ACGCTCTGGA GAAGTATAAA GGGCAATATG TTCTGGTGGT GGATGGTTCT
ATCCCACTAA AAGATAACGG TATCTACTGC ATGGTTGCCG GCGAGCCGAT CGTGGATCAC
ATCCGCAAAG CCGCTGACGG CGCAGCCGCG ATTATCGCTA TCGGTTCCTG CTCGGCATGG
GGCGGCGTTG CTGCGGCTGG CGTAAACCCA ACCGGCGCTG TCAGTCTGCA GGAAGTCTTA
CCGGGCAAAA CGGTTATCAA TATTCCAGGT TGTCCGCCAA ACCCGCATAA CTTCCTGGCG
ACCGCCGCGC ATATCATCAC TTACGGCACG CCGCCGAAGC TGGATGCGAA AAATCGTCCA
ACCTTTGCCT ATGGCCGTCT GATTCATGAG CATTGCGAAC GTCGTCCACA CTTCGACGCA
GGCCGTTTTG CCAAAGAATT TGGCGACGAA GGCCACCGTC AGGGCTGGTG TCTCTACCAT
CTTGGCTGTA AAGGGCCGGA AACCTGGGGC AACTGTTCTA CGTTACAGTT CTGTGACGTT
GGCGGCGTCT GGCCAGTGGC GATCGGTCAT CCTTGCTATG GCTGTAACGA AGAAGGTATC
GGCTTCCATA AGGGCATTCA CCAGCTTGCT CATGTCGAAA ACCAAACTCC GCGTTCAGAG
AAACCTGACG TCAATATGAA AGAAGGCGGC AATATCTCTG CGGGCGCTGT CGGTCTGCTT
GGCGGCGTAG TCGGTCTGGT TGCCGGCGTC AGCGTGATGG CGGTACGTGA ACTGGGGCGT
CAGCAAAAGA AAGATAACGC TGACTCACGG GGAGAATAA
 
Protein sequence
MTGDNTLITS HGINRRDFMK LCAALAATMG LSSKAAAEMA ESVSNPQRPP VIWIGAQECT 
GCTESLLRAT HPTVENLVLE TISLEYHEVL SAAFGHQVEE NKHNALEKYK GQYVLVVDGS
IPLKDNGIYC MVAGEPIVDH IRKAADGAAA IIAIGSCSAW GGVAAAGVNP TGAVSLQEVL
PGKTVINIPG CPPNPHNFLA TAAHIITYGT PPKLDAKNRP TFAYGRLIHE HCERRPHFDA
GRFAKEFGDE GHRQGWCLYH LGCKGPETWG NCSTLQFCDV GGVWPVAIGH PCYGCNEEGI
GFHKGIHQLA HVENQTPRSE KPDVNMKEGG NISAGAVGLL GGVVGLVAGV SVMAVRELGR
QQKKDNADSR GE
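
The length, GC content and protein fields in this record can be cross-checked against the sequences with a short script. The following is a minimal sketch, not the method the database itself uses. The function names (clean, gene_length, gc_fraction, translate) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for elongation. Alternative start codons allowed by table 11 are not handled here; this gene starts with ATG, so that does not matter for this entry.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order.
# Assumption: translation table 11 uses the same assignments for
# elongation; only the set of permitted start codons differs.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base blocks."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp of a gene given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence (0.0 if empty)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Coordinates from this record: 3360813 - 3359695 + 1 = 1119 bp,
# and (1119 - 3) / 3 = 372 residues once the stop codon is excluded.
print(gene_length(3359695, 3360813))                 # 1119
print(translate("ATGACTGGAG ATAATACTCT CATCACTTCT"))  # MTGDNTLITS
```

Passing the full gene sequence to translate should reproduce the protein sequence above, and gc_fraction on the same input gives a value to compare with the reported GC content.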