Gene SeHA_C3397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3397 
Symbolhyb0 
ID6488984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3305011 
End bp3306129 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content54% 
IMG OID642743530 
Producthydrogenase 2 small subunit 
Protein accessionYP_002047145 
Protein GI194450897 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA)
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGGAG ATAATACTCT CATCACTTCT CACGGCATTA ACCGTCGTGA TTTCATGAAG 
CTTTGTGCAG CACTGGCCGC TACTATGGGG CTCAGTAGCA AAGCCGCCGC AGAAATGGCA
GAATCGGTAT CCAATCCACA GCGTCCGCCC GTTATCTGGA TTGGCGCTCA GGAGTGTACC
GGTTGTACCG AATCACTGCT TCGTGCTACA CACCCAACCG TTGAAAACCT CGTTCTGGAG
ACTATCTCTC TGGAATACCA CGAGGTACTT TCCGCCGCAT TCGGTCACCA GGTCGAAGAA
AACAAACATA ACGCTCTGGA GAAGTATAAA GGGCAATATG TTCTGGTGGT GGATGGTTCT
ATCCCACTAA AAGATAACGG TATCTACTGC ATGGTTGCCG GCGAGCCGAT CGTGGATCAC
ATCCGCAAAG CCGCTGATGG CGCAGCCGCG ATTATCGCTA TCGGTTCCTG CTCGGCATGG
GGCGGCGTTG CTGCGGCTGG CGTAAACCCA ACCGGCGCTG TCAGTCTGCA GGAAGTCTTA
CCGGGCAAAA CGGTTATCAA TATTCCAGGT TGTCCGCCAA ACCCGCATAA CTTCCTGGCG
ACCGTCGCGC ATATCATCAC TTACGGCACG CCGCCGAAGC TGGATGCGAA AAATCGTCCA
ACCTTTGCCT ATGGCCGTCT GATTCATGAG CATTGCGAAC GTCGTCCACA CTTCGACGCA
GGCCGTTTTG CCAAAGAATT TGGCGACGAA GGCCACCGTC AGGGCTGGTG TCTCTACCAT
CTTGGCTGTA AAGGGCCGGA AACCTGGGGC AACTGTTCTA CGTTACAGTT CTGTGACGTT
GGCGGCGTCT GGCCAGTGGC GATCGGTCAT CCTTGCTATG GCTGTAACGA AGAAGGTATC
GGCTTCCATA AGGGCATTCA CCAGCTTGCT CATGTCGAAA ACCAAACTCC GCGTTCAGAG
AAACCTGACG TCAATATGAA AGAAGGCGGC AATATCTCTG CGGGCGCTGT CGGTCTGCTT
GGCGGCGTAG TCGGTCTGGT TGCCGGCGTC AGCGTGATGG CGGTACGTGA ACTGGGGCGT
CAGCAAAAGA AAGATAACGC TGACTCACGG GGAGAATAA
 
Protein sequence
MTGDNTLITS HGINRRDFMK LCAALAATMG LSSKAAAEMA ESVSNPQRPP VIWIGAQECT 
GCTESLLRAT HPTVENLVLE TISLEYHEVL SAAFGHQVEE NKHNALEKYK GQYVLVVDGS
IPLKDNGIYC MVAGEPIVDH IRKAADGAAA IIAIGSCSAW GGVAAAGVNP TGAVSLQEVL
PGKTVINIPG CPPNPHNFLA TVAHIITYGT PPKLDAKNRP TFAYGRLIHE HCERRPHFDA
GRFAKEFGDE GHRQGWCLYH LGCKGPETWG NCSTLQFCDV GGVWPVAIGH PCYGCNEEGI
GFHKGIHQLA HVENQTPRSE KPDVNMKEGG NISAGAVGLL GGVVGLVAGV SVMAVRELGR
QQKKDNADSR GE