Gene SeHA_C1984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C1984 
Symbol 
ID6489742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp1932552 
End bp1933754 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content57% 
IMG OID642742189 
Producthydrogenase-1 small chain 
Protein accessionYP_002045832 
Protein GI194449251 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA)
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.286053 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones87 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTGGCGG AACGTCAACG GACCCGCAAA AGCGCCTTAT GCCCCTGGCG GAGCGTGGTG 
CAAAAAGGAG AAAAAATACG CGTTATGAAT AACGAGGAGA CCTTTTATCA AGCCATGCGT
CGTAAGGGAG TGACCCGACG CAGCTTTCTC AAATTCTGTA GCCTTGCCGC CACATCGCTG
GGACTGGGCG CCGGAATGAC GCCAAAGATC GCCTGGGCGC TGGAGAATAA ACCGCGAATT
CCGGTGGTCT GGATTCATGG ACTGGAATGC ACCTGCTGTA CCGAATCCTT TATCCGTTCC
TCGCACCCGC TAGCCAAAGA TGTGATCCTC TCGCTGATTT CCCTTGATTA TGACGACACC
CTGATGGCCG CCGCCGGCGC ACAGGCCGAA GAAGTCTTTG ACGATATTAC CACTCGCTAC
GCCGGGAAAT ACATTCTGGC GGTGGAAGGC AATCCGCCGT TAGGAGAGCA AGGAATGTTC
TGTATCAGCG GCGGCCGCCC GTTTATTGAA AAACTGAAGA AAGCCGCCGC GGGCGCCAGC
GCTATTATCG CCTGGGGAAA CTGCGCCTCC TGGGGTTGCG TCCAGGCCGC CCGCCCCAAT
CCGACCCAGG CAACGCCTAT CGATAAAGTG ATCACCGACA AGCCGATCGT GAAAGTCCCT
GGATGTCCGC CAATCCCGGA TGTCATGAGC GCCATTATCA CCTATATGGT GACGTTTGAT
CGTCTGCCGG AACTCGATCG CATGGGCCGT CCACTGATGT TCTATGGTCA GCGTATCCAC
GATAAATGCT ACCGCCGCGC CCATTTTGAC GCCGGTGAAT TTGTCGAGAG CTGGGATGAT
GACGCCGCCC GCAAGGGATA CTGCCTGTAC AAGATGGGCT GTAAAGGGCC AACCACCTAT
AACGCCTGCT CCTCCACACG CTGGAATGAC GGCGTCTCCT TTCCTATCCA GTCCGGTCAC
GGATGTCTGG GATGCTCAGA AAATGGTTTC TGGGATCGCG GCTCGTTTTA TAGCCGCGTG
GTGGATATTC CCCAGATGGG TACCCATTCA ACCGCCGATA CGGTGGGGCT GACCGCGCTG
GGCGTGGTCG CGGCGGGCGT TGGCGGTCAC GCTGTCGCCA GCGCGCTCAA CCAACGTAAA
CGCCACAAAC AACAGTTAGC GCAAGCCGAA CAACAGCCGG ACAATGAGGA TAAACAGGCA
TGA
 
Protein sequence
MLAERQRTRK SALCPWRSVV QKGEKIRVMN NEETFYQAMR RKGVTRRSFL KFCSLAATSL 
GLGAGMTPKI AWALENKPRI PVVWIHGLEC TCCTESFIRS SHPLAKDVIL SLISLDYDDT
LMAAAGAQAE EVFDDITTRY AGKYILAVEG NPPLGEQGMF CISGGRPFIE KLKKAAAGAS
AIIAWGNCAS WGCVQAARPN PTQATPIDKV ITDKPIVKVP GCPPIPDVMS AIITYMVTFD
RLPELDRMGR PLMFYGQRIH DKCYRRAHFD AGEFVESWDD DAARKGYCLY KMGCKGPTTY
NACSSTRWND GVSFPIQSGH GCLGCSENGF WDRGSFYSRV VDIPQMGTHS TADTVGLTAL
GVVAAGVGGH AVASALNQRK RHKQQLAQAE QQPDNEDKQA