Gene SNSL254_A1924 details


Gene Information

Locus tag: SNSL254_A1924
Symbol:
ID: 6485437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 1883166
End bp: 1884311
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 57%
IMG OID: 642737293
Product: hydrogenase-1 small chain
Protein accession: YP_002041043
Protein GI: 194444862
COG category: [C] Energy production and conversion
COG ID: [COG1740] Ni,Fe-hydrogenase I small subunit
TIGRFAM ID: [TIGR00391] hydrogenase (NiFe) small subunit (hydA)
            [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
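
The coordinates and lengths listed in this record are mutually consistent, which a little arithmetic confirms. The sketch below assumes (the record does not say so explicitly) that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1883166, 1884311)
print(length_bp)                  # 1146
print(protein_length(length_bp))  # 381
```

Both values agree with the gene length (1146 bp) and protein length (381 aa) given in the record.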


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 76
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAAAAAG GAGAAAAAAT ACGCGTTATG AATAACGAGG AGACCTTTTA TCAAGCCATG 
CGTCGTAAGG GAGTGACCCG ACGCAGCTTT CTCAAATTCT GTAGCCTTGC CGCCACATCG
CTGGGACTGG GCGCCGGAAT GACGCCAAAG ATCGCCTGGG CGCTGGAGAA TAAACCGCGG
ATTCCGGTGG TCTGGATTCA TGGACTGGAA TGCACCTGCT GTACCGAATC CTTTATCCGT
TCCTCGCACC CGCTAGCCAA AGATGTGATC CTCTCGCTGA TTTCCCTCGA TTATGACGAC
ACCCTGATGG CCGCCGCCGG CGCACAGGCC GAAGAAGTCT TTGACGATAT TACCACTCGC
TACGCCGGGA AATACATTCT GGCGGTGGAA GGCAATCCGC CGCTAGGAGA GCAAGGAATG
TTCTGTATCA GCGGCGGCCG CCCGTTTATT GAAAAACTGA AGAAAGCCGC CGCGGGCGCC
AGCGCTATTA TCGCCTGGGG AAACTGCGCC TCCTGGGGTT GCGTCCAGGC CGCCCGCCCC
AATCCGACCC AGGCAACGCC TATCGATAAA GTGATCACCG ACAAGCCGAT CGTGAAAGTC
CCTGGATGTC CACCAATCCC GGATGTCATG AGCGCCATTA TCACCTATAT GGTGACGTTT
GATCGTCTGC CGGAACTCGA TCGCATGGGC CGTCCACTGA TGTTCTACGG TCAGCGTATC
CACGATAAAT GCTACCGTCG CGCCCATTTT GACGCCGGTG AATTTGTCGA GAGCTGGGAT
GATGACGCCG CCCGCAAGGG ATACTGTCTG TACAAGATGG GCTGTAAAGG GCCAACCACC
TATAACGCCT GCTCCTCCAC TCGCTGGAAT GACGGCGTCT CCTTTCCTAT CCAGTCCGGT
CACGGATGTC TGGGATGTTC AGAAAATGGT TTCTGGGATC GCGGCTCGTT TTATAGCCGC
GTGGTGGATA TTCCCCAGAT GGGTACCCAT TCAACCGCCG ATACGGTGGG GCTGACCGCG
CTGGGCGTGG TCGCGGCGGG CGTTGGCGGT CACGCTGTCG CCAGCGCGCT CAACCAACGT
AAACGCCACA AACAACAGTT AGCGCAAGCC GAACAACAGC CGGACAATGA GGATAAACAG
GCATGA
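
The gene begins with GTG rather than ATG. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is an accepted initiation codon and is read as methionine at the start position, which is why the protein sequence begins with M. The following is a minimal sketch of such a translation, not the database's own pipeline: the codon table is built from the standard amino acid assignments, which table 11 shares, and the helper name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Amino acid assignments in NCBI order (first base T, C, A, G; then second; then third).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)
# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as Met and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks used for display
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCAAAAAG GAGAAAAAAT ACGCGTTATG"))  # MQKGEKIRVM
```

The output matches the first ten residues of the protein sequence in this record. Passing the full gene sequence to the same function should reproduce the complete 381-residue protein.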
 
Protein sequence
MQKGEKIRVM NNEETFYQAM RRKGVTRRSF LKFCSLAATS LGLGAGMTPK IAWALENKPR 
IPVVWIHGLE CTCCTESFIR SSHPLAKDVI LSLISLDYDD TLMAAAGAQA EEVFDDITTR
YAGKYILAVE GNPPLGEQGM FCISGGRPFI EKLKKAAAGA SAIIAWGNCA SWGCVQAARP
NPTQATPIDK VITDKPIVKV PGCPPIPDVM SAIITYMVTF DRLPELDRMG RPLMFYGQRI
HDKCYRRAHF DAGEFVESWD DDAARKGYCL YKMGCKGPTT YNACSSTRWN DGVSFPIQSG
HGCLGCSENG FWDRGSFYSR VVDIPQMGTH STADTVGLTA LGVVAAGVGG HAVASALNQR
KRHKQQLAQA EQQPDNEDKQ A
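
The TIGR01409 annotation in this record indicates a twin-arginine translocation (Tat) signal sequence. The commonly cited Tat consensus is S/T-R-R-x-F-L-K, and a simple pattern scan finds a matching stretch near the N-terminus of this protein. The sketch below is only an illustration: the regular expression is a simplification of the consensus, the function name is hypothetical, and a pattern match is not equivalent to the profile-based search behind a TIGRFAM assignment.

```python
import re

# Simplified Tat consensus: S or T, two arginines, any residue, then F-L-K.
TAT_MOTIF = re.compile(r"[ST]RR.FLK")


def find_tat_motif(protein: str):
    """Return (1-based start position, matched residues) or None if no match."""
    seq = "".join(protein.split()).upper()  # remove display spacing
    match = TAT_MOTIF.search(seq)
    if match is None:
        return None
    return match.start() + 1, match.group()


# First 40 residues of the protein sequence above.
n_terminus = "MQKGEKIRVM NNEETFYQAM RRKGVTRRSF LKFCSLAATS"
print(find_tat_motif(n_terminus))  # (26, 'TRRSFLK')
```

The match at residues 26 to 32 (TRRSFLK) is consistent with the Tat signal annotation.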