Gene SNSL254_A3401 details


Gene Information

Locus tag: SNSL254_A3401
Symbol: hybB
ID: 6483143
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3299818
End bp: 3300996
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 53%
IMG OID: 642738692
Product: putative hydrogenase 2 b cytochrome subunit
Protein accession: YP_002042412
Protein GI: 194443624
COG category: [C] Energy production and conversion
COG ID: [COG5557] Polysulphide reductase
TIGRFAM ID:
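
The coordinates and lengths listed in this record agree with each other, which a short check can make explicit. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon not counted in the protein length. The variable names are illustrative only and do not come from the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3299818
end_bp = 3300996

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1179
print(protein_length_aa)  # 392
```

Both results match the Gene Length and Protein Length fields reported above.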


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 79
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCATG ATCCAAAACC GCTGGGCGGA AAAATCATCA GTAAACCGGT CATTATTTTT 
GGACCGTTAA TCGTCCTGTG TATGCTCCTT ATCGTGAAGC GTCTGGTCTT CGGGTTGGGC
TCCGTTTCCG ACCTGAACGG CGGTTTCCCG TGGGGCGTCT GGATTGCCTT TGACCTGTTG
ATCGGCACCG GCTTTGCCTG CGGCGGTTGG GCGTTGGCAT GGGCGGTGTA TGTCTTTAAC
CGTGGGCAAT ACCATCCGTT GGTGCGCCCG GCGTTGCTGG CAAGCTTGTT TGGTTACTCG
CTGGGCGGCC TGTCGATCAC TATCGACGTC GGTCGTTACT GGAACCTGCC GTACTTCTAC
ATTCCAGGTC ACTTCAACGT GAACTCGGTA CTGTTTGAGA CGGCGGTCTG TATGACCATC
TACATCGGCG TGATGGCGCT GGAGTTTGCG CCTGCCCTGT TTGAACGTCT GGGCTGGAAA
GTGTCGCTCA AGCGTCTGAA TAAGGTGATG TTCTTTATTA TCGCGCTGGG CGCGCTGCTG
CCGACGATGC ACCAGTCCTC AATGGGGTCA TTGATGATCT CGGCGGGCTA TAAAGTGCAT
CCGCTATGGC AAAGCTATGA AATGCTGCCG CTGTTCTCGG TTCTGACCGC GTTTATCATG
GGCTTCTCCA TTGTCATATT TGAGGGTTCG CTGGTTCAGG CAGGCCTGAA AGGAAACGGT
CCGGATGAGA AAAATTTGTT TGTTAAGCTG ACGAATACCA TCAGCGTGCT GCTGGCGATT
TTCGTCGTCC TGCGCTTTGG CGAACTGATT TATCGCGACA AGCTGTCCTA TGCGTTTGCC
GGCGATTTTT ACTCCGCTAT GTTCTGGATT GAAGTCGTCC TGATGGTCTT CCCGTTAGTG
GTGCTGCGTG TGGCGAAACT GCGTAATGAC TCTCGTATGC TGTACCTGTC GGCGCTGAGC
GCGCTGTTGG GCTGCGCGAC GTGGCGTCTG ACCTATTCGC TGGTGGCATT CAACCCGGGT
GGCGGCTACC ACTACTTCCC AACCTGGGAA GAATTGTTGA TTTCTATTGG TTTTGTGGCC
ATTGAGATTT GTGCATACAT CGTACTCATT CGTCTACTGC CGATACTTCC TCCTTTAAAA
CAAAACGATC ATAATCGTCA TGAGGCGAGC AAAGCATGA
 
Protein sequence
MSHDPKPLGG KIISKPVIIF GPLIVLCMLL IVKRLVFGLG SVSDLNGGFP WGVWIAFDLL 
IGTGFACGGW ALAWAVYVFN RGQYHPLVRP ALLASLFGYS LGGLSITIDV GRYWNLPYFY
IPGHFNVNSV LFETAVCMTI YIGVMALEFA PALFERLGWK VSLKRLNKVM FFIIALGALL
PTMHQSSMGS LMISAGYKVH PLWQSYEMLP LFSVLTAFIM GFSIVIFEGS LVQAGLKGNG
PDEKNLFVKL TNTISVLLAI FVVLRFGELI YRDKLSYAFA GDFYSAMFWI EVVLMVFPLV
VLRVAKLRND SRMLYLSALS ALLGCATWRL TYSLVAFNPG GGYHYFPTWE ELLISIGFVA
IEICAYIVLI RLLPILPPLK QNDHNRHEAS KA
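
The gene and protein sequences above can be checked against each other with a small script. The following is a rough sketch, not the database's own method: it assumes the sequences are pasted in as text with the blocks of ten separated by whitespace, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs from the standard code in its start codons only). The function names `clean`, `gc_content`, and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()

def gc_content(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)

def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence from this record.
first_block = "ATGAGTCATG ATCCAAAACC GCTGGGCGGA AAAATCATCA GTAAACCGGT CATTATTTTT"
print(translate(first_block))  # MSHDPKPLGGKIISKPVIIF
```

The printed residues match the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.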