Gene SNSL254_A3052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3052 
SymbolhycE 
ID6485677 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2970395 
End bp2972104 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content57% 
IMG OID642738367 
Productformate hydrogenlyase subunit E 
Protein accessionYP_002042091 
Protein GI194442683 
COG category[C] Energy production and conversion 
COG ID[COG3261] Ni,Fe-hydrogenase III large subunit
[COG3262] Ni,Fe-hydrogenase III component G 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value0.309808 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAG AAAAATTAGG TCAACAATAC CTTGCGGCGC TGCACCAGGC GTTTCCGGGC 
GTCGTGCTGG ACGAAGCCTG GCAGACCAAA GATCAGCTGA CTATTACGGT AAAAGTGAAC
TATCTGCCGG AAGTGGTGGA GTTTCTTTAC TACCAGCAGG GTGGGTGGCT GTCGGTGCTG
TTCGGTAATG ACGAACGCCA GTTGTGCGGC CACTATGCCG TTTATTACGT GCTGTCGATG
GAGCAGGGCA CGAAGTGCTG GATTACCGTC CGCGTTGAAG TGGATGCCAA TAAGCTGGAA
TTCCCATCCG TTACGCCGCG CGTGCCGGCT GCCGTGTGGG GTGAGCGCGA AGTACGCGAC
ATGTACGGTT TAATCCCGGT CGGTCTGCCG GACGAGCGCC GTCTGGTGCT GCCGGACGAC
TGGCCGGATG AACTCTATCC GCTGCGTAAA GACAGCATGG ATTATCGTCA GCGCCCGGCG
CCGACCACCG ATGCGGAAAC CTACGAGTTC ATTAACGAGC TGGGTGACAA GAAAAATAAC
GTGGTGCCGA TTGGCCCGCT GCATGTCACT TCTGATGAAC CGGGCCACTT CCGTCTGTTC
GTCGATGGCG AAAACATTAT CGACGCCGAC TACCGCCTGT TCTACGTCCA CCGTGGTATG
GAAAAACTGG CGGAAACCCG TATGGGTTAC AACGAAGTGA CATTCCTGTC GGACCGCGTG
TGTGGTATCT GCGGCTTCGC CCACAGCACC GCCTACACCA CTTCCGTGGA AAACGCGATG
GGCATTCAGG TGCCGGAGCG TGCGCAAATG ATCCGCGCTA TTCTGCTGGA AGTGGAACGC
CTGCACTCGC ATCTGCTCAA CCTTGGCCTC GCCTGTCACT TTACCGGTTT TGACTCCGGC
TTTATGCAGT TCTTCCGCGT GCGTGAAACC TCCATGAAAA TGGCAGAGAT CCTTACCGGC
GCGCGCAAAA CTTACGGACT GAACCTGATC GGCGGGATTC GCCGCGATCT GCTCAAAGAG
GACATGATCC AGACCCGTCA ACTGGCGCAG CAGATGCGTC GTGACGTGCA GGAGCTGGTG
GACATGCTGC TGAGCACGCC GAATATGGAA CAGCGTACCG TGGGTATCGG CCGTCTGGAC
CCGGAAATTG CCCGTGACTT CAGTAATGTC GGCCCGATGG TGCGCGCCAG CGGTCACGCC
CGCGACACCC GCGCCGACCA CCCGTTTGTG GGCTATGGCC TGCTGCCGAT GGAAGTGCAC
AGCGAGCAGG GCTGCGATGT GATTTCTCGT CTGAAAGTCC GTATCAACGA AGTTTACACC
TCGCTGAATA TGATCGATTT CGGTCTGGAT AATCTGCCGG GCGGCCCGCT GATGGTGGAA
GGCTTTACCT ATATTCCGCA CCGTTTTGCG CTCGGCTTTG CCGAAGCGCC GCGTGGCGAT
GATATCCACT GGAGCATGAC CGGCGACAAC CAAAAGCTTT ACCGCTGGCG CTGTCGTGCG
GCGACCTACG CCAACTGGCC GACGCTGCGC TATATGCTGC GCGGCAACAC CGTCTCCGAT
GCGCCGCTGA TTATCGGCAG CCTCGACCCT TGCTACTCCT GTACCGACCG GATGACGGTG
GTCGATGTGC GTAAGAAGAA GAGCAAAGTC GTGCCGTACA AAGAACTTGA GCGCTACAGC
ATTGAGCGTA AAAACTCGCC GCTGAAATAA
 
Protein sequence
MSEEKLGQQY LAALHQAFPG VVLDEAWQTK DQLTITVKVN YLPEVVEFLY YQQGGWLSVL 
FGNDERQLCG HYAVYYVLSM EQGTKCWITV RVEVDANKLE FPSVTPRVPA AVWGEREVRD
MYGLIPVGLP DERRLVLPDD WPDELYPLRK DSMDYRQRPA PTTDAETYEF INELGDKKNN
VVPIGPLHVT SDEPGHFRLF VDGENIIDAD YRLFYVHRGM EKLAETRMGY NEVTFLSDRV
CGICGFAHST AYTTSVENAM GIQVPERAQM IRAILLEVER LHSHLLNLGL ACHFTGFDSG
FMQFFRVRET SMKMAEILTG ARKTYGLNLI GGIRRDLLKE DMIQTRQLAQ QMRRDVQELV
DMLLSTPNME QRTVGIGRLD PEIARDFSNV GPMVRASGHA RDTRADHPFV GYGLLPMEVH
SEQGCDVISR LKVRINEVYT SLNMIDFGLD NLPGGPLMVE GFTYIPHRFA LGFAEAPRGD
DIHWSMTGDN QKLYRWRCRA ATYANWPTLR YMLRGNTVSD APLIIGSLDP CYSCTDRMTV
VDVRKKKSKV VPYKELERYS IERKNSPLK