Gene SeD_A3159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3159 
SymbolhycE 
ID6872837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3038284 
End bp3039993 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content58% 
IMG OID642786180 
Productformate hydrogenlyase, subunit E 
Protein accessionYP_002216821 
Protein GI198245407 
COG category[C] Energy production and conversion 
COG ID[COG3261] Ni,Fe-hydrogenase III large subunit
[COG3262] Ni,Fe-hydrogenase III component G 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.146854 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAG AAAAATTAGG TCAACAATAC CTTGCGGCGC TGCACCAGGC GTTTCCGGGC 
GTCGTGCTGG ACGAAGCCTG GCAGACCAAA GATCAGCTGA CTATTACGGT GAAAGTAAAC
TATCTGCCGG AAGTGGTGGA GTTTCTTTAC TACCAGCAGG GTGGGTGGTT GTCGGTGCTG
TTCGGTAATG ACGAACGCCA GTTGTGCGGC CACTATGCCG TTTATTACGT GCTGTCGATG
GAGCAGGGCA CGAAGTGCTG GATTACCGTT CGCGTTGAAG TAGATGCCAA TAAGCTGGAA
TTCCCATCCG TTACGCCGCG CGTGCCGGCT GCCGTGTGGG GTGAGCGCGA AGTACGCGAC
ATGTACGGTT TAATCCCGGT CGGTCTGCCG GACGAGCGCC GTCTGGTGCT GCCGGACGAC
TGGCCGGATG AACTCTATCC GCTGCGTAAA GACAGCATGG ATTATCGTCA GCGCCCGGCG
CCGACCACCG ATGCGGAAAC CTACGAGTTC ATTAACGAGC TGGGTGACAA GAAAAATAAC
GTGGTGCCGA TTGGCCCGCT GCATGTCACC TCCGATGAAC CGGGCCACTT CCGTCTGTTC
GTCGATGGCG AAAACATTAT CGACGCCGAC TACCGCCTGT TCTACGTCCA CCGTGGTATG
GAAAAACTGG CGGAAACCCG CATGGGTTAT AACGAAGTCA CCTTCCTGTC GGATCGCGTG
TGTGGTATCT GCGGCTTCGC GCACAGCACT GCCTACACCA CTTCCGTGGA AAACGCGATG
GGCATTCAGG TGCCGGAGCG TGCGCAGATG ATCCGCGCCA TTCTGCTGGA AGTGGAGCGT
CTGCACTCGC ACCTGCTGAA CCTAGGCCTC GCCTGCCACT TTACCGGTTT TGACTCCGGC
TTTATGCAGT TCTTCCGCGT GCGTGAAACC TCCATGAAAA TGGCAGAGAT CCTTACCGGC
GCGCGCAAAA CTTACGGACT GAACCTGATC GGCGGGATTC GCCGCGATCT GCTCAAAGAG
GACATGATCC AGACCCGTCA ACTGGCGCAG CAGATGCGTC GTGACGTGCA GGAGCTGGTG
GACATGCTGC TGAGCACGCC GAATATGGAA CAGCGTACCG TGGGTATCGG CCGTCTGGAC
CCGGAAATTG CCCGTGACTT CAGTAATGTC GGCCCGATGG TGCGCGCCAG CGGTCACGCC
CGCGACACCC GCGCCGACCA CCCGTTTGTC GGCTACGGTC TGCTGCCGAT GGAAGTACAT
AGCGAGCAGG GGTGCGACGT GATTTCTCGT CTGAAAGTCC GTATCAACGA AGTTTACACC
TCGCTGAATA TGATCGATTT CGGTCTGGAT AATCTACCGG GCGGCCCGCT GATGGTGGAA
GGCTTTACCT ATATTCCGCA CCGTTTTGCG CTCGGCTTCG CTGAAGCGCC GCGTGGTGAC
GATATCCACT GGAGCATGAC CGGCGACAAC CAGAAGCTGT ACCGCTGGCG CTGCCGTGCG
GCGACCTACG CCAACTGGCC GACGCTGCGC TATATGCTGC GCGGCAACAC CGTCTCCGAT
GCGCCGCTGA TTATCGGCAG CCTCGACCCG TGCTACTCCT GTACCGACCG GATGACCGTG
GTCGATGTGC GTAAGAAGAA GAGCAAAGTC GTGCCGTACA AAGAACTTGA GCGCTACAGC
ATTGAGCGTA AAAACTCGCC GCTGAAATAA
 
Protein sequence
MSEEKLGQQY LAALHQAFPG VVLDEAWQTK DQLTITVKVN YLPEVVEFLY YQQGGWLSVL 
FGNDERQLCG HYAVYYVLSM EQGTKCWITV RVEVDANKLE FPSVTPRVPA AVWGEREVRD
MYGLIPVGLP DERRLVLPDD WPDELYPLRK DSMDYRQRPA PTTDAETYEF INELGDKKNN
VVPIGPLHVT SDEPGHFRLF VDGENIIDAD YRLFYVHRGM EKLAETRMGY NEVTFLSDRV
CGICGFAHST AYTTSVENAM GIQVPERAQM IRAILLEVER LHSHLLNLGL ACHFTGFDSG
FMQFFRVRET SMKMAEILTG ARKTYGLNLI GGIRRDLLKE DMIQTRQLAQ QMRRDVQELV
DMLLSTPNME QRTVGIGRLD PEIARDFSNV GPMVRASGHA RDTRADHPFV GYGLLPMEVH
SEQGCDVISR LKVRINEVYT SLNMIDFGLD NLPGGPLMVE GFTYIPHRFA LGFAEAPRGD
DIHWSMTGDN QKLYRWRCRA ATYANWPTLR YMLRGNTVSD APLIIGSLDP CYSCTDRMTV
VDVRKKKSKV VPYKELERYS IERKNSPLK