Gene SeHA_C3036 details

Gene Information

Locus tag: SeHA_C3036
Symbol: hycE
ID: 6489569
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 2968405
End bp: 2970114
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 58%
IMG OID: 642743191
Product: formate hydrogenlyase subunit E
Protein accession: YP_002046810
Protein GI: 194448570
COG category: [C] Energy production and conversion
COG ID: [COG3261] Ni,Fe-hydrogenase III large subunit
        [COG3262] Ni,Fe-hydrogenase III component G
TIGRFAM ID:
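
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not contribute a residue; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2968405
end_bp = 2970114

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it adds no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1710
print(protein_length)  # 569
```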


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.000143989
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCTGAAG AAAAATTAGG TCAACAATAC CTTGCGGCGC TGCACCAGGC GTTTCCGGGC 
GTCGTGCTGG ACGAAGCCTG GCAGACCAAA GATCAGCTGA CTATTACGGT AAAAGTGAAC
TATCTGCCGG AAGTGGTGGA GTTTCTTTAC TACCAGCAGG GTGGGTGGTT GTCGGTGCTG
TTCGGTAATG ACGAACGCCA GTTGTGCGGC CACTATGCCG TTTATTACGT GCTGTCGATG
GAGCAGGGCA CGAAGTGCTG GATTACCGTT CGCGTTGAAG TGGATGCCAA TAAGCTGGAA
TTCCCATCCG TTACGCCGCG CGTGCCGGCT GCCGTGTGGG GTGAGCGCGA AGTACGCGAC
ATGTACGGTT TAATCCCGGT CGGTCTGCCG GACGAGCGCC GTCTGGTGCT GCCGGACGAC
TGGCCGGATG AACTCTATCC GCTGCGTAAA GACAGTATGG ATTATCGTCA GCGCCCGGCG
CCGACCACCG ATGCGGAAAC CTACGAGTTC ATTAACGAGC TGGGTGACAA GAAAAATAAC
GTGGTGCCGA TTGGCCCGCT GCATGTCACC TCCGATGAAC CGGGCCACTT CCGTCTGTTC
GTCGATGGCG AAAACATTAT CGACGCCGAC TACCGCCTGT TCTACGTCCA CCGTGGTATG
GAAAAACTGG CGGAAACCCG CATGGGTTAT AACGAAGTCA CCTTCCTGTC GGATCGCGTG
TGTGGTATCT GCGGCTTCGC GCACAGCACT GCCTACACCA CTTCCGTGGA AAACGCGATG
GGCATTCAGG TGCCGGAGCG TGCGCAGATG ATCCGCGCCA TTCTGCTGGA AGTGGAGCGT
CTGCACTCGC ACCTGCTGAA CCTAGGCCTC GCCTGCCACT TCACCGGCTT TGACTCCGGC
TTTATGCAGT TCTTCCGCGT GCGTGAAACC TCCATGAAGA TGGCGGAGAT ACTGACCGGG
GCGCGCAAAA CTTACGGACT GAACCTGATC GGCGGGATTC GCCGCGATCT GCTCAAAGAA
GACATGATCC AGACCCGTCA ACTGGCGCAG CAGATGCGTC GTGACGTGCA GGAGCTGGTG
GACATGCTGC TGAGCACGCC GAATATGGAA CAGCGTACCG TGGGTATCGG CCGTCTGGAC
CCGGAAATTG CCCGTGACTT CAGTAATGTC GGCCCGATGG TGCGCGCCAG CGGTCACGCC
CGCGACACCC GCGCCGACCA CCCGTTTGTC GGCTACGGTC TGCTGCCGAT GGAAGTGCAC
AGCGAGCAGG GCTGCGATGT GATTTCTCGT CTGAAAGTCC GTATCAACGA AGTTTACACC
TCGCTGAATA TGATCGATTT CGGTCTGGAT AATCTGCCGG GCGGCCCGCT GATGGTGGAA
GGCTTTACCT ATATTCCGCA CCGTTTTGCG CTCGGCTTCG CTGAAGCGCC GCGTGGTGAC
GATATCCACT GGAGCATGAC CGGCGACAAC CAGAAGCTGT ACCGCTGGCG CTGCCGTGCC
GCGACCTACG CCAACTGGCC GACGCTGCGC TATATGCTGC GCGGCAACAC CGTCTCCGAC
GCGCCGCTGA TTATCGGCAG CCTCGACCCG TGCTACTCCT GTACCGACCG GATGACCGTG
GTCGATGTGC GTAAGAAGAA GAGCAAAGTC GTGCCGTACA AAGAACTTGA GCGCTACAGC
ATTGAGCGTA AAAACTCGCC GCTGAAATAA
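
The gene sequence is displayed in blocks of ten bases, so the whitespace has to be stripped before any analysis. As a minimal sketch, the hypothetical helpers below (`clean_sequence` and `gc_content` are not part of the record) remove the spacing and compute the percentage of G and C bases. Applied to the full sequence, this can be compared against the GC content reported in the record.

```python
def clean_sequence(raw):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, as formatted in the record.
first_line = "ATGTCTGAAG AAAAATTAGG TCAACAATAC CTTGCGGCGC TGCACCAGGC GTTTCCGGGC"
print(len(clean_sequence(first_line)))  # 60
```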
 
Protein sequence
MSEEKLGQQY LAALHQAFPG VVLDEAWQTK DQLTITVKVN YLPEVVEFLY YQQGGWLSVL 
FGNDERQLCG HYAVYYVLSM EQGTKCWITV RVEVDANKLE FPSVTPRVPA AVWGEREVRD
MYGLIPVGLP DERRLVLPDD WPDELYPLRK DSMDYRQRPA PTTDAETYEF INELGDKKNN
VVPIGPLHVT SDEPGHFRLF VDGENIIDAD YRLFYVHRGM EKLAETRMGY NEVTFLSDRV
CGICGFAHST AYTTSVENAM GIQVPERAQM IRAILLEVER LHSHLLNLGL ACHFTGFDSG
FMQFFRVRET SMKMAEILTG ARKTYGLNLI GGIRRDLLKE DMIQTRQLAQ QMRRDVQELV
DMLLSTPNME QRTVGIGRLD PEIARDFSNV GPMVRASGHA RDTRADHPFV GYGLLPMEVH
SEQGCDVISR LKVRINEVYT SLNMIDFGLD NLPGGPLMVE GFTYIPHRFA LGFAEAPRGD
DIHWSMTGDN QKLYRWRCRA ATYANWPTLR YMLRGNTVSD APLIIGSLDP CYSCTDRMTV
VDVRKKKSKV VPYKELERYS IERKNSPLK