Gene SeSA_A3001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A3001 
SymbolhycE 
ID6519866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp2901088 
End bp2902797 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content58% 
IMG OID642748025 
Productformate hydrogenlyase subunit E 
Protein accessionYP_002115802 
Protein GI194737308 
COG category[C] Energy production and conversion 
COG ID[COG3261] Ni,Fe-hydrogenase III large subunit
[COG3262] Ni,Fe-hydrogenase III component G 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.33609 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.016492 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAG AAAAATTAGG TCAACAATAC CTTGCGGCGC TGCACCAGGC GTTTCCGGGC 
GTCGTGCTGG ACGAAGCCTG GCAGACCAAA GATCAGCTGA CTATTACGGT AAAAGTGAAC
TATCTGCCGG AAGTGGTGGA GTTTCTTTAC TACCAGCAGG GTGGGTGGCT GTCGGTGCTG
TTCGGTAATG ACGAACGCCA GTTGTGCGGC CACTATGCCG TTTATTACGT GCTGTCGATG
GAGCAGGGCA CGAAGTGCTG GATTACCGTC CGCGTTGAAG TGGATGCCAA TAAGCTGGAA
TTCCCATCCG TTACGCCGCG CGTGCCGGCT GCCGTGTGGG GTGAGCGCGA AGTACGCGAC
ATGTACGGTT TAATCCCGGT CGGTCTGCCG GACGAGCGCC GTCTGGTGCT GCCGGACGAC
TGGCCGGACG AGCTGTATCC GCTACGTAAA GACAGCATGG ATTATCGTCA GCGCCCGGCG
CCGACCACCG ATGCGGAAAC CTACGAGTTC ATTAACGAGC TGGGTGACAA GAAAAATAAC
GTGGTGCCGA TTGGCCCGCT GCATGTCACT TCCGATGAGC CGGGCCACTT CCGTCTGTTC
GTCGATGGCG AAAACATTAT CGACGCCGAC TACCGCCTGT TCTATGTCCA CCGCGGCATG
GAAAAACTGG CGGAAACCCG TATGGGTTAT AACGAAGTGA CGTTCCTGTC GGATCGCGTG
TGTGGTATCT GCGGCTTCGC GCACAGCACC GCCTACACCA CTTCCGTGGA AAACGCGATG
GGCATTCAGG TGCCGGAACG TGCGCAAATG ATCCGCGCTA TTCTGCTGGA AGTGGAACGT
CTGCACTCGC ATCTGCTCAA CCTCGGCCTG GCCTGCCACT TTACCGGCTT TGACTCCGGC
TTTATGCAGT TCTTCCGCGT GCGTGAAACC TCCATGAAGA TGGCGGAAAT ACTGACCGGG
GCGCGCAAAA CTTACGGTCT GAACCTGATC GGCGGGATTC GCCGCGATCT GCTGAAAGAA
GACATGATCC AGACCCGTCA ACTGGCGCAG CAGATGCGTC GTGACGTGCA GGAGCTGGTG
GACATGCTGC TGAGCACGCC GAATATGGAA CAGCGTACCG TGGGTATCGG CCGTCTGGAC
CCGGAAATTG CCCGTGACTT CAGTAATGTC GGCCCGATGG TGCGCGCCAG CGGTCACGCC
CGCGACACCC GCGCCGACCA CCCGTTTGTC GGTTACGGTC TGCTGCCGAT GGAAGTACAT
AGCGAGCAGG GCTGCGATGT GATTTCTCGT CTGAAAGTCC GTATCAACGA AGTCTACACC
TCGCTGAATA TGATCGATTT CGGTCTGGAT AATCTGCCGG GCGGCCCGCT GATGGTGGAA
GGCTTTACCT ATATTCCGCA CCGTTTTGCG CTCGGCTTCG CTGAAGCGCC GCGTGGTGAC
GATATCCACT GGAGCATGAC CGGCGACAAC CAGAAGCTGT ACCGCTGGCG CTGCCGTGCG
GCGACCTACG CCAACTGGCC GACGCTGCGC TATATGCTGC GCGGCAACAC CGTCTCCGAC
GCGCCGCTGA TTATCGGCAG CCTCGACCCG TGCTACTCCT GTACCGACCG GATGACCGTG
GTCGATGTGC GTAAGAAGAA GAGCAAAGTC GTGCCGTACA AAGAACTTGA GCGCTACAGC
ATTGAGCGTA AAAACTCGCC GCTGAAATAA
 
Protein sequence
MSEEKLGQQY LAALHQAFPG VVLDEAWQTK DQLTITVKVN YLPEVVEFLY YQQGGWLSVL 
FGNDERQLCG HYAVYYVLSM EQGTKCWITV RVEVDANKLE FPSVTPRVPA AVWGEREVRD
MYGLIPVGLP DERRLVLPDD WPDELYPLRK DSMDYRQRPA PTTDAETYEF INELGDKKNN
VVPIGPLHVT SDEPGHFRLF VDGENIIDAD YRLFYVHRGM EKLAETRMGY NEVTFLSDRV
CGICGFAHST AYTTSVENAM GIQVPERAQM IRAILLEVER LHSHLLNLGL ACHFTGFDSG
FMQFFRVRET SMKMAEILTG ARKTYGLNLI GGIRRDLLKE DMIQTRQLAQ QMRRDVQELV
DMLLSTPNME QRTVGIGRLD PEIARDFSNV GPMVRASGHA RDTRADHPFV GYGLLPMEVH
SEQGCDVISR LKVRINEVYT SLNMIDFGLD NLPGGPLMVE GFTYIPHRFA LGFAEAPRGD
DIHWSMTGDN QKLYRWRCRA ATYANWPTLR YMLRGNTVSD APLIIGSLDP CYSCTDRMTV
VDVRKKKSKV VPYKELERYS IERKNSPLK