Gene Information |
Locus tag | SNSL254_A3052 |
Symbol | hycE |
ID | 6485677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2970395 |
End bp | 2972104 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642738367 |
Product | formate hydrogenlyase subunit E |
Protein accession | YP_002042091 |
Protein GI | 194442683 |
COG category | [C] Energy production and conversion |
COG ID | [COG3261] Ni,Fe-hydrogenase III large subunit; [COG3262] Ni,Fe-hydrogenase III component G |
TIGRFAM ID | |
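The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state either convention, but both are consistent with its values. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2970395
END_BP = 2972104


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one untranslated stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1710, matching "Gene Length"
    print(protein_length(length))  # 569, matching "Protein Length"
```

Because the gene lies on the minus strand, the start and end positions refer to the replicon's forward coordinates. The length calculation is the same for either strand.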
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 0.309808 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAG AAAAATTAGG TCAACAATAC CTTGCGGCGC TGCACCAGGC GTTTCCGGGC GTCGTGCTGG ACGAAGCCTG GCAGACCAAA GATCAGCTGA CTATTACGGT AAAAGTGAAC TATCTGCCGG AAGTGGTGGA GTTTCTTTAC TACCAGCAGG GTGGGTGGCT GTCGGTGCTG TTCGGTAATG ACGAACGCCA GTTGTGCGGC CACTATGCCG TTTATTACGT GCTGTCGATG GAGCAGGGCA CGAAGTGCTG GATTACCGTC CGCGTTGAAG TGGATGCCAA TAAGCTGGAA TTCCCATCCG TTACGCCGCG CGTGCCGGCT GCCGTGTGGG GTGAGCGCGA AGTACGCGAC ATGTACGGTT TAATCCCGGT CGGTCTGCCG GACGAGCGCC GTCTGGTGCT GCCGGACGAC TGGCCGGATG AACTCTATCC GCTGCGTAAA GACAGCATGG ATTATCGTCA GCGCCCGGCG CCGACCACCG ATGCGGAAAC CTACGAGTTC ATTAACGAGC TGGGTGACAA GAAAAATAAC GTGGTGCCGA TTGGCCCGCT GCATGTCACT TCTGATGAAC CGGGCCACTT CCGTCTGTTC GTCGATGGCG AAAACATTAT CGACGCCGAC TACCGCCTGT TCTACGTCCA CCGTGGTATG GAAAAACTGG CGGAAACCCG TATGGGTTAC AACGAAGTGA CATTCCTGTC GGACCGCGTG TGTGGTATCT GCGGCTTCGC CCACAGCACC GCCTACACCA CTTCCGTGGA AAACGCGATG GGCATTCAGG TGCCGGAGCG TGCGCAAATG ATCCGCGCTA TTCTGCTGGA AGTGGAACGC CTGCACTCGC ATCTGCTCAA CCTTGGCCTC GCCTGTCACT TTACCGGTTT TGACTCCGGC TTTATGCAGT TCTTCCGCGT GCGTGAAACC TCCATGAAAA TGGCAGAGAT CCTTACCGGC GCGCGCAAAA CTTACGGACT GAACCTGATC GGCGGGATTC GCCGCGATCT GCTCAAAGAG GACATGATCC AGACCCGTCA ACTGGCGCAG CAGATGCGTC GTGACGTGCA GGAGCTGGTG GACATGCTGC TGAGCACGCC GAATATGGAA CAGCGTACCG TGGGTATCGG CCGTCTGGAC CCGGAAATTG CCCGTGACTT CAGTAATGTC GGCCCGATGG TGCGCGCCAG CGGTCACGCC CGCGACACCC GCGCCGACCA CCCGTTTGTG GGCTATGGCC TGCTGCCGAT GGAAGTGCAC AGCGAGCAGG GCTGCGATGT GATTTCTCGT CTGAAAGTCC GTATCAACGA AGTTTACACC TCGCTGAATA TGATCGATTT CGGTCTGGAT AATCTGCCGG GCGGCCCGCT GATGGTGGAA GGCTTTACCT ATATTCCGCA CCGTTTTGCG CTCGGCTTTG CCGAAGCGCC GCGTGGCGAT GATATCCACT GGAGCATGAC CGGCGACAAC CAAAAGCTTT ACCGCTGGCG CTGTCGTGCG GCGACCTACG CCAACTGGCC GACGCTGCGC TATATGCTGC GCGGCAACAC CGTCTCCGAT GCGCCGCTGA TTATCGGCAG CCTCGACCCT TGCTACTCCT GTACCGACCG GATGACGGTG GTCGATGTGC GTAAGAAGAA GAGCAAAGTC GTGCCGTACA AAGAACTTGA GCGCTACAGC ATTGAGCGTA AAAACTCGCC GCTGAAATAA
|
Protein sequence | MSEEKLGQQY LAALHQAFPG VVLDEAWQTK DQLTITVKVN YLPEVVEFLY YQQGGWLSVL FGNDERQLCG HYAVYYVLSM EQGTKCWITV RVEVDANKLE FPSVTPRVPA AVWGEREVRD MYGLIPVGLP DERRLVLPDD WPDELYPLRK DSMDYRQRPA PTTDAETYEF INELGDKKNN VVPIGPLHVT SDEPGHFRLF VDGENIIDAD YRLFYVHRGM EKLAETRMGY NEVTFLSDRV CGICGFAHST AYTTSVENAM GIQVPERAQM IRAILLEVER LHSHLLNLGL ACHFTGFDSG FMQFFRVRET SMKMAEILTG ARKTYGLNLI GGIRRDLLKE DMIQTRQLAQ QMRRDVQELV DMLLSTPNME QRTVGIGRLD PEIARDFSNV GPMVRASGHA RDTRADHPFV GYGLLPMEVH SEQGCDVISR LKVRINEVYT SLNMIDFGLD NLPGGPLMVE GFTYIPHRFA LGFAEAPRGD DIHWSMTGDN QKLYRWRCRA ATYANWPTLR YMLRGNTVSD APLIIGSLDP CYSCTDRMTV VDVRKKKSKV VPYKELERYS IERKNSPLK
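The gene and protein sequences above can be compared programmatically. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is shown in coding orientation, which fits its ATG start and TAA end. It relies on translation table 11 assigning the same amino acids to codons as the standard code, differing only in permitted start codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid map in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGTCTGAAG AAAAATTAGG TCAACAATAC"))  # MSEEKLGQQY
```

Passing the full gene sequence to `translate` and comparing the result with `clean()` of the protein sequence checks the two fields against each other. `gc_fraction` on the full gene sequence can be compared with the reported GC content.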