Gene EcSMS35_2846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2846 
SymbolhycE 
ID6146810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2920654 
End bp2922384 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content57% 
IMG OID641617715 
Productformate hydrogenlyase, subunit E 
Protein accessionYP_001744870 
Protein GI170681957 
COG category[C] Energy production and conversion 
COG ID[COG3261] Ni,Fe-hydrogenase III large subunit
[COG3262] Ni,Fe-hydrogenase III component G 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.15789 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTAAAG AGAGTTTGAG CATGTCTGAA GAAAAATTAG GTCAACATTA TCTCGCCGCG 
CTGAATGAGG CATTTCCGGG CGTCGTGCTG GACCACGCCT GGCAGACCAA AGATCAGCTA
ACCGTCACCG TGAAGGTGAA CTACCTGCCG GAAGTGGTGG AGTTTCTCTA CTACAAACAA
GGGGGCTGGC TGTCGGTGCT GTTTGGTAAC GACGAACGCA AACTGAATGG TCATTACGCC
GTTTACTACG TGCTGTCGAT GGAGAAGGGC ACCAAATGCT GGATAACCGT GCGCGTCGAA
GTTGACGCCA ACAAACCGGA ATATCCGTCC GTGACACCGC GCGTTCCGGC GGCGGTGTGG
GGCGAGCGCG AAGTGCGCGA TATGTACGGT TTGATTCCGG TAGGGCTTCC GGATGAACGT
CGTCTGGTGC TGCCGGATGA CTGGCCGGAT GAACTTTATC CGCTGCGTAA AGACAGCATG
GATTATCGTC AGCGTCCGGC ACCGACTACC GATGCTGAAA CCTACGAGTT CATCAACGAA
CTGGGCGACA AGAAAAACAA TGTCGTGCCG ATTGGTCCGC TGCACGTCAC TTCTGACGAA
CCGGGTCACT TCCGTCTGTT CGTCGATGGC GAAAACATTA TCGACGCCGA CTACCGCCTG
TTCTATGTCC ATCGCGGCAT GGAAAAACTG GCGGAAACCC GCATGGGTTA CAACGAAGTG
ACCTTCCTCT CTGACCGTGT GTGCGGGATC TGCGGCTTTG CTCACAGCAC CGCCTACACC
ACGTCGGTGG AAAACGCGAT GGGTATTCAG GTGCCAGAAC GTGCGCAGAT GATCCGCGCC
ATTCTGCTGG AGGTGGAACG CCTGCACTCG CATCTGCTCA ACCTCGGCCT GGCCTGTCAC
TTTACCGGCT TCGACTCCGG CTTTATGCAG TTCTTCCGCG TGCGTGAAAC CTCCATGAAA
ATGGCAGAGA TCCTTACCGG TGCACGTAAA ACCTACGGCC TGAACCTGAT CGGCGGGATT
CGTCGCGATC TGCTGAAAGA CGACATGATC CAGACCCGCC AGCTGGCACA ACAGATGCGT
CGTGAAGTGC AGGAGCTGGT GGATGTGCTG CTGAGCACAC CGAACATGGA ACAACGCACC
GTCGGCATTG GTCGTCTGGA CCCGGAAATC GCTCGCGACT TCAGTAACGT TGGCCCGATG
GTCCGTGCCA GCGGTCATGC CCGTGATACC CGTGCCGATC ACCCGTTTGT TGGTTATGGC
CTGCTGCCAA TGGAAGTCCA CAGCGAGCAG GGCTGCGACG TTATTTCCCG TCTGAAAGTG
CGTATCAACG AAGTCTATAC CGCGCTGAAT ATGATCGATT ACGGTCTGGA TAACCTGCCG
GGCGGCCCGC TGATGGTGGA AGGCTTTACC TACATTCCGC ACCGTTTCGC GCTGGGCTTT
GCCGAAGCGC CGCGCGGCGA CGATATCCAC TGGAGCATGA CCGGCGACAA CCAGAAGCTG
TACCGCTGGC GCTGCCGTGC GGCGACCTAC GCAAACTGGC CGACCCTGCG CTACATGCTG
CGCGGCAACA CCGTTTCCGA TGCGCCGTTG ATTATCGGTA GCCTGGACCC TTGCTACTCC
TGTACCGACC GCATGACCGT GGTCGATGTA CGTAAGAAGA AGAGCAAAGT GGTGCCGTAC
AAAGAACTCG AGCGCTACAG CATTGAGCGT AAAAACTCGC CGCTGAAATA A
 
Protein sequence
MIKESLSMSE EKLGQHYLAA LNEAFPGVVL DHAWQTKDQL TVTVKVNYLP EVVEFLYYKQ 
GGWLSVLFGN DERKLNGHYA VYYVLSMEKG TKCWITVRVE VDANKPEYPS VTPRVPAAVW
GEREVRDMYG LIPVGLPDER RLVLPDDWPD ELYPLRKDSM DYRQRPAPTT DAETYEFINE
LGDKKNNVVP IGPLHVTSDE PGHFRLFVDG ENIIDADYRL FYVHRGMEKL AETRMGYNEV
TFLSDRVCGI CGFAHSTAYT TSVENAMGIQ VPERAQMIRA ILLEVERLHS HLLNLGLACH
FTGFDSGFMQ FFRVRETSMK MAEILTGARK TYGLNLIGGI RRDLLKDDMI QTRQLAQQMR
REVQELVDVL LSTPNMEQRT VGIGRLDPEI ARDFSNVGPM VRASGHARDT RADHPFVGYG
LLPMEVHSEQ GCDVISRLKV RINEVYTALN MIDYGLDNLP GGPLMVEGFT YIPHRFALGF
AEAPRGDDIH WSMTGDNQKL YRWRCRAATY ANWPTLRYML RGNTVSDAPL IIGSLDPCYS
CTDRMTVVDV RKKKSKVVPY KELERYSIER KNSPLK