Gene EcE24377A_3009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_3009 
SymbolhycE 
ID5587579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp3007953 
End bp3009662 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content57% 
IMG OID640926657 
Productformate hydrogenlyase, subunit E 
Protein accessionYP_001464033 
Protein GI157158306 
COG category[C] Energy production and conversion 
COG ID[COG0852] NADH:ubiquinone oxidoreductase 27 kD subunit
[COG3261] Ni,Fe-hydrogenase III large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAAG AAAAATTAGG TCAACATTAT CTCGCCGCGC TGAATGAGGC ATTTCCGGGC 
GTCGTGCTGG ACCACGCCTG GCAGACCAAA GATCAGCTGA CTGTCACCGT AAAGGTGAAC
TACCTGCCGG AAGTGGTGGA GTTTCTTTAC TACAAACAGG GTGGCTGGCT GTCGGTGCTG
TTTGGTAACG ACGAACGCAA ACTGAATGGT CATTACGCCG TTTATTACGT GCTGTCGATG
GAGAAGGGCA CTAAGTGTTG GGTAACGGTT CGCGTCGAAG TTGACGCCAA CAAACCGGAG
TATCCGTCCG TGACGCCGCG CGTTCCGGCG GCTGTGTGGG GCGAGCGCGA AGTGCGCGAT
ATGTACGGTT TGATTCCGGT TGGTCTGCCG GATGAACGTC GTCTGGTGCT GCCGGATGAC
TGGCCGGATG AACTTTATCC GCTGCGTAAA GACAGCATGG ATTATCGTCA GCGTCCGGCG
CCGACCACCG ATGCTGAAAC CTACGAGTTC ATCAATGAAC TGGGTGACAA GAAAAACAAC
GTCGTGCCGA TTGGTCCGTT GCACGTCACT TCCGACGAAC CGGGTCACTT CCGTCTGTTC
GTCGATGGCG AAAATATTAT CGACGCCGAC TACCGCCTGT TCTACGTCCA TCGCGGCATG
GAAAAACTGG CGGAAACCCG TATGGGGTAC AACGAAGTGA CCTTCCTCTC TGACCGTGTG
TGCGGCATCT GCGGCTTTGC TCACAGCACC GCCTACACCA CGTCGGTGGA AAACGCGATG
GGTATTCAGG TGCCAGAACG TGCGCAGATG ATCCGCGCCA TTCTGCTGGA GGTGGAACGC
CTGCACTCGC ATCTGCTCAA CCTCGGCCTG GCCTGTCACT TTACTGGCTT CGACTCCGGC
TTTATGCAGT TCTTCCGCGT GCGTGAAACC TCCATGAAAA TGGCAGAGAT CCTTACCGGT
GCGCGTAAAA CCTACGGCCT GAACCTGATC GGCGGGATTC GTCGCGATCT GCTGAAAGAT
GACATGATCC AGACCCGCCA GCTGGCGCAA CAAATGCGTC GTGAAGTGCA GGAGCTGGTG
GATGTGCTGC TGAGTACGCC GAACATGGAA CAGCGCACTG TCGGCATTGG TCGTCTGGAC
CCGGAAATCG CCCGCGATTT CAGTAACGTT GGCCCGATGG TCCGCGCCAG CGGACACGCT
CGCGATACCC GCGCCGATCA CCCGTTTGTT GGTTATGGCC TGCTGCCAAT GGAAGTCCAC
AGCGAGCAGG GCTGCGACGT TATTTCCCGT CTGAAAGTGC GTATCAACGA AGTCTATACC
GCGCTGAACA TGATCGACTA CGGTCTGGAT AACCTGCCGG GCGGTCCGTT GATGGTGGAA
GGCTTTACCT ACATTCCGCA CCGTTTCGCG CTGGGCTTTG CCGAAGCGCC GCGCGGCGAC
GATATCCACT GGAGCATGAC CGGCGACAAC CAGAAGCTGT ACCGCTGGCG CTGCCGTGCC
GCGACCTACG CGAACTGGCC GACCCTGCGC TACATGCTGC GTGGCAACAC CGTTTCTGAT
GCGCCGCTGA TTATCGGTAG CCTGGACCCT TGCTACTCCT GTACCGACCG CATGACGGTG
GTCGATGTAC GTAAGAAGAA GAGCAAAGTG GTGCCGTACA AAGAACTCGA GCGTTACAGC
ATTGAGCGTA AAAACTCGCC GCTGAAATAA
 
Protein sequence
MSEEKLGQHY LAALNEAFPG VVLDHAWQTK DQLTVTVKVN YLPEVVEFLY YKQGGWLSVL 
FGNDERKLNG HYAVYYVLSM EKGTKCWVTV RVEVDANKPE YPSVTPRVPA AVWGEREVRD
MYGLIPVGLP DERRLVLPDD WPDELYPLRK DSMDYRQRPA PTTDAETYEF INELGDKKNN
VVPIGPLHVT SDEPGHFRLF VDGENIIDAD YRLFYVHRGM EKLAETRMGY NEVTFLSDRV
CGICGFAHST AYTTSVENAM GIQVPERAQM IRAILLEVER LHSHLLNLGL ACHFTGFDSG
FMQFFRVRET SMKMAEILTG ARKTYGLNLI GGIRRDLLKD DMIQTRQLAQ QMRREVQELV
DVLLSTPNME QRTVGIGRLD PEIARDFSNV GPMVRASGHA RDTRADHPFV GYGLLPMEVH
SEQGCDVISR LKVRINEVYT ALNMIDYGLD NLPGGPLMVE GFTYIPHRFA LGFAEAPRGD
DIHWSMTGDN QKLYRWRCRA ATYANWPTLR YMLRGNTVSD APLIIGSLDP CYSCTDRMTV
VDVRKKKSKV VPYKELERYS IERKNSPLK