Gene SbBS512_E3155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E3155 
SymbolhycE 
ID6269263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2946449 
End bp2948158 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content57% 
IMG OID641727073 
Productformate hydrogenlyase, subunit E 
Protein accessionYP_001881532 
Protein GI187733129 
COG category[C] Energy production and conversion 
COG ID[COG0852] NADH:ubiquinone oxidoreductase 27 kD subunit
[COG3261] Ni,Fe-hydrogenase III large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAAG AAAAATTAGG TCAACATTAT CTCGCCGCGC TGAATGAGGC ATTTCCGGGC 
GTCGTGCTGG ACCACGCCTG GCAGACCAAA GATCAGCTGA CTATCACCGT AAAGGTGAAC
TATCTGCCGG AAGTGGTGGA GTTTCTCTAC TACAAGCAGG GGGGCTGGCT GTCGGTGCTT
TTTGGTAACG ACGAACGCAA ACTGAATGGT CATTACGCCG TTTATTACGT GCTGTCGATG
GAGAAGGGGA CTAAGTGCTG GGTAACGGTT CGCGTCGAAG TTGATGCCAA CAAACCGGAG
TATCCGTCCG TGACGCCGCG CGTTCCGGCG GCGGTGTGGG GCGAGCGCGA AGTGCGCGAT
ATGTACGGTT TGATTCCGGT TGGTCTGCCG GATGAACGCC GTCTGGTGCT GCCGGATGAC
TGGCCGGATG AACTTTATCC GCTGCGTAAA GACAGCATGG ATTATCGTCA GCGTCCGGCA
CCGACCACCG ATGCTGAAAC CTACGAGTTC ATCAACGAAC TGGGCGACAA GAAAAACAAC
GTCGTGCCGA TTGGTCCGCT GCACGTCACT TCTGACGAAC CGGGCCACTT CCGTCTGTTC
GTCGATGGCG AAAACATTAT CGACGCAGAC TACCGCTTGT TCTATGTCCA TCGCGGTATG
GAAAAACTGG CGGAAACCCG CATGGGTTAT AACGAAGTGA CCTTCCTCTC TGACCGTGTG
TGCGGGATCT GCGGCTTCGC CCACAGCACC GCCTACACCA CGTCGGTGGA AAACGCGATG
GGTATTCAGG TGCCAGAACG TGCGCAGATG ATCCGCGCCA TTCTGCTGGA GGTGGAACGC
CTGCACTCGC ATCTGCTCAA CCTCGGCCTC GCCTGCCACT TTACCGGCTT CGACTCCGGC
TTTATGCAGT TCTTCCGCGT GCGTGAAACC TCCATGAAAA TGGCCGAGAT CCTCACCGGT
GCGCGTAAAA CCTACGGCCT GAACCTGATC GGCGGGATTC GTCGCGATCT GTTGAAAGAT
GACATGATCC AGACCCGCCA GCTGGCGCAA CAGATGCGTC GTGAAGTGCA GGAGCTGGTG
GATGTGCTGC TGAGCACACC GAACATGGAA CAGCGCACCA TCGGCATTGG TCGTCTGGAC
CCGGAAATTG CCCGCGATTT CAGTAACGTT GGCCCGATGG TCCGCGCCAG CGGACACGCT
CGCGATACCC GCGCTGATCA CCCGTTTGTT GGTTATGGCC TGCTGCCAAT GGAAGTCCAC
AGCGAGCAGG GCTGCGACGT TATTTCGCGT CTGAAAGTGC GTATCAACGA AGTCTATACC
GCGCTGAACA TGATCGACTA CGGTCTGGAT AACCTGCCGG GTGGCCCGCT GATGGTGGAA
GGCTTTACCT ACATTCCGCA CCGTTTCGCG CTGGGCTTTG CCGAAGCGCC GCGCGGTGAT
GATATCCACT GGAGCATGAC TGGCGACAAC CAGAAGCTGT ACCGCTGGCG CTGCCGTGCC
GCGACCTACG CGAACTGGCC GACCCTGCGC TACATGCTGC GTGGCAACAC TGTTTCCGAT
GCGCCGCTGA TTATCGGTAG CCTGGACCCT TGCTACTCCT GTACCGACCG CATGACTGTG
GTCGATGTAC GTAAGAAGAA GAGCAAAGTG GTGCCGTACA AAGAACTCGA GCGCTACAGC
ATTGAGCGTA AAAACTCGCC GCTGAAATAA
 
Protein sequence
MSEEKLGQHY LAALNEAFPG VVLDHAWQTK DQLTITVKVN YLPEVVEFLY YKQGGWLSVL 
FGNDERKLNG HYAVYYVLSM EKGTKCWVTV RVEVDANKPE YPSVTPRVPA AVWGEREVRD
MYGLIPVGLP DERRLVLPDD WPDELYPLRK DSMDYRQRPA PTTDAETYEF INELGDKKNN
VVPIGPLHVT SDEPGHFRLF VDGENIIDAD YRLFYVHRGM EKLAETRMGY NEVTFLSDRV
CGICGFAHST AYTTSVENAM GIQVPERAQM IRAILLEVER LHSHLLNLGL ACHFTGFDSG
FMQFFRVRET SMKMAEILTG ARKTYGLNLI GGIRRDLLKD DMIQTRQLAQ QMRREVQELV
DVLLSTPNME QRTIGIGRLD PEIARDFSNV GPMVRASGHA RDTRADHPFV GYGLLPMEVH
SEQGCDVISR LKVRINEVYT ALNMIDYGLD NLPGGPLMVE GFTYIPHRFA LGFAEAPRGD
DIHWSMTGDN QKLYRWRCRA ATYANWPTLR YMLRGNTVSD APLIIGSLDP CYSCTDRMTV
VDVRKKKSKV VPYKELERYS IERKNSPLK