Gene B21_02536 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02536 
SymbolhycE 
ID8113220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2683937 
End bp2685646 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content57% 
IMG OID644848736 
Producthypothetical protein 
Protein accessionYP_003000309 
Protein GI251786005 
COG category[C] Energy production and conversion 
COG ID[COG3261] Ni,Fe-hydrogenase III large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.576982 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAAG AAAAATTAGG TCAACATTAT CTCGCCGCGC TGAATGAGGC ATTTCCGGGC 
GTCGTGCTGG ACCACGCCTG GCAGACCAAA GATCAGCTGA CTGTCACCGT AAAGGTGAAC
TACCTGCCGG AAGTGGTGGA GTTTCTTTAC TACAAACAGG GTGGCTGGCT GTCGGTGCTG
TTTGGTAACG ACGAACGCAA ACTGAATGGT CATTACGCCG TTTACTACGT GCTGTCGATG
GAGAAGGGCA CTAAGTGTTG GATTACGGTT CGCGTCGAAG TTGACGCCAA CAAACCGGAA
TATCCGTCCG TGACACCGCG CGTTCCGGCG GCGGTGTGGG GCGAGCGCGA AGTGCGCGAT
ATGTACGGTT TGATTCCGGT TGGTCTGCCG GATGAACGCC GTCTGGTGCT GCCGGATGAC
TGGCCGGATG AACTTTATCC GCTGCGTAAA GACAGCATGG ATTATCGTCA GCGTCCGGCG
CCGACCACCG ATGCTGAAAC CTACGAGTTC ATCAACGAAC TGGGCGACAA GAAAAACAAC
GTCGTGCCGA TTGGTCCGCT GCACGTCACT TCTGATGAAC CGGGCCACTT CCGTCTGTTC
GTCGATGGCG AAAACATTAT CGACGCCGAC TACCGCCTGT TCTATGTCCA TCGCGGTATG
GAAAAACTGG CGGAAACCCG CATGGGTTAT AACGAAGTGA CCTTCCTCTC TGACCGTGTG
TGCGGGATCT GCGGCTTTGC TCACAGCACC GCCTACACCA CGTCGGTGGA AAACGCGATG
GGTATTCAGG TGCCAGAACG TGCGCAGATG ATCCGCGCCA TTCTGCTGGA GGTGGAACGC
CTGCACTCGC ATCTGCTCAA CCTCGGCCTC GCCTGCCACT TTACCGGCTT CGACTCCGGC
TTTATGCAGT TCTTCCGCGT GCGTGAAACC TCCATGAAAA TGGCAGAGAT CCTTACCGGT
GCGCGTAAAA CCTACGGCCT GAACCTGATC GGCGGGATTC GTCGCGATCT GCTGAAAGAT
GACATGATCC AGACCCGTCA ACTGGCGCAA CAGATGCGTC GTGAAGTGCA GGAGCTGGTG
GATGTGCTGC TGAGTACGCC GAACATGGAA CAGCGCACTG TCGGCATTGG TCGTCTGGAC
CCGGAAATCG CTCGCGACTT CAGTAACGTT GGCCCGATGG TCCGCGCCAG CGGTCACGCC
CGCGATACCC GCGCCGATCA CCCATTTGTC GGCTATGGCC TGCTGCCAAT GGAAGTCCAC
AGCGAGCAGG GCTGCGACGT TATTTCCCGT CTGAAAGTGC GTATCAACGA AGTCTATACC
GCGCTGAACA TGATCGACTA CGGTCTGGAT AACCTGCCGG GTGGCCCACT GATGGTGGAA
GGCTTTACCT ACATTCCGCA CCGCTTTGCG CTGGGCTTTG CCGAAGCGCC GCGCGGCGAT
GATATCCACT GGAGCATGAC CGGCGACAAC CAGAAGCTGT ACCGCTGGCG CTGCCGTGCC
GCGACCTACG CGAACTGGCC GACCCTGCGC TACATGCTGC GCGGAAACAC CGTTTCCGAT
GCGCCGCTGA TTATCGGTAG CCTCGACCCT TGCTACTCCT GTACCGACCG CATGACCGTG
GTCGATGTGC GTAAGAAAAA GAGCAAAGTG GTGCCGTACA AAGAACTCGA GCGTTACAGC
ATTGAGCGTA AAAACTCGCC GCTGAAATAA
 
Protein sequence
MSEEKLGQHY LAALNEAFPG VVLDHAWQTK DQLTVTVKVN YLPEVVEFLY YKQGGWLSVL 
FGNDERKLNG HYAVYYVLSM EKGTKCWITV RVEVDANKPE YPSVTPRVPA AVWGEREVRD
MYGLIPVGLP DERRLVLPDD WPDELYPLRK DSMDYRQRPA PTTDAETYEF INELGDKKNN
VVPIGPLHVT SDEPGHFRLF VDGENIIDAD YRLFYVHRGM EKLAETRMGY NEVTFLSDRV
CGICGFAHST AYTTSVENAM GIQVPERAQM IRAILLEVER LHSHLLNLGL ACHFTGFDSG
FMQFFRVRET SMKMAEILTG ARKTYGLNLI GGIRRDLLKD DMIQTRQLAQ QMRREVQELV
DVLLSTPNME QRTVGIGRLD PEIARDFSNV GPMVRASGHA RDTRADHPFV GYGLLPMEVH
SEQGCDVISR LKVRINEVYT ALNMIDYGLD NLPGGPLMVE GFTYIPHRFA LGFAEAPRGD
DIHWSMTGDN QKLYRWRCRA ATYANWPTLR YMLRGNTVSD APLIIGSLDP CYSCTDRMTV
VDVRKKKSKV VPYKELERYS IERKNSPLK