Gene B21_02545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02545 
SymbolhypE 
ID8113240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2692472 
End bp2693440 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content57% 
IMG OID644848744 
Producthypothetical protein 
Protein accessionYP_003000317 
Protein GI251786013 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCAAT TAATCAACAG CCTGTTTATG GAAGCCTTTG CCAACCCGTG GCTGGCAGAG 
CAGGAAGATC AGGCACGTCT TGATCTGGCG CAGCTGGTAG CGGAAGGCGA CCGTCTGGCG
TTCTCCACCG ACAGTTACGT TATTGACCCG CTGTTCTTCC CTGGCGGTAA TATCGGCAAG
CTGGCGATTT GCGGCACAGC CAATGACGTT GCGGTCAGTG GCGCTATTCC GCGCTATCTC
TCCTGTGGCT TTATCCTCGA AGAAGGATTG CCGATGGAGA CACTGAAAGC CGTAGTGACC
AGCATGGCAG AAACCGCCCG CGCGGCAGGC ATTGCCATCG TTACTGGCGA TACTAAAGTG
GTGCAGCGCG GCGCGGTAGA TAAACTGTTT ATCAACACCG CTGGCATGGG CGCAATTCCG
GCGAATATTC ACTGGGGCGC ACAGACGCTA ACCGCAGGCG ATGTATTGCT GGTGAGCGGT
ACACTCGGCG ACCACGGGGC GACTATCCTT AACCTGCGTG AGCAGCTGGG GCTGGATGGC
GAACTGGTCA GCGACTGCGC GGTGCTGACG CCGCTTATTC AGACGCTGCG TGACATTCCC
GGCGTGAAAG CGCTGCGTGA TGCCACCCGT GGTGGTGTAA ACGCGGTGGT TCATGAGTTC
GCGGCAGCCT GCGGTTGTGG TATTGAACTT TCAGAAGCGG CACTGCCTGT TAAACCTGCC
GTGCGTGGCG TTTGCGAATT GCTGGGACTG GACGCCCTGA ACTTTGCCAA CGAAGGCAAA
CTAGTAATAG CTGTTGAACG CAACGCGGCA GAGCAAGTGC TGGCAGCGTT ACATTCCCAT
CCACTGGGGA AAGACGCGGC GCTGATTGGT GAAGTGGTGG AACGTAAAGG TGTTCGTCTT
GCCGGTCTGT ATGGCGTGAA ACGAACCCTC GATTTACCAC ACGCCGAACC GCTTCCGCGT
ATATGCTAA
 
Protein sequence
MQQLINSLFM EAFANPWLAE QEDQARLDLA QLVAEGDRLA FSTDSYVIDP LFFPGGNIGK 
LAICGTANDV AVSGAIPRYL SCGFILEEGL PMETLKAVVT SMAETARAAG IAIVTGDTKV
VQRGAVDKLF INTAGMGAIP ANIHWGAQTL TAGDVLLVSG TLGDHGATIL NLREQLGLDG
ELVSDCAVLT PLIQTLRDIP GVKALRDATR GGVNAVVHEF AAACGCGIEL SEAALPVKPA
VRGVCELLGL DALNFANEGK LVIAVERNAA EQVLAALHSH PLGKDAALIG EVVERKGVRL
AGLYGVKRTL DLPHAEPLPR IC