Gene ECH74115_1033 details

Gene Information

Locus tag: ECH74115_1033
Symbol: hcr
ID: 6971411
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1045650
End bp: 1046618
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 55%
IMG OID: 643385046
Product: HCP oxidoreductase, NADH-dependent
Protein accession: YP_002269546
Protein GI: 209399450
COG category: [C] Energy production and conversion
COG ID: [COG0633] Ferredoxin; [COG1018] Flavodoxin reductases (ferredoxin-NADPH reductases) family 1
TIGRFAM ID:
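The start, end, gene length and protein length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1045650
END_BP = 1046618
GENE_LENGTH_BP = 969
PROTEIN_LENGTH_AA = 322


def span_length(start: int, end: int) -> int:
    """Length of a closed interval (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1046618 - 1045650 + 1 gives 969 bp, and 969 / 3 - 1 gives 322 aa, which agrees with the listed lengths.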


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATGC CAACGAATCA ATGCCCGTGG CGGATGCAGG TTCATCACAT TAAGCAAGAA 
ACGCCGGATG TGTGGACGAT TTCCCTGATT TGCCACGATT ACTACCCATA CCGCGCCGGG
CAATATGCAC TGGTCAGCGT GCGTAACTCA GCGGAAACGC TGCGTGCTTA CACCATTTCC
TCCACGCCAG GCGTGAGTGA ATATATCACC CTGACCGTGC GGCGGATTGA TGACGGTGTC
GGCTCCCAGT GGCTGACCCG CGATGTAAAA CGCGGTGATT ATCTCTGGCT TTCAGACGCG
ATGGGGGAAT TTACCTGCGA CGATAAAGCA GAAGATAAAT TCCTGTTGCT GGCTGCAGGC
TGCGGCGTCA CGCCGATTAT GTCGATGCGT CGCTGGCTGG CGAAGAACCG TCCACAGGCC
GATGTGCGGG TGATCTACAA CGTGCGTACG CCGCAGGATG TTATTTTCGC CGATGAGTGG
CGTAACTATC CGGTAACGCT GGTGGCGGAA AATAACGTTA CCGAAGGCTT TATCGCTGGT
CGTCTCACTC GCGAACTCCT GACTCGCGTT CCTGACTTAG CTTCACGTAC CGTGATGACC
TGCGGCCCGG CTCCGTATAT GGATTGGGTA GAGCAGGAAG TGAAAGCACT CGGCGTGACG
CGTTTCTTTA AAGAGAAATT CTTCACCCCA GTAGCGGAAG CTGCGACCAG CGGTCTGAAA
TTCACCAAAC TGCAACCGGC GCAAGAATTT TACGCTCCTG TTGGCACCAC GCTGCTGGAG
GCGCTGGAAA GCAATAACGT TCCGGTTGTC GCCGCCTGCC GCGCGGGTGT TTGCGGCTGC
TGTAAGACGA AAGTGGTTTC CGGTGAATAT ACGGTGAGCA GTACAATGAC GCTGACCGAC
GCCGAAATCG CTGAAGGTTA CGTACTGGCC TGCTCCTGCC ATCCGCAGGG GGATTTGGTT
CTCGCATAA
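
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation such as the GC content field. A minimal sketch of one way to do that follows; the helper names are hypothetical, and how the database rounds its reported percentage is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence (whitespace ignored)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence from this record, used as sample input.
    first_line = ("ATGACGATGC CAACGAATCA ATGCCCGTGG "
                  "CGGATGCAGG TTCATCACAT TAAGCAAGAA")
    print(len(clean_sequence(first_line)))
    print(round(gc_percent(first_line), 1))
```

Pasting the full gene sequence into `gc_percent` gives a value that can be compared with the GC content field of the record.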
 
Protein sequence
MTMPTNQCPW RMQVHHIKQE TPDVWTISLI CHDYYPYRAG QYALVSVRNS AETLRAYTIS 
STPGVSEYIT LTVRRIDDGV GSQWLTRDVK RGDYLWLSDA MGEFTCDDKA EDKFLLLAAG
CGVTPIMSMR RWLAKNRPQA DVRVIYNVRT PQDVIFADEW RNYPVTLVAE NNVTEGFIAG
RLTRELLTRV PDLASRTVMT CGPAPYMDWV EQEVKALGVT RFFKEKFFTP VAEAATSGLK
FTKLQPAQEF YAPVGTTLLE ALESNNVPVV AACRAGVCGC CKTKVVSGEY TVSSTMTLTD
AEIAEGYVLA CSCHPQGDLV LA
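
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is an illustration in plain Python, not the database's own pipeline. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the gene sequence is given in coding orientation. The `translate` helper is hypothetical.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order (NCBI layout);
# the assignments are the same for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; trailing bases that do not fill a codon are dropped.
    """
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First two blocks and the last block of the gene sequence in this record.
    print(translate("ATGACGATGC CAACGAATCA"))
    print(translate("CTCGCATAA"))
```

The first two blocks of the gene sequence translate to `MTMPTN` and the final block to `LA`, which are the first six and last two residues of the protein sequence listed above.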