Gene ECH74115_5123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5123 
Symbol 
ID6969957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4763465 
End bp4764529 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content51% 
IMG OID643388795 
Productputative oxidoreductase 
Protein accessionYP_002273221 
Protein GI209398169 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.121383 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACATT TCGACGTGGC GATTATTGGC CTCGGCCCGG CAGGGTCGGC GTTGGCACGA 
AAGTTAGCCG GCAAAATGCA GGTGATCGCG CTGGATAAAA AGCACCAGCA TGGTACTGAA
GGTTTCAGCA AGCCTTGTGG CGGTCTGCTG GCACCGGACG CGCAGCGATC TTTTATTCGC
GATGGACTGA CGCTTCCTGT CGATGTGATC GCCAATCCAC AGATTTTCAG CGTCAAAACT
GTCGACGTCG CCGCATCGCT CACGCGTAAC TACCAGCGAA GCTATATCAA TATTAATCGC
CATGCTTTCG ACTTGTGGAT GAAATCGCTG ATCCCCGCCA GCGTTGAGGT TTATCACGAT
AGCCTGTGCC GGAAAATCTG GCGTGAGGAT GATAAATGGC ATGTCATTTT TCGTGCAGAC
GGTTGGGAGC AGCATATTAC TGCCCGCTAT CTGGTCGGTG CAGATGGTGC CAACTCGATG
GTGCGGCGAT ATCTCTACCC GGACCATCAG ATTCGTAAAT ATGTCGCTAT CCAGCAGTGG
TTCGCAGAGA AACATCCGGT GCCGTTCTAC TCATGCATCT TTGATAATGC GATAACTGAC
TGTTACTCAT GGAGTATCAG CAAAGACGGT TATTTTATCT TTGGCGGTGC CTATCCAATG
AAAGACGGTC AGACGCGTTT CACGACGCTG AAAGAGAAAA TGAGCGCCTT TCAATTCCAG
TTTGGTAAGG CGGTAAAAAG CGAAAAATGC ACGGTGCTAT TTCCCTCACG CTGGCAGGAT
TTTGTCTGCG GTAAGGACAA CGCCTTTCTG ATTGGTGAAG CGGCGGGATT TATCAGCGCC
AGCTCGCTGG AGGGGATAAG CTATGCGCTG GATAGCGCAG AGATTCTGCG TTCGGTGTTA
CTGAAGCAGC CAGAGAAGAT CAACGCAGCC TACTGGCACG CCACCCGCAA ACTGCGTTTA
AAACTCTTCG GCAAGATAGT AAAAAGCCGA TGCCTGACCG CACCGGCTTT AAGAAAGTGG
ATTATGCGCA GTGGTATGGC GCATATTCCA CAGTTGAAAG ATTAG
 
Protein sequence
MEHFDVAIIG LGPAGSALAR KLAGKMQVIA LDKKHQHGTE GFSKPCGGLL APDAQRSFIR 
DGLTLPVDVI ANPQIFSVKT VDVAASLTRN YQRSYININR HAFDLWMKSL IPASVEVYHD
SLCRKIWRED DKWHVIFRAD GWEQHITARY LVGADGANSM VRRYLYPDHQ IRKYVAIQQW
FAEKHPVPFY SCIFDNAITD CYSWSISKDG YFIFGGAYPM KDGQTRFTTL KEKMSAFQFQ
FGKAVKSEKC TVLFPSRWQD FVCGKDNAFL IGEAAGFISA SSLEGISYAL DSAEILRSVL
LKQPEKINAA YWHATRKLRL KLFGKIVKSR CLTAPALRKW IMRSGMAHIP QLKD