Gene ECH74115_4168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4168 
Symbol 
ID6971463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3858608 
End bp3861706 
Gene Length3099 bp 
Protein Length1032 aa 
Translation table11 
GC content52% 
IMG OID643387914 
Productputative selenate reductase subunit YgfK 
Protein accessionYP_002272353 
Protein GI209398177 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases 
TIGRFAM ID[TIGR03315] putative selenate reductase, YgfK subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGATA TTATGCGTCC CATTCCGTTT GAGGAACTTT TGACGCGCAT ATTTGATGAA 
TACCAACAAC AACGCTCAAT CTTTGGTATT CCCGAGCAAC AGTTTTACTC ACCTGTAAAA
GGTAAAACTG TTAGCGTCTT CGGTGAAACC TGTGCCACTC CCGTCGGCCC TGCCGCTGGC
CCGCACACGC AGCTCGCGCA AAATATTGTC ACTTCCTGGC TGACTGGCGG ACGTTTCATC
GAACTAAAAA CCGTCCAAAT TCTTGACCGC CTGGAGCTGG AAAAGCCCTG TATTGATGCC
GAAGACGAGT GCTTTAACAC CGAATGGTCT ACCGAATTTA CCCTGCTTAA AGCCTGGGAT
GAATACCTCA AAGCCTGGTT TGCCCTGCAC CTTCTCGAAG CGATGTTCCA ACCTTCTGAT
TCCGGTAAAT CGTTCATCTT TAATATGAGC GTCGGTTACA ACCTCGAAGG TATTAAGCAA
CCGCCAATGC AACAGTTCAT CGACAATATG ATGGACGCAT CTGACCATCC GAAATTCGCG
CAATATCGCG ATACGCTGAA TAAATTACTC CAGGATGACG CATTTTTAGC CCGCCACGGA
TTGCAGGAAA AACGCGAAAG CTTGCAAGCC TTACCCGCTC GCATCCCCAC CAGTATGGTG
CATGGCGTCA CCCTCTCCAC CATGCACGGC TGTCCTCCGC ATGAAATCGA AGCCATTTGC
CGCTACATGC TGGAAGAAAA AGGGCTCAAC ACCTTTGTGA AACTTAACCC GACCTTACTG
GGGTACGCGC GTGTTCGTGA GATCCTCGAT GTCTGCGGTT TCGGTTACAT AGGCTTAAAA
GAAGAGTCAT TTGATCACGA CCTCAAGCTG ACGCAAGCAC TGGAAATGCT GGAACGCCTG
ATGGCACTGG CAAAAGAAAA ATCACTCGGC TTTGGCGTAA AACTGACTAA CACTCTCGGC
ACCATCAACA ATAAAGGCGC ACTGCCTGGT GAAGAGATGT ATATGTCAGG CCGTGCGCTG
TTCCCGCTCT CCATCAATGT TGCAGCAGTT CTCTCTCGCG CCTTTGACGG CAAACTGCCC
ATTTCTTATT CCGGTGGTGC CAGTCAGCTG ACTATCCGCG ATATTTTTGA TACAGGTATT
CGCCCTATTA CTATGGCAAC CGACCTGCTG AAACCTGGCG GCTATTTGCG CTTAAGTGCC
TGCATGCGCG AGCTGGAAGG CTCCGACGCC TGGGGACTTG ACCATGTTGA CGTCGAACGA
CTGAACAGAC TGGCAGCAGA TGCGTTAACC ATGGAATACA CCCAGAAACA CTGGAAGCCA
GAAGAGCGTA TTGAAGTGGC AGAAGACCTG CCGCTGACCG ACTGCTACGT TGCCCCCTGT
GTTACTGCCT GCGCTATCAA GCAAGATATT CCGGAATACA TCCGTCTGCT TGGCGAACAC
CGCTATGCCG ACGCGCTGGA ACTCATCTAC CAACGCAACG CTCTGCCCGC CATTACCGGT
CATATTTGCG ATCACCAGTG CCAATACAAC TGTACCCGCC TGGATTACGA CAGTGCGCTG
AATATCCGCG AACTGAAAAA AGTCGCGCTG GAAAAAGGTT GGGATGAATA TAAGCAACGC
TGGCACAAAC CAGCCGGTTC TGGTTCACGC CATCCGGTTG CCGTGATTGG TGCAGGTCCG
GCGGGTCTGG CAGCAGGTTA CTTCCTTGCC AGAGCGGGCC ATCCGGTTAC GCTGTTTGAA
CGCGAAGCCA ATGCGGGCGG CGTGGTGAAA AATATCATTC CTCAGTTCCG AATTCCGGCT
GAGTTAATTC AGCACGATAT CGATTTTGTT GCCGCTCACG GCGTGAAATT TGAGTATGGC
TGCTCACCCG ATTTAACCGT TGAGCAGTTA AAAAATCAGG ACTTCCACTA TGTTCTGATT
GCCACCGGCA CAGATAAAAA TAGCGGTGTG AAACTGGCGG GCGACAACCA AAATGTCTTG
AAATCACTCC CCTTCCTGCG TGAATACAAC AAGGGTACAG CGCTCAAGCT GGGCAAACAT
GTGGTCGTTG TCGGGGCGGG TAACACCGCA ATGGACTGCG CTCGTGCGGC GTTACGCGTT
CCAGGCGTAG AAAAAGCAAC GGTCGTTTAC CGTCGTTCAC TGCAAGAGAT GCCCGCATGG
CGCGAAGAGT ATGAAGAAGC GTTGCACGAC GGCGTGGAGT TCCGTTTCCT GAATAATCCG
GAACGTTTCG ATGCTGATGG CACCTTAACC TTGCGCGTTA TGTCGCTTGG CGAACCCGAT
GAGAAAGGTC GTCGTCGTCC GGTTGAAACC AACGAAACAG TAACACTGCA TGTAGACAGC
CTGATCACCG CCATTGGTGA ACAGCAGGAT ACTGAAGCCC TGAATGCGAT GGGCGTGCCG
CTGGACAAAA ACGGCTGGCC AGACGTCGAC CATAATGGCG AAACTCGTCT GACTGACGTC
TTTATGATCG GCGACGTACA GCGCGGACCA TCCTCCATTG TCGCTGCTGT CGGAACCGCG
CGTCGGGCGA CCGATGCCAT CCTTAGTCGG GAAAATATCC GTTCCCACCA GAACGATAAA
TACTGGAACA ACGTCAATCC GGCGGAAATC TATCAACGTA AAGGCGATAT CTCTATCACG
CTGGTGAACA GTGACGATCG TGATGCGTTT GTCGCGCAGG AAGCTGCTCG CTGCCTTGAA
TGTAACTACG TTTGCAGCAA GTGTGTGGAT GTCTGCCCGA ACCGCGCCAA CGTCTCCATT
GCGGTCCCAG GCTTCCAGAA TAGATTCCAG ACGCTGCACC TCGACGCTTA CTGTAACGAA
TGCGGCAACT GCGCTCAGTT CTGTCCGTGG AACGGTAAAC CATACAAAGA CAAAATCACC
GTTTTCAGCC TGGCGCAAGA CTTTGATAAC AGCAGCAACC CAGGCTTCCT TGTGGAAGAT
TGCCGGGTAC GAGTACGACT GAATAACCAA AGCTGGGTGT TAAACATCGA CAGCGAAGGT
CAGTTCAACA ACGTACCACC GGAGCTGAAC GATATGTGCC GCATCATCAG CCATGTCCAC
CAGCATCATC ATTATCTGCT GGGCCGCGTG GAGGTGTAA
 
Protein sequence
MGDIMRPIPF EELLTRIFDE YQQQRSIFGI PEQQFYSPVK GKTVSVFGET CATPVGPAAG 
PHTQLAQNIV TSWLTGGRFI ELKTVQILDR LELEKPCIDA EDECFNTEWS TEFTLLKAWD
EYLKAWFALH LLEAMFQPSD SGKSFIFNMS VGYNLEGIKQ PPMQQFIDNM MDASDHPKFA
QYRDTLNKLL QDDAFLARHG LQEKRESLQA LPARIPTSMV HGVTLSTMHG CPPHEIEAIC
RYMLEEKGLN TFVKLNPTLL GYARVREILD VCGFGYIGLK EESFDHDLKL TQALEMLERL
MALAKEKSLG FGVKLTNTLG TINNKGALPG EEMYMSGRAL FPLSINVAAV LSRAFDGKLP
ISYSGGASQL TIRDIFDTGI RPITMATDLL KPGGYLRLSA CMRELEGSDA WGLDHVDVER
LNRLAADALT MEYTQKHWKP EERIEVAEDL PLTDCYVAPC VTACAIKQDI PEYIRLLGEH
RYADALELIY QRNALPAITG HICDHQCQYN CTRLDYDSAL NIRELKKVAL EKGWDEYKQR
WHKPAGSGSR HPVAVIGAGP AGLAAGYFLA RAGHPVTLFE REANAGGVVK NIIPQFRIPA
ELIQHDIDFV AAHGVKFEYG CSPDLTVEQL KNQDFHYVLI ATGTDKNSGV KLAGDNQNVL
KSLPFLREYN KGTALKLGKH VVVVGAGNTA MDCARAALRV PGVEKATVVY RRSLQEMPAW
REEYEEALHD GVEFRFLNNP ERFDADGTLT LRVMSLGEPD EKGRRRPVET NETVTLHVDS
LITAIGEQQD TEALNAMGVP LDKNGWPDVD HNGETRLTDV FMIGDVQRGP SSIVAAVGTA
RRATDAILSR ENIRSHQNDK YWNNVNPAEI YQRKGDISIT LVNSDDRDAF VAQEAARCLE
CNYVCSKCVD VCPNRANVSI AVPGFQNRFQ TLHLDAYCNE CGNCAQFCPW NGKPYKDKIT
VFSLAQDFDN SSNPGFLVED CRVRVRLNNQ SWVLNIDSEG QFNNVPPELN DMCRIISHVH
QHHHYLLGRV EV