Gene Information |
Locus tag | ECH74115_4168 |
Symbol | |
ID | 6971463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3858608 |
End bp | 3861706 |
Gene Length | 3099 bp |
Protein Length | 1032 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643387914 |
Product | putative selenate reductase subunit YgfK |
Protein accession | YP_002272353 |
Protein GI | 209398177 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases |
TIGRFAM ID | [TIGR03315] putative selenate reductase, YgfK subunit |
|
|
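The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the convention that reproduces the listed gene length) and that the final codon of the unspliced CDS is a stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3858608
END_BP = 3861706
GENE_LENGTH_BP = 3099
PROTEIN_LENGTH_AA = 1032


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


print(gene_length(START_BP, END_BP))   # compare with "Gene Length"
print(protein_length(GENE_LENGTH_BP))  # compare with "Protein Length"
```

Under these assumptions, 3861706 - 3858608 + 1 gives 3099 bp, and 3099 / 3 - 1 gives 1032 aa, matching the listed values.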
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGGATA TTATGCGTCC CATTCCGTTT GAGGAACTTT TGACGCGCAT ATTTGATGAA TACCAACAAC AACGCTCAAT CTTTGGTATT CCCGAGCAAC AGTTTTACTC ACCTGTAAAA GGTAAAACTG TTAGCGTCTT CGGTGAAACC TGTGCCACTC CCGTCGGCCC TGCCGCTGGC CCGCACACGC AGCTCGCGCA AAATATTGTC ACTTCCTGGC TGACTGGCGG ACGTTTCATC GAACTAAAAA CCGTCCAAAT TCTTGACCGC CTGGAGCTGG AAAAGCCCTG TATTGATGCC GAAGACGAGT GCTTTAACAC CGAATGGTCT ACCGAATTTA CCCTGCTTAA AGCCTGGGAT GAATACCTCA AAGCCTGGTT TGCCCTGCAC CTTCTCGAAG CGATGTTCCA ACCTTCTGAT TCCGGTAAAT CGTTCATCTT TAATATGAGC GTCGGTTACA ACCTCGAAGG TATTAAGCAA CCGCCAATGC AACAGTTCAT CGACAATATG ATGGACGCAT CTGACCATCC GAAATTCGCG CAATATCGCG ATACGCTGAA TAAATTACTC CAGGATGACG CATTTTTAGC CCGCCACGGA TTGCAGGAAA AACGCGAAAG CTTGCAAGCC TTACCCGCTC GCATCCCCAC CAGTATGGTG CATGGCGTCA CCCTCTCCAC CATGCACGGC TGTCCTCCGC ATGAAATCGA AGCCATTTGC CGCTACATGC TGGAAGAAAA AGGGCTCAAC ACCTTTGTGA AACTTAACCC GACCTTACTG GGGTACGCGC GTGTTCGTGA GATCCTCGAT GTCTGCGGTT TCGGTTACAT AGGCTTAAAA GAAGAGTCAT TTGATCACGA CCTCAAGCTG ACGCAAGCAC TGGAAATGCT GGAACGCCTG ATGGCACTGG CAAAAGAAAA ATCACTCGGC TTTGGCGTAA AACTGACTAA CACTCTCGGC ACCATCAACA ATAAAGGCGC ACTGCCTGGT GAAGAGATGT ATATGTCAGG CCGTGCGCTG TTCCCGCTCT CCATCAATGT TGCAGCAGTT CTCTCTCGCG CCTTTGACGG CAAACTGCCC ATTTCTTATT CCGGTGGTGC CAGTCAGCTG ACTATCCGCG ATATTTTTGA TACAGGTATT CGCCCTATTA CTATGGCAAC CGACCTGCTG AAACCTGGCG GCTATTTGCG CTTAAGTGCC TGCATGCGCG AGCTGGAAGG CTCCGACGCC TGGGGACTTG ACCATGTTGA CGTCGAACGA CTGAACAGAC TGGCAGCAGA TGCGTTAACC ATGGAATACA CCCAGAAACA CTGGAAGCCA GAAGAGCGTA TTGAAGTGGC AGAAGACCTG CCGCTGACCG ACTGCTACGT TGCCCCCTGT GTTACTGCCT GCGCTATCAA GCAAGATATT CCGGAATACA TCCGTCTGCT TGGCGAACAC CGCTATGCCG ACGCGCTGGA ACTCATCTAC CAACGCAACG CTCTGCCCGC CATTACCGGT CATATTTGCG ATCACCAGTG CCAATACAAC TGTACCCGCC TGGATTACGA CAGTGCGCTG AATATCCGCG AACTGAAAAA AGTCGCGCTG GAAAAAGGTT GGGATGAATA TAAGCAACGC TGGCACAAAC CAGCCGGTTC TGGTTCACGC CATCCGGTTG CCGTGATTGG TGCAGGTCCG GCGGGTCTGG CAGCAGGTTA CTTCCTTGCC AGAGCGGGCC ATCCGGTTAC GCTGTTTGAA CGCGAAGCCA ATGCGGGCGG CGTGGTGAAA AATATCATTC CTCAGTTCCG AATTCCGGCT 
GAGTTAATTC AGCACGATAT CGATTTTGTT GCCGCTCACG GCGTGAAATT TGAGTATGGC TGCTCACCCG ATTTAACCGT TGAGCAGTTA AAAAATCAGG ACTTCCACTA TGTTCTGATT GCCACCGGCA CAGATAAAAA TAGCGGTGTG AAACTGGCGG GCGACAACCA AAATGTCTTG AAATCACTCC CCTTCCTGCG TGAATACAAC AAGGGTACAG CGCTCAAGCT GGGCAAACAT GTGGTCGTTG TCGGGGCGGG TAACACCGCA ATGGACTGCG CTCGTGCGGC GTTACGCGTT CCAGGCGTAG AAAAAGCAAC GGTCGTTTAC CGTCGTTCAC TGCAAGAGAT GCCCGCATGG CGCGAAGAGT ATGAAGAAGC GTTGCACGAC GGCGTGGAGT TCCGTTTCCT GAATAATCCG GAACGTTTCG ATGCTGATGG CACCTTAACC TTGCGCGTTA TGTCGCTTGG CGAACCCGAT GAGAAAGGTC GTCGTCGTCC GGTTGAAACC AACGAAACAG TAACACTGCA TGTAGACAGC CTGATCACCG CCATTGGTGA ACAGCAGGAT ACTGAAGCCC TGAATGCGAT GGGCGTGCCG CTGGACAAAA ACGGCTGGCC AGACGTCGAC CATAATGGCG AAACTCGTCT GACTGACGTC TTTATGATCG GCGACGTACA GCGCGGACCA TCCTCCATTG TCGCTGCTGT CGGAACCGCG CGTCGGGCGA CCGATGCCAT CCTTAGTCGG GAAAATATCC GTTCCCACCA GAACGATAAA TACTGGAACA ACGTCAATCC GGCGGAAATC TATCAACGTA AAGGCGATAT CTCTATCACG CTGGTGAACA GTGACGATCG TGATGCGTTT GTCGCGCAGG AAGCTGCTCG CTGCCTTGAA TGTAACTACG TTTGCAGCAA GTGTGTGGAT GTCTGCCCGA ACCGCGCCAA CGTCTCCATT GCGGTCCCAG GCTTCCAGAA TAGATTCCAG ACGCTGCACC TCGACGCTTA CTGTAACGAA TGCGGCAACT GCGCTCAGTT CTGTCCGTGG AACGGTAAAC CATACAAAGA CAAAATCACC GTTTTCAGCC TGGCGCAAGA CTTTGATAAC AGCAGCAACC CAGGCTTCCT TGTGGAAGAT TGCCGGGTAC GAGTACGACT GAATAACCAA AGCTGGGTGT TAAACATCGA CAGCGAAGGT CAGTTCAACA ACGTACCACC GGAGCTGAAC GATATGTGCC GCATCATCAG CCATGTCCAC CAGCATCATC ATTATCTGCT GGGCCGCGTG GAGGTGTAA
|
Protein sequence | MGDIMRPIPF EELLTRIFDE YQQQRSIFGI PEQQFYSPVK GKTVSVFGET CATPVGPAAG PHTQLAQNIV TSWLTGGRFI ELKTVQILDR LELEKPCIDA EDECFNTEWS TEFTLLKAWD EYLKAWFALH LLEAMFQPSD SGKSFIFNMS VGYNLEGIKQ PPMQQFIDNM MDASDHPKFA QYRDTLNKLL QDDAFLARHG LQEKRESLQA LPARIPTSMV HGVTLSTMHG CPPHEIEAIC RYMLEEKGLN TFVKLNPTLL GYARVREILD VCGFGYIGLK EESFDHDLKL TQALEMLERL MALAKEKSLG FGVKLTNTLG TINNKGALPG EEMYMSGRAL FPLSINVAAV LSRAFDGKLP ISYSGGASQL TIRDIFDTGI RPITMATDLL KPGGYLRLSA CMRELEGSDA WGLDHVDVER LNRLAADALT MEYTQKHWKP EERIEVAEDL PLTDCYVAPC VTACAIKQDI PEYIRLLGEH RYADALELIY QRNALPAITG HICDHQCQYN CTRLDYDSAL NIRELKKVAL EKGWDEYKQR WHKPAGSGSR HPVAVIGAGP AGLAAGYFLA RAGHPVTLFE REANAGGVVK NIIPQFRIPA ELIQHDIDFV AAHGVKFEYG CSPDLTVEQL KNQDFHYVLI ATGTDKNSGV KLAGDNQNVL KSLPFLREYN KGTALKLGKH VVVVGAGNTA MDCARAALRV PGVEKATVVY RRSLQEMPAW REEYEEALHD GVEFRFLNNP ERFDADGTLT LRVMSLGEPD EKGRRRPVET NETVTLHVDS LITAIGEQQD TEALNAMGVP LDKNGWPDVD HNGETRLTDV FMIGDVQRGP SSIVAAVGTA RRATDAILSR ENIRSHQNDK YWNNVNPAEI YQRKGDISIT LVNSDDRDAF VAQEAARCLE CNYVCSKCVD VCPNRANVSI AVPGFQNRFQ TLHLDAYCNE CGNCAQFCPW NGKPYKDKIT VFSLAQDFDN SSNPGFLVED CRVRVRLNNQ SWVLNIDSEG QFNNVPPELN DMCRIISHVH QHHHYLLGRV EV
|
| |
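The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any calculation. The following is a minimal sketch, not the database's own method, of how the blocks might be cleaned and how the GC content and terminal stop codon could be checked. The helper names are hypothetical, and only short excerpts of the gene sequence (its first two blocks and its last four) are used here for illustration.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence shown above.
prefix = clean_sequence("ATGGGGGATA TTATGCGTCC")
print(len(prefix), gc_percent(prefix))

# Last four blocks of the gene sequence shown above (39 bases, in frame).
tail = clean_sequence("CAGCATCATC ATTATCTGCT GGGCCGCGTG GAGGTGTAA")
print(ends_with_stop(tail))
```

Applying `gc_percent` to the full cleaned gene sequence would be the way to compare against the listed GC content of 52%; the excerpt above is too short to be representative.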