Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3011 |
Symbol | |
ID | 6145666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3094861 |
End bp | 3097959 |
Gene Length | 3099 bp |
Protein Length | 1032 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617880 |
Product | putative selenate reductase subunit YgfK |
Protein accession | YP_001745031 |
Protein GI | 170681388 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases |
TIGRFAM ID | [TIGR03315] putative selenate reductase, YgfK subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGGATA TTATGCGTCC CATTCCGTTT GAGGAACTTT TGACGCGCAT ATTTGATGAA TACCAACAAC AACGCTCAAT CTTTGGTATT CCCGAGCAAC AGTTTTACTC ACCCGTAAAA GGTAAAACTG TTAGCGTCTT CGGTGAAACC TGTGCCACTC CCGTCGGCCC TGCCGCTGGC CCGCACACGC AGCTCGCGCA AAATATTGTC ACTTCCTGGC TGACTGGCGG GCGCTTCATC GAACTAAAAA CCGTCCAAAT TCTTGACCGT CTGGAGCTGG AAAAGCCCTG TATCGATGCC GAAGACGAGT GCTTTAACAC CGAATGGTCT ACCGAATTTA CCCTGCTTAA AGCCTGGGAT GAATACCTCA AAGCCTGGTT TGCCCTGCAC CTTCTCGAAG CGATGTTCCA GCCTTCTGAT TCCGGTAAAT CGTTCATCTT TAATATGAGC GTCGGTTATA ACCTCGAAGG TATTAAGCAA CCGCCGATGC AACAGTTCAT CGACAATATG ATGGATGCAT CTGACCATCC GAAATTCGCT CAATATCGCG ATACGCTGAA TAAATTACTC CAGGATGACG CATTTTTAGC CCGCCACGGA TTGCAGGAAA AACGCGAATG CCTGCAGGCC TTACCGGCAC GCATCCCCAC CAGCATGGTA GAAGGCGTTA CCCTCTCCAC CATGCACGGC TGTCCTCCGC ATGAAATCGA AGCCATTTGC CGCTACATGC TGGAAGAAAA AGGGCTCAAT ACCTTTGTGA AACTCAACCC GACCTTACTG GGGTACGCGC GTGTTCGTGA GATCCTCGAT GTCTGCGGTT TCGGTTACAT AGGCTTAAAA GAAGAGTCAT TTGATCACGA CCTCAAGCTG ACGCAAGCGC TGGAAATGCT GGAACGCCTG ATGGTACTGG CAAAAGAAAA ATCACTCGGC TTTGGCGTAA AACTGACTAA CACTCTCGGC ACCATCAACA ATAAAGGCGC ACTGCCTGGT GAAGAGATGT ATATGTCAGG CCGTGCGCTG TTCCCGCTCT CCATCAATGT TGCAGCAGTT CTCTCTCGCG CCTTTGACGG CAAACTGCCC ATTTCTTATT CCGGTGGTGC CAGTCAGCTG ACTATCCGCG ATATTTTTGA TACTGGTATT CGCCCTATTA CTATGGCAAC CGACCTGCTG AAACCTGGCG GCTATCTGCG CTTAAGTGCC TGCATGCGCG AGCTGGAAGG CTCCGACGCC TGGGGGCTTG ACCATGTTGA CGTCGAACGC CTGAACAAAC TGGCAGCAGA CGCGTTAACC ATGGAATACA CTCAGAAACA CTGGAAGCCA GAAGAGCGTA TTGAAGTGGC AGAAGACCTG CCGCTGACCG ACTGCTACGT TGCACCGTGC GTTACTGCTT GCGCCATTAA ACAGGATATT CCGGAATACA TCCGTCTGCT TGGCGAACAC CGCTATGCCG ACGCACTGGA ACTCATCTAC CAACGCAACG CCCTGCCCGC CATTACCGGT CATATTTGCG ATCACCAGTG CCAATACAAC TGTACCCGCC TGGATTACGA CAGTGCGCTG AATATCCGCG AACTGAAAAA AGTCGCGCTG GAAAAAGGTT GGGATGAATA TAAGCAACGC TGGCACAAAC CAGCCGGTTC TGGTTCACGC CATCCGGTTG CCGTGATTGG TGCAGGTCCG GCGGGTCTGG CAGCAGGTTA CTTCCTTGCC AGAGCGGGCC ATCCGGTTAC GCTGTTTGAA CGCGAAGCCA ATGCGGGCGG CGTGGTGAAA AATATCATTC CTCAGTTCCG TATTCCTGCA GAGTTAATTC AGCACGATAT CGATTTTGTT GCCGCTCACG GCGTGAAATT TGAGTATGGC TGCTCACCCG ATTTGACCGT TGAACAGTTA AAAAATCAGG GCTTCCACTA TGTTCTGATT GCCACCGGCA CTGATAAAAA TAGCGGTGTG AAACTGGCGG GCGACAACCA AAATGTCTGG AAATCACTCC CCTTCCTGCG TGAATACAAC AAGGGCACAG CGCTCAAGCT GGGCAAACAT GTGGTCGTTG TCGGGGCGGG TAACACGGCA ATGGACTGCG CTCGTGCGGC GTTACGCGTT CCAGGCGTAG AAAAAGCAAC GGTCGTTTAC CGTCGTTCAC TGCAAGAGAT GCCAGCATGG CGCGAAGAGT ATGAAGAAGC GTTGCACGAC GGTGTGGAGT TCCGTTTCCT GAATAATCCG GAACGTTTCG ATGCTGATGG CACCTTAACC TTGCGCGTTA TGTCGCTTGG CGAACCTGAT GAGAAAGGTC GTCGCCGTCC GGTTGAAACC AACGAAACAG TAACACTGCA TGTAGACAGC CTGATCACCG CCATTGGTGA ACAGCAGGAT ACTGAAGCCC TGAATGCGAT GGGCGTGCCG CTGGACAAAA ACGGCTGGCC AGACGTCGAC CATAATGGCG AAACTCGTCT GACTGACGTC TTTATGATCG GCGACGTACA GCGCGGACCA TCCTCCATTG TCGCTGCTGT CGGAACCGCG CGTCGGGCGA CCGATGCCAT CCTTAGTCGG GAAAACATCC GTTCCCACCA GAACGATAAA TACTGGAACA ACGTCAATCC GGCGGAAATC TATCAACGCA AAGGCGATAT CTCTATCACC CTGGTGGATA GCGACGATCG TGACGCGTTT GTCGCCCAGG AAGCCGCACG CTGCCTTGAA TGTAACTACG TTTGCAGCAA GTGTGTGGAT GTCTGCCCGA ACCGCGCCAA CGTATCCATT GCGGTCCCTG GCTTCCAGAA CCGTTTCCAG ACGCTGCACC TCGACGCTTA CTGTAACGAA TGCGGCAACT GCGCTCAGTT CTGCCCGTGG AACGGTAAAC CGTACAAAGA CAAAATCACC GTCTTCAGCC TGTCGCAAGA CTTTGATAAC AGCAGCAACC CTGGCTTCCT TGTGGAAGAT TGCCGGGTAC GAGTACGTCT GAATAACCAA AGCTGGGTGT TAAACATCGA CAGCGAAGGT CAGTTCAACA ACGTACCACC GGAGCTGAAC GATATGTGCC GCATTATTAG CCATGTCCAC CAGCATCATC ATTATCTGCT GGGCCGCGTG GAGGTGTAA
|
Protein sequence | MGDIMRPIPF EELLTRIFDE YQQQRSIFGI PEQQFYSPVK GKTVSVFGET CATPVGPAAG PHTQLAQNIV TSWLTGGRFI ELKTVQILDR LELEKPCIDA EDECFNTEWS TEFTLLKAWD EYLKAWFALH LLEAMFQPSD SGKSFIFNMS VGYNLEGIKQ PPMQQFIDNM MDASDHPKFA QYRDTLNKLL QDDAFLARHG LQEKRECLQA LPARIPTSMV EGVTLSTMHG CPPHEIEAIC RYMLEEKGLN TFVKLNPTLL GYARVREILD VCGFGYIGLK EESFDHDLKL TQALEMLERL MVLAKEKSLG FGVKLTNTLG TINNKGALPG EEMYMSGRAL FPLSINVAAV LSRAFDGKLP ISYSGGASQL TIRDIFDTGI RPITMATDLL KPGGYLRLSA CMRELEGSDA WGLDHVDVER LNKLAADALT MEYTQKHWKP EERIEVAEDL PLTDCYVAPC VTACAIKQDI PEYIRLLGEH RYADALELIY QRNALPAITG HICDHQCQYN CTRLDYDSAL NIRELKKVAL EKGWDEYKQR WHKPAGSGSR HPVAVIGAGP AGLAAGYFLA RAGHPVTLFE REANAGGVVK NIIPQFRIPA ELIQHDIDFV AAHGVKFEYG CSPDLTVEQL KNQGFHYVLI ATGTDKNSGV KLAGDNQNVW KSLPFLREYN KGTALKLGKH VVVVGAGNTA MDCARAALRV PGVEKATVVY RRSLQEMPAW REEYEEALHD GVEFRFLNNP ERFDADGTLT LRVMSLGEPD EKGRRRPVET NETVTLHVDS LITAIGEQQD TEALNAMGVP LDKNGWPDVD HNGETRLTDV FMIGDVQRGP SSIVAAVGTA RRATDAILSR ENIRSHQNDK YWNNVNPAEI YQRKGDISIT LVDSDDRDAF VAQEAARCLE CNYVCSKCVD VCPNRANVSI AVPGFQNRFQ TLHLDAYCNE CGNCAQFCPW NGKPYKDKIT VFSLSQDFDN SSNPGFLVED CRVRVRLNNQ SWVLNIDSEG QFNNVPPELN DMCRIISHVH QHHHYLLGRV EV
|
| |