Gene EcSMS35_3011 details

Gene Information

Locus tag: EcSMS35_3011
Symbol:
ID: 6145666
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3094861
End bp: 3097959
Gene Length: 3099 bp
Protein Length: 1032 aa
Translation table: 11
GC content: 52%
IMG OID: 641617880
Product: putative selenate reductase subunit YgfK
Protein accession: YP_001745031
Protein GI: 170681388
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases
TIGRFAM ID: [TIGR03315] putative selenate reductase, YgfK subunit
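
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are not part of the record.

```python
# Values copied from the gene record.
start_bp = 3094861
end_bp = 3097959
gene_length_bp = 3099
protein_length_aa = 1032

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted in the protein length.
codons = span // 3
expected_protein = codons - 1

print(span == gene_length_bp)                 # coordinate span matches gene length
print(span % 3 == 0)                          # length is a whole number of codons
print(expected_protein == protein_length_aa)  # codons minus stop matches protein length
```

Under those assumptions all three checks agree with the values listed in the record.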


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGGATA TTATGCGTCC CATTCCGTTT GAGGAACTTT TGACGCGCAT ATTTGATGAA 
TACCAACAAC AACGCTCAAT CTTTGGTATT CCCGAGCAAC AGTTTTACTC ACCCGTAAAA
GGTAAAACTG TTAGCGTCTT CGGTGAAACC TGTGCCACTC CCGTCGGCCC TGCCGCTGGC
CCGCACACGC AGCTCGCGCA AAATATTGTC ACTTCCTGGC TGACTGGCGG GCGCTTCATC
GAACTAAAAA CCGTCCAAAT TCTTGACCGT CTGGAGCTGG AAAAGCCCTG TATCGATGCC
GAAGACGAGT GCTTTAACAC CGAATGGTCT ACCGAATTTA CCCTGCTTAA AGCCTGGGAT
GAATACCTCA AAGCCTGGTT TGCCCTGCAC CTTCTCGAAG CGATGTTCCA GCCTTCTGAT
TCCGGTAAAT CGTTCATCTT TAATATGAGC GTCGGTTATA ACCTCGAAGG TATTAAGCAA
CCGCCGATGC AACAGTTCAT CGACAATATG ATGGATGCAT CTGACCATCC GAAATTCGCT
CAATATCGCG ATACGCTGAA TAAATTACTC CAGGATGACG CATTTTTAGC CCGCCACGGA
TTGCAGGAAA AACGCGAATG CCTGCAGGCC TTACCGGCAC GCATCCCCAC CAGCATGGTA
GAAGGCGTTA CCCTCTCCAC CATGCACGGC TGTCCTCCGC ATGAAATCGA AGCCATTTGC
CGCTACATGC TGGAAGAAAA AGGGCTCAAT ACCTTTGTGA AACTCAACCC GACCTTACTG
GGGTACGCGC GTGTTCGTGA GATCCTCGAT GTCTGCGGTT TCGGTTACAT AGGCTTAAAA
GAAGAGTCAT TTGATCACGA CCTCAAGCTG ACGCAAGCGC TGGAAATGCT GGAACGCCTG
ATGGTACTGG CAAAAGAAAA ATCACTCGGC TTTGGCGTAA AACTGACTAA CACTCTCGGC
ACCATCAACA ATAAAGGCGC ACTGCCTGGT GAAGAGATGT ATATGTCAGG CCGTGCGCTG
TTCCCGCTCT CCATCAATGT TGCAGCAGTT CTCTCTCGCG CCTTTGACGG CAAACTGCCC
ATTTCTTATT CCGGTGGTGC CAGTCAGCTG ACTATCCGCG ATATTTTTGA TACTGGTATT
CGCCCTATTA CTATGGCAAC CGACCTGCTG AAACCTGGCG GCTATCTGCG CTTAAGTGCC
TGCATGCGCG AGCTGGAAGG CTCCGACGCC TGGGGGCTTG ACCATGTTGA CGTCGAACGC
CTGAACAAAC TGGCAGCAGA CGCGTTAACC ATGGAATACA CTCAGAAACA CTGGAAGCCA
GAAGAGCGTA TTGAAGTGGC AGAAGACCTG CCGCTGACCG ACTGCTACGT TGCACCGTGC
GTTACTGCTT GCGCCATTAA ACAGGATATT CCGGAATACA TCCGTCTGCT TGGCGAACAC
CGCTATGCCG ACGCACTGGA ACTCATCTAC CAACGCAACG CCCTGCCCGC CATTACCGGT
CATATTTGCG ATCACCAGTG CCAATACAAC TGTACCCGCC TGGATTACGA CAGTGCGCTG
AATATCCGCG AACTGAAAAA AGTCGCGCTG GAAAAAGGTT GGGATGAATA TAAGCAACGC
TGGCACAAAC CAGCCGGTTC TGGTTCACGC CATCCGGTTG CCGTGATTGG TGCAGGTCCG
GCGGGTCTGG CAGCAGGTTA CTTCCTTGCC AGAGCGGGCC ATCCGGTTAC GCTGTTTGAA
CGCGAAGCCA ATGCGGGCGG CGTGGTGAAA AATATCATTC CTCAGTTCCG TATTCCTGCA
GAGTTAATTC AGCACGATAT CGATTTTGTT GCCGCTCACG GCGTGAAATT TGAGTATGGC
TGCTCACCCG ATTTGACCGT TGAACAGTTA AAAAATCAGG GCTTCCACTA TGTTCTGATT
GCCACCGGCA CTGATAAAAA TAGCGGTGTG AAACTGGCGG GCGACAACCA AAATGTCTGG
AAATCACTCC CCTTCCTGCG TGAATACAAC AAGGGCACAG CGCTCAAGCT GGGCAAACAT
GTGGTCGTTG TCGGGGCGGG TAACACGGCA ATGGACTGCG CTCGTGCGGC GTTACGCGTT
CCAGGCGTAG AAAAAGCAAC GGTCGTTTAC CGTCGTTCAC TGCAAGAGAT GCCAGCATGG
CGCGAAGAGT ATGAAGAAGC GTTGCACGAC GGTGTGGAGT TCCGTTTCCT GAATAATCCG
GAACGTTTCG ATGCTGATGG CACCTTAACC TTGCGCGTTA TGTCGCTTGG CGAACCTGAT
GAGAAAGGTC GTCGCCGTCC GGTTGAAACC AACGAAACAG TAACACTGCA TGTAGACAGC
CTGATCACCG CCATTGGTGA ACAGCAGGAT ACTGAAGCCC TGAATGCGAT GGGCGTGCCG
CTGGACAAAA ACGGCTGGCC AGACGTCGAC CATAATGGCG AAACTCGTCT GACTGACGTC
TTTATGATCG GCGACGTACA GCGCGGACCA TCCTCCATTG TCGCTGCTGT CGGAACCGCG
CGTCGGGCGA CCGATGCCAT CCTTAGTCGG GAAAACATCC GTTCCCACCA GAACGATAAA
TACTGGAACA ACGTCAATCC GGCGGAAATC TATCAACGCA AAGGCGATAT CTCTATCACC
CTGGTGGATA GCGACGATCG TGACGCGTTT GTCGCCCAGG AAGCCGCACG CTGCCTTGAA
TGTAACTACG TTTGCAGCAA GTGTGTGGAT GTCTGCCCGA ACCGCGCCAA CGTATCCATT
GCGGTCCCTG GCTTCCAGAA CCGTTTCCAG ACGCTGCACC TCGACGCTTA CTGTAACGAA
TGCGGCAACT GCGCTCAGTT CTGCCCGTGG AACGGTAAAC CGTACAAAGA CAAAATCACC
GTCTTCAGCC TGTCGCAAGA CTTTGATAAC AGCAGCAACC CTGGCTTCCT TGTGGAAGAT
TGCCGGGTAC GAGTACGTCT GAATAACCAA AGCTGGGTGT TAAACATCGA CAGCGAAGGT
CAGTTCAACA ACGTACCACC GGAGCTGAAC GATATGTGCC GCATTATTAG CCATGTCCAC
CAGCATCATC ATTATCTGCT GGGCCGCGTG GAGGTGTAA
 
Protein sequence
MGDIMRPIPF EELLTRIFDE YQQQRSIFGI PEQQFYSPVK GKTVSVFGET CATPVGPAAG 
PHTQLAQNIV TSWLTGGRFI ELKTVQILDR LELEKPCIDA EDECFNTEWS TEFTLLKAWD
EYLKAWFALH LLEAMFQPSD SGKSFIFNMS VGYNLEGIKQ PPMQQFIDNM MDASDHPKFA
QYRDTLNKLL QDDAFLARHG LQEKRECLQA LPARIPTSMV EGVTLSTMHG CPPHEIEAIC
RYMLEEKGLN TFVKLNPTLL GYARVREILD VCGFGYIGLK EESFDHDLKL TQALEMLERL
MVLAKEKSLG FGVKLTNTLG TINNKGALPG EEMYMSGRAL FPLSINVAAV LSRAFDGKLP
ISYSGGASQL TIRDIFDTGI RPITMATDLL KPGGYLRLSA CMRELEGSDA WGLDHVDVER
LNKLAADALT MEYTQKHWKP EERIEVAEDL PLTDCYVAPC VTACAIKQDI PEYIRLLGEH
RYADALELIY QRNALPAITG HICDHQCQYN CTRLDYDSAL NIRELKKVAL EKGWDEYKQR
WHKPAGSGSR HPVAVIGAGP AGLAAGYFLA RAGHPVTLFE REANAGGVVK NIIPQFRIPA
ELIQHDIDFV AAHGVKFEYG CSPDLTVEQL KNQGFHYVLI ATGTDKNSGV KLAGDNQNVW
KSLPFLREYN KGTALKLGKH VVVVGAGNTA MDCARAALRV PGVEKATVVY RRSLQEMPAW
REEYEEALHD GVEFRFLNNP ERFDADGTLT LRVMSLGEPD EKGRRRPVET NETVTLHVDS
LITAIGEQQD TEALNAMGVP LDKNGWPDVD HNGETRLTDV FMIGDVQRGP SSIVAAVGTA
RRATDAILSR ENIRSHQNDK YWNNVNPAEI YQRKGDISIT LVDSDDRDAF VAQEAARCLE
CNYVCSKCVD VCPNRANVSI AVPGFQNRFQ TLHLDAYCNE CGNCAQFCPW NGKPYKDKIT
VFSLSQDFDN SSNPGFLVED CRVRVRLNNQ SWVLNIDSEG QFNNVPPELN DMCRIISHVH
QHHHYLLGRV EV
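
The sequences above are printed in blocks of ten characters, six blocks per line. Before any downstream use, that layout has to be stripped. The following is a minimal sketch of doing so and of computing a GC fraction for a DNA string; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First line of the gene sequence, used only as a short sample.
sample = "ATGGGGGATA TTATGCGTCC CATTCCGTTT GAGGAACTTT TGACGCGCAT ATTTGATGAA"
dna = clean_sequence(sample)
print(len(dna), round(gc_fraction(dna), 2))
```

Applying the same two helpers to the full gene sequence would allow a comparison with the listed gene length and GC content.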