Gene EcolC_0830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0830 
Symbol 
ID6065389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp893585 
End bp896683 
Gene Length3099 bp 
Protein Length1032 aa 
Translation table11 
GC content52% 
IMG OID641600235 
Productputative selenate reductase subunit YgfK 
Protein accessionYP_001723829 
Protein GI170018875 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases 
TIGRFAM ID[TIGR03315] putative selenate reductase, YgfK subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGATA TTATGCGTCC CATTCCGTTT GAGGAACTTT TGACGCGCAT ATTTGATGAA 
TACCAACAAC AACGCTCAAT CTTTGGTATT CCCGAGCAAC AGTTTTACTC ACCTGTAAAA
GGTAAAACTG TTAGCGTCTT CGGTGAAACC TGTGCCACTC CCGTCGGCCC TGCCGCTGGC
CCGCACACGC AGCTCGCGCA AAATATTGTC ACTTCCTGGC TGACTGGCGG ACGCTTCATC
GAACTAAAAA CCGTCCAAAT TCTTGACCGC CTGGAGCTGG AAAAGCCCTG TATCGATGCC
GAAGACGAGT GCTTTAACAC CGAATGGTCT ACCGAATTTA CCCTGCTTAA AGCCTGGGAT
GAATACCTCA AAGCCTGGTT TGCCCTGCAC CTTCTCGAAG CGATGTTCCA GCCTTCTGAT
TCCGGTAAAT CGTTCATCTT TAATATGAGC GTCGGTTACA ACCTCGAAGG TATTAAGCAA
CCGCCGATGC AACAGTTCAT CGACAATATG ATGAACGCAT CTGACCATCC GAAATTCGCT
CAATATCGCG ATACGCTGAA TAAATTACTC CAGGATGACG CATTTTTAGC TCGCCACGGA
TTGCAGGAAA AACGCGAAAG CTTGCAAGCC TTACCCGCTC GCATCCCCAC CAGTATGGTG
CATGGCGTCA CCCTCTCCAC CATGCACGGC TGTCCTCCGC ATGAAATCGA AGCCATTTGC
CGCTACATGC TGGAAGAAAA AGGGCTCAAC ACCTTTGTGA AACTTAACCC GACCTTACTG
GGGTACGCGC GTGTTCGTGA GATCCTCGAT GTCTGCGGTT TCGGTTACAT AGGCTTAAAA
GAAGAGTCAT TTGATCACGA CCTCAAGCTG ACGCAAGCAC TGGAAATGCT GGAACGCCTG
ATGGCACTGG CAAAAGAAAA ATCACTCGGC TTTGGCGTAA AACTGACTAA CACTCTCGGC
ACCATCAACA ATAAAGGCGC ACTGCCTGGT GAAGAGATGT ATATGTCAGG CCGTGCGCTG
TTCCCGCTCT CCATCAATGT TGCAGCAGTT CTCTCTCGCG CCTTTGACGG CAAACTGCCC
ATTTCTTATT CCGGTGGTGC CAGTCAGCTG ACTATCCGCG ATATTTTTGA TACAGGTATT
CGCCCTATTA CTATGGCAAC CGACCTGCTG AAACCTGGTG GCTATCTGCG TTTAAGTGCC
TGCATGCGCG AGCTGGAAGG CTCCGACGCC TGGGGACTTG ACCATGTTGA CGTCGAACGA
CTGAACAGAC TGGCAGCAGA CGCGCTAACC ATGGAATACA CCCAGAAACA CTGGAAGCCA
GAAGAGCGTA TTGAAGTGGC AGAAGACCTG CCGCTGACCG ACTGCTACGT TGCCCCCTGT
GTTACTGCCT GCGCTATCAA GCAAGATATT CCGGAATACA TCCGTCTGCT TGGCGAACAC
CGCTATGCCG ACGCGCTGGA ACTCATCTAC CAACGCAACG CTCTGCCCGC CATTACCGGT
CATATTTGCG ATCACCAGTG CCAATACAAC TGTACCCGCC TGGATTACGA CAGTGCGCTG
AATATCCGCG AACTGAAAAA AGTCGCGCTG GAAAAAGGTT GGGATGAATA TAAGCAACGC
TGGCACAAAC CAGCCGGTTC TGGTTCACGC CATCCGGTTG CCGTGATTGG TGCAGGTCCG
GCGGGTCTGG CAGCAGGTTA CTTCCTTGCC AGAGCGGGCC ATCCGGTTAC GCTGTTTGAA
CGCGAAGCCA ATGCGGGCGG CGTGGTGAAA AATATCATTC CTCAGTTCCG TATTCCTGCA
GAGTTAATTC AGCACGATAT CGATTTTGTT GCCGCTCACG GCGTGAAATT TGAGTATGGC
TGCTCACCCG ATTTAACCAT TGAGCAGTTA AAAAATCAGG GCTTCCACTA TGTTCTGATT
GCCACCGGCA CTGATAAAAA TAGCGGTGTG AAACTGGCGG GCGACAACCA AAATGTCTGG
AAATCACTCC CCTTCCTGCG TGAATACAAC AAGGGTACAG CGCTCAAGCT GGGCAAACAT
GTGGTCGTTG TCGGGGCGGG TAACACCGCA ATGGACTGCG CTCGTGCGGC GTTACGCGTT
CCAGGCGTAG AAAAAGCAAC GATCGTTTAC CGTCGTTCAC TACAAGAGAT GCCCGCATGG
CGCGAAGAGT ATGAAGAAGC GTTGCACGAC GGCGTAGAGT TCCGTTTCCT GAATAATCCG
GAACGTTTCG ATGCTGATGG CACCTTAACC TTGCGCGTTA TGTCGCTTGG CGAACCGGAT
GAGAAAGGTC GTCGTCGTCC GGTTGAAACC AATGAAACAG TAACACTGCT TGTAGACAGC
CTGATCACCG CCATTGGTGA ACAGCAGGAT ACTGAAGCCC TGAATGCGAT GGGCGTGCCG
CTGGACAAAA ACGGCTGGCC AGACGTCGAC CATAATGGCG AAACTCGTCT GACTGACGTC
TTTATGATCG GCGACGTACA GCGCGGACCA TCCTCCATTG TCGCTGCTGT CGGAACCGCG
CGTCGGGCGA CCGATGCCAT CCTTAGTCGG GAAAATATCC GTTCCCACCA GAACGATAAA
TACTGGAACA ACGTCAATCC AGTGGAAATC TATCAACGTA AAGGCGATAT CTCTATCACG
CTGGTGAACA GTGACGATCG TGACGCGTTT GTCGCCCAGG AAGCCGCTCG CTGCCTCGAA
TGTAACTACG TTTGCAGCAA GTGTGTGGAT GTCTGCCCGA ACCGCGCCAA CGTCTCCATT
GCGGTCCCAG GCTTCCAGAA CCGTTTCCAG ACGCTGCACC TCGACGCTTA CTGTAACGAA
TGCGGCAACT GCGCTCAGTT CTGTCCGTGG AACGGTAAAC CGTACAAAGA CAAAATCACC
GTCTTCAGCC TGGCGCAAGA CTTTGATAAC AGCAGCAACC CAGGCTTCCT TGTGGAAGAT
TGCCGGGTAC GAGTACGTCT GAATAACCAA AGCTGGGTGT TAAACATCGA CAGCAAAGGT
CAGTTTAACA ACGTACCACC GGAGCTGAAC GATATGTGCC GCATCATCAG CCATGTCCAC
CAGCATCATC ATTATCTGCT GGGCCGCGTG GAGGTGTAA
 
Protein sequence
MGDIMRPIPF EELLTRIFDE YQQQRSIFGI PEQQFYSPVK GKTVSVFGET CATPVGPAAG 
PHTQLAQNIV TSWLTGGRFI ELKTVQILDR LELEKPCIDA EDECFNTEWS TEFTLLKAWD
EYLKAWFALH LLEAMFQPSD SGKSFIFNMS VGYNLEGIKQ PPMQQFIDNM MNASDHPKFA
QYRDTLNKLL QDDAFLARHG LQEKRESLQA LPARIPTSMV HGVTLSTMHG CPPHEIEAIC
RYMLEEKGLN TFVKLNPTLL GYARVREILD VCGFGYIGLK EESFDHDLKL TQALEMLERL
MALAKEKSLG FGVKLTNTLG TINNKGALPG EEMYMSGRAL FPLSINVAAV LSRAFDGKLP
ISYSGGASQL TIRDIFDTGI RPITMATDLL KPGGYLRLSA CMRELEGSDA WGLDHVDVER
LNRLAADALT MEYTQKHWKP EERIEVAEDL PLTDCYVAPC VTACAIKQDI PEYIRLLGEH
RYADALELIY QRNALPAITG HICDHQCQYN CTRLDYDSAL NIRELKKVAL EKGWDEYKQR
WHKPAGSGSR HPVAVIGAGP AGLAAGYFLA RAGHPVTLFE REANAGGVVK NIIPQFRIPA
ELIQHDIDFV AAHGVKFEYG CSPDLTIEQL KNQGFHYVLI ATGTDKNSGV KLAGDNQNVW
KSLPFLREYN KGTALKLGKH VVVVGAGNTA MDCARAALRV PGVEKATIVY RRSLQEMPAW
REEYEEALHD GVEFRFLNNP ERFDADGTLT LRVMSLGEPD EKGRRRPVET NETVTLLVDS
LITAIGEQQD TEALNAMGVP LDKNGWPDVD HNGETRLTDV FMIGDVQRGP SSIVAAVGTA
RRATDAILSR ENIRSHQNDK YWNNVNPVEI YQRKGDISIT LVNSDDRDAF VAQEAARCLE
CNYVCSKCVD VCPNRANVSI AVPGFQNRFQ TLHLDAYCNE CGNCAQFCPW NGKPYKDKIT
VFSLAQDFDN SSNPGFLVED CRVRVRLNNQ SWVLNIDSKG QFNNVPPELN DMCRIISHVH
QHHHYLLGRV EV