Gene EcE24377A_3204 details

Gene Information

Locus tag: EcE24377A_3204
Symbol:
ID: 5587624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 3219522
End bp: 3222620
Gene Length: 3099 bp
Protein Length: 1032 aa
Translation table: 11
GC content: 52%
IMG OID: 640926844
Product: putative selenate reductase subunit YgfK
Protein accession: YP_001464216
Protein GI: 157156678
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases
TIGRFAM ID: [TIGR03315] putative selenate reductase, YgfK subunit
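
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3219522
END_BP = 3222620


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length(START_BP, END_BP)
print(gene_len)                  # 3099, matching "Gene Length"
print(protein_length(gene_len))  # 1032, matching "Protein Length"
```

Under these assumptions both computed values agree with the figures listed in the record.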


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGGATA TTATGCGTCC CATTCCGTTT GAGGAACTTT TGACGCGCAT ATTTGATGAA 
TACCAACAAC AACGCTCAAT CTTTGGTATT CCCGAGCAAC AGTTTTACTC ACCTGTAAAA
GGTAAAACTG TTAGCGTCTT CGGTGAAACC TGTGCCACTC CCGTCGGCCC TGCCGCTGGC
CCGCACACGC AGCTCGCGCA AAATATTGTC ACTTCCTGGC TGACTGGCGG ACGCTTCATC
GAACTAAAAA CCGTCCAAAT TCTTGACCGC CTGGAGCTGG AAAAGCCCTG TATCGATGCC
GAAGACGAGT GCTTTAACAC CGAATGGTCT ACCGAATTTA CCCTGCTTAA AGCCTGGGAT
GAATACCTCA AAGCCTGGTT TGCCCTGCAC CTTCTCGAAG CGATGTTCCA GCCTTCTGAT
TCCGGTAAAT CGTTCATCTT TAATATGAGC GTCGGTTACA ACCTCGAAGG TATTAAGCAA
CCGCCGATGC AACAGTTCAT CGACAATATG ATGGACGCAT CTGACCATCC GAAATTCGCT
CAATATCGCG ATACGCTGAA TAAATTACTC CAGGATGACG CATTTTTAGC TCGCCACGGA
TTGCAGGAAA AACGCGAAAG CTTGCAAGCC TTACCCGCTC GCATCCCCAC CAGTATGGTG
CATGGCGTCA CCCTCTCCAC CATGCACGGC TGTCCTCCGC ATGAAATCGA AGCCATTTGC
CGCTACATGC TGGAAGAAAA AGGGCTCAAC ACCTTTGTGA AACTTAACCC GACCTTACTG
GGATACGCGC GTGTTCGTGA GATCCTCGAT GTCTGCGGTT TCGGTTACAT AGGCTTAAAA
GAAGAGTCAT TTGATCACGA CCTCAAGCTG ACGCAAGCAC TGGAAATGCT GGAACGCCTG
ATGGCACTGG CAAAAGAAAA ATCACTCGGC TTTGGCGTAA AACTGACTAA CACTCTCGGC
ACCATCAACA ATAAAGGCGC ACTGCCTGGT GAAGAGATGT ATATGTCAGG CCGTGCGCTG
TTCCCGCTCT CCATCAATGT TGCAGCAGTT CTCTCTCGCG CCTTTGACGG CAAACTGCCC
ATTTCTTATT CCGGTGGTGC CAGTCAGCTG ACTATCCGCG ATATTTTTGA TACTGGTATT
CGCCCTATTA CTATGGCAAC CGACCTGCTG AAACCAGGCG GCTATCTGCG CTTAAGTGCC
TGCATGCGCG AGCTGGAAGG CTCCGACGCC TGGGGGCTTG ACCATGTTGA CGTCGAACGA
CTGAACAGAC TGGCAGCAGA TGCGTTAACC ATGGAATACA CCCAGAAACA CTGGAAGCCA
GAAGAGCGTA TTGAAGTGGC AGAAGACCTG CCGCTGACCG ATTGCTACGT TGCCCCCTGT
GTTACTGCCT GTGCTATCAA GCAAGATATT CCGGAATACA TCCGTCTGCT TGGCGAACAC
CGCTATGCCG ACGCGCTGGA ACTCATCTAC CAACGCAACG CTCTGCCCGC CATTACCGGT
CATATTTGCG ATCACCAGTG CCAATACAAC TGTACCCGCC TGGATTACGA CAGTGCGCTG
AATATCCGCG AACTGAAAAA AGTCGCGCTG GAAAAAGGTT GGGATGAATA TAAGCAACGC
TGGCACAAAC CAGCCGGTTC TGGTTCACGC CATCCAGTTG CCGTGATTGG TGCAGGTCCG
GCGGGTCTGG CAGCAGGTTA CTTCCTTGCC AGAGCGGGCC ATCCGGTTAC GCTGTTTGAA
CGCGAAGCCA ATGCTGGCGG CGTGGTGAAA AATATCATTC CTCAGTTCCG TATTCCTGCA
GAGTTAATTC AGCACGATAT CGATTTTGTT GCCGCTCACG GCGTGAAATT TGAGTATGGC
TGCTCACCCG ATTTGACCGT TGAACAGTTA AAAAATCAGG GCTTCCACTA TGTTCTGATT
GCCACCGGCA CTGATAAAAA TAGCGGTGTG AAACTGGCGG GCGACAACCA AAATGTCTGG
AAATCACTCC CCTTCCTGCG TGAATACAAC AAGGGTACAG CGCTCAAGCT GGGCAAACAT
GTGGTCGTTG TCGGGGCGGG TAACACCGCA ATGGACTGCG CTCGTGCGGC GTTACGCGTT
CCAGGCGTAG AAAAAGCAAC GATCGTTTAC CGTCGTTCAC TGCAAGAGAT GCCCGCATGG
CGCGAAGAGT ATGAAGAAGC GTTGCACGAC GGCGTGGAGT TCCGTTTCCT GAATAATCCG
GAATGTTTCG ATGCTGATGG CACCTTAACC TTGCGCGTTA TGTCGCTTGG CGAACCCGAT
GAGAAAGGTC GTCGCCGTCC GGTTGAAACC AACGAAACAG TAACACTGCA TGTAGACAGT
CTGATCACCG CCATCGGTGA ACAGCAGGAT ACTGAAGCCC TGAATGCGAT GGGCGTACCG
CTGGACAAAA ACGGCTGGCC AGACGTCGAC CATAATGGCG AAACTCGTCT GACTGACGTC
TTTATGATCG GCGACGTACA GCGCGGACCA TCCTCCATTG TCGCTGCTGT CGGAACCGCG
CGTCGGGCGA CCGATGCCAT CCTTAGTCGG GAAAATATCC GTTCCCACCA GAACAATAAA
TACTGGAACA ACGTCAATCC GGCGGAAATC TATCAACGTA AAGGCGATAT CTCTATCACG
CTGGTGAACA GTGACGATCG TGATGCGTTT GTCGCGCAGG AAGCCGCTCG CTGCCTCGAA
TGTAACTACG TTTGCAGCAA GTGTGTGGAT GTCTGCCCGA ACCGCGCCAA CGTCTCCATT
GCGGTCCCAG GCTTCCAGAA CCGTTTCCAG ACGCTGCACC TCGACGCTTA CTGTAACGAA
TGCGGCAACT GCGCTCAGTT CTGTCCGTGG AACGGTAAAC CGTACAAAGA CAAAATCACC
GTCTTCAGCC TGTCGCAAGA CTTTGATAAC AGCAGCAACC CAGGCTTCCT GGTGGAAGAT
TGTCGGGTAC GCGTACGTCT GAATAACCAA AGCTGGGTGC TGAACATCGA CAGCGAAGGC
CAGTTCGACA ACGTACCACC GGAGCTGAAC GATATGTGCC GCATCATCAG CCATGTCCAC
CAGCATCATC ATTATCTGCT GGGCCGCGTG GAGGTGTAA
 
Protein sequence
MGDIMRPIPF EELLTRIFDE YQQQRSIFGI PEQQFYSPVK GKTVSVFGET CATPVGPAAG 
PHTQLAQNIV TSWLTGGRFI ELKTVQILDR LELEKPCIDA EDECFNTEWS TEFTLLKAWD
EYLKAWFALH LLEAMFQPSD SGKSFIFNMS VGYNLEGIKQ PPMQQFIDNM MDASDHPKFA
QYRDTLNKLL QDDAFLARHG LQEKRESLQA LPARIPTSMV HGVTLSTMHG CPPHEIEAIC
RYMLEEKGLN TFVKLNPTLL GYARVREILD VCGFGYIGLK EESFDHDLKL TQALEMLERL
MALAKEKSLG FGVKLTNTLG TINNKGALPG EEMYMSGRAL FPLSINVAAV LSRAFDGKLP
ISYSGGASQL TIRDIFDTGI RPITMATDLL KPGGYLRLSA CMRELEGSDA WGLDHVDVER
LNRLAADALT MEYTQKHWKP EERIEVAEDL PLTDCYVAPC VTACAIKQDI PEYIRLLGEH
RYADALELIY QRNALPAITG HICDHQCQYN CTRLDYDSAL NIRELKKVAL EKGWDEYKQR
WHKPAGSGSR HPVAVIGAGP AGLAAGYFLA RAGHPVTLFE REANAGGVVK NIIPQFRIPA
ELIQHDIDFV AAHGVKFEYG CSPDLTVEQL KNQGFHYVLI ATGTDKNSGV KLAGDNQNVW
KSLPFLREYN KGTALKLGKH VVVVGAGNTA MDCARAALRV PGVEKATIVY RRSLQEMPAW
REEYEEALHD GVEFRFLNNP ECFDADGTLT LRVMSLGEPD EKGRRRPVET NETVTLHVDS
LITAIGEQQD TEALNAMGVP LDKNGWPDVD HNGETRLTDV FMIGDVQRGP SSIVAAVGTA
RRATDAILSR ENIRSHQNNK YWNNVNPAEI YQRKGDISIT LVNSDDRDAF VAQEAARCLE
CNYVCSKCVD VCPNRANVSI AVPGFQNRFQ TLHLDAYCNE CGNCAQFCPW NGKPYKDKIT
VFSLSQDFDN SSNPGFLVED CRVRVRLNNQ SWVLNIDSEG QFDNVPPELN DMCRIISHVH
QHHHYLLGRV EV
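
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, not the database's own method, of how the gene sequence could be cleaned, translated, and checked for GC content. It assumes the sequence is given in the coding orientation. The codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above.
first_line = "ATGGGGGATA TTATGCGTCC CATTCCGTTT GAGGAACTTT TGACGCGCAT ATTTGATGAA"
print(translate(first_line))  # MGDIMRPIPFEELLTRIFDE
```

The printed residues match the first twenty residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the listed GC content.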