Gene Information |
Locus tag | EcE24377A_3204 |
Symbol | |
ID | 5587624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 3219522 |
End bp | 3222620 |
Gene Length | 3099 bp |
Protein Length | 1032 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640926844 |
Product | putative selenate reductase subunit YgfK |
Protein accession | YP_001464216 |
Protein GI | 157156678 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases |
TIGRFAM ID | [TIGR03315] putative selenate reductase, YgfK subunit |
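The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


length = gene_length_bp(3219522, 3222620)
print(length)                     # 3099
print(protein_length_aa(length))  # 1032
```

Both values match the Gene Length (3099 bp) and Protein Length (1032 aa) fields of this record.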
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGGGGATA TTATGCGTCC CATTCCGTTT GAGGAACTTT TGACGCGCAT ATTTGATGAA TACCAACAAC AACGCTCAAT CTTTGGTATT CCCGAGCAAC AGTTTTACTC ACCTGTAAAA GGTAAAACTG TTAGCGTCTT CGGTGAAACC TGTGCCACTC CCGTCGGCCC TGCCGCTGGC CCGCACACGC AGCTCGCGCA AAATATTGTC ACTTCCTGGC TGACTGGCGG ACGCTTCATC GAACTAAAAA CCGTCCAAAT TCTTGACCGC CTGGAGCTGG AAAAGCCCTG TATCGATGCC GAAGACGAGT GCTTTAACAC CGAATGGTCT ACCGAATTTA CCCTGCTTAA AGCCTGGGAT GAATACCTCA AAGCCTGGTT TGCCCTGCAC CTTCTCGAAG CGATGTTCCA GCCTTCTGAT TCCGGTAAAT CGTTCATCTT TAATATGAGC GTCGGTTACA ACCTCGAAGG TATTAAGCAA CCGCCGATGC AACAGTTCAT CGACAATATG ATGGACGCAT CTGACCATCC GAAATTCGCT CAATATCGCG ATACGCTGAA TAAATTACTC CAGGATGACG CATTTTTAGC TCGCCACGGA TTGCAGGAAA AACGCGAAAG CTTGCAAGCC TTACCCGCTC GCATCCCCAC CAGTATGGTG CATGGCGTCA CCCTCTCCAC CATGCACGGC TGTCCTCCGC ATGAAATCGA AGCCATTTGC CGCTACATGC TGGAAGAAAA AGGGCTCAAC ACCTTTGTGA AACTTAACCC GACCTTACTG GGATACGCGC GTGTTCGTGA GATCCTCGAT GTCTGCGGTT TCGGTTACAT AGGCTTAAAA GAAGAGTCAT TTGATCACGA CCTCAAGCTG ACGCAAGCAC TGGAAATGCT GGAACGCCTG ATGGCACTGG CAAAAGAAAA ATCACTCGGC TTTGGCGTAA AACTGACTAA CACTCTCGGC ACCATCAACA ATAAAGGCGC ACTGCCTGGT GAAGAGATGT ATATGTCAGG CCGTGCGCTG TTCCCGCTCT CCATCAATGT TGCAGCAGTT CTCTCTCGCG CCTTTGACGG CAAACTGCCC ATTTCTTATT CCGGTGGTGC CAGTCAGCTG ACTATCCGCG ATATTTTTGA TACTGGTATT CGCCCTATTA CTATGGCAAC CGACCTGCTG AAACCAGGCG GCTATCTGCG CTTAAGTGCC TGCATGCGCG AGCTGGAAGG CTCCGACGCC TGGGGGCTTG ACCATGTTGA CGTCGAACGA CTGAACAGAC TGGCAGCAGA TGCGTTAACC ATGGAATACA CCCAGAAACA CTGGAAGCCA GAAGAGCGTA TTGAAGTGGC AGAAGACCTG CCGCTGACCG ATTGCTACGT TGCCCCCTGT GTTACTGCCT GTGCTATCAA GCAAGATATT CCGGAATACA TCCGTCTGCT TGGCGAACAC CGCTATGCCG ACGCGCTGGA ACTCATCTAC CAACGCAACG CTCTGCCCGC CATTACCGGT CATATTTGCG ATCACCAGTG CCAATACAAC TGTACCCGCC TGGATTACGA CAGTGCGCTG AATATCCGCG AACTGAAAAA AGTCGCGCTG GAAAAAGGTT GGGATGAATA TAAGCAACGC TGGCACAAAC CAGCCGGTTC TGGTTCACGC CATCCAGTTG CCGTGATTGG TGCAGGTCCG GCGGGTCTGG CAGCAGGTTA CTTCCTTGCC AGAGCGGGCC ATCCGGTTAC GCTGTTTGAA CGCGAAGCCA ATGCTGGCGG CGTGGTGAAA AATATCATTC CTCAGTTCCG TATTCCTGCA 
GAGTTAATTC AGCACGATAT CGATTTTGTT GCCGCTCACG GCGTGAAATT TGAGTATGGC TGCTCACCCG ATTTGACCGT TGAACAGTTA AAAAATCAGG GCTTCCACTA TGTTCTGATT GCCACCGGCA CTGATAAAAA TAGCGGTGTG AAACTGGCGG GCGACAACCA AAATGTCTGG AAATCACTCC CCTTCCTGCG TGAATACAAC AAGGGTACAG CGCTCAAGCT GGGCAAACAT GTGGTCGTTG TCGGGGCGGG TAACACCGCA ATGGACTGCG CTCGTGCGGC GTTACGCGTT CCAGGCGTAG AAAAAGCAAC GATCGTTTAC CGTCGTTCAC TGCAAGAGAT GCCCGCATGG CGCGAAGAGT ATGAAGAAGC GTTGCACGAC GGCGTGGAGT TCCGTTTCCT GAATAATCCG GAATGTTTCG ATGCTGATGG CACCTTAACC TTGCGCGTTA TGTCGCTTGG CGAACCCGAT GAGAAAGGTC GTCGCCGTCC GGTTGAAACC AACGAAACAG TAACACTGCA TGTAGACAGT CTGATCACCG CCATCGGTGA ACAGCAGGAT ACTGAAGCCC TGAATGCGAT GGGCGTACCG CTGGACAAAA ACGGCTGGCC AGACGTCGAC CATAATGGCG AAACTCGTCT GACTGACGTC TTTATGATCG GCGACGTACA GCGCGGACCA TCCTCCATTG TCGCTGCTGT CGGAACCGCG CGTCGGGCGA CCGATGCCAT CCTTAGTCGG GAAAATATCC GTTCCCACCA GAACAATAAA TACTGGAACA ACGTCAATCC GGCGGAAATC TATCAACGTA AAGGCGATAT CTCTATCACG CTGGTGAACA GTGACGATCG TGATGCGTTT GTCGCGCAGG AAGCCGCTCG CTGCCTCGAA TGTAACTACG TTTGCAGCAA GTGTGTGGAT GTCTGCCCGA ACCGCGCCAA CGTCTCCATT GCGGTCCCAG GCTTCCAGAA CCGTTTCCAG ACGCTGCACC TCGACGCTTA CTGTAACGAA TGCGGCAACT GCGCTCAGTT CTGTCCGTGG AACGGTAAAC CGTACAAAGA CAAAATCACC GTCTTCAGCC TGTCGCAAGA CTTTGATAAC AGCAGCAACC CAGGCTTCCT GGTGGAAGAT TGTCGGGTAC GCGTACGTCT GAATAACCAA AGCTGGGTGC TGAACATCGA CAGCGAAGGC CAGTTCGACA ACGTACCACC GGAGCTGAAC GATATGTGCC GCATCATCAG CCATGTCCAC CAGCATCATC ATTATCTGCT GGGCCGCGTG GAGGTGTAA
Protein sequence | MGDIMRPIPF EELLTRIFDE YQQQRSIFGI PEQQFYSPVK GKTVSVFGET CATPVGPAAG PHTQLAQNIV TSWLTGGRFI ELKTVQILDR LELEKPCIDA EDECFNTEWS TEFTLLKAWD EYLKAWFALH LLEAMFQPSD SGKSFIFNMS VGYNLEGIKQ PPMQQFIDNM MDASDHPKFA QYRDTLNKLL QDDAFLARHG LQEKRESLQA LPARIPTSMV HGVTLSTMHG CPPHEIEAIC RYMLEEKGLN TFVKLNPTLL GYARVREILD VCGFGYIGLK EESFDHDLKL TQALEMLERL MALAKEKSLG FGVKLTNTLG TINNKGALPG EEMYMSGRAL FPLSINVAAV LSRAFDGKLP ISYSGGASQL TIRDIFDTGI RPITMATDLL KPGGYLRLSA CMRELEGSDA WGLDHVDVER LNRLAADALT MEYTQKHWKP EERIEVAEDL PLTDCYVAPC VTACAIKQDI PEYIRLLGEH RYADALELIY QRNALPAITG HICDHQCQYN CTRLDYDSAL NIRELKKVAL EKGWDEYKQR WHKPAGSGSR HPVAVIGAGP AGLAAGYFLA RAGHPVTLFE REANAGGVVK NIIPQFRIPA ELIQHDIDFV AAHGVKFEYG CSPDLTVEQL KNQGFHYVLI ATGTDKNSGV KLAGDNQNVW KSLPFLREYN KGTALKLGKH VVVVGAGNTA MDCARAALRV PGVEKATIVY RRSLQEMPAW REEYEEALHD GVEFRFLNNP ECFDADGTLT LRVMSLGEPD EKGRRRPVET NETVTLHVDS LITAIGEQQD TEALNAMGVP LDKNGWPDVD HNGETRLTDV FMIGDVQRGP SSIVAAVGTA RRATDAILSR ENIRSHQNNK YWNNVNPAEI YQRKGDISIT LVNSDDRDAF VAQEAARCLE CNYVCSKCVD VCPNRANVSI AVPGFQNRFQ TLHLDAYCNE CGNCAQFCPW NGKPYKDKIT VFSLSQDFDN SSNPGFLVED CRVRVRLNNQ SWVLNIDSEG QFDNVPPELN DMCRIISHVH QHHHYLLGRV EV
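The relationship between the gene sequence and the protein sequence can be spot-checked with a minimal sketch. It assumes the sequences are pasted in as shown, in space-separated blocks of ten, and it uses the amino acid assignments of translation table 11, which are the same as those of the standard code. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the sketch does not handle ambiguous bases.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence in this record.
print(translate("ATGGGGGATA TTATGCGTCC CATTCCGTTT"))  # MGDIMRPIPF
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.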