Gene EcDH1_0814 details


Gene Information

Locus tag: EcDH1_0814
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 864605
End bp: 867703
Gene Length: 3099 bp
Protein Length: 1032 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: selenate reductase YgfK
Protein accession: ACX38498
Protein GI: 260448076
COG category:
COG ID:
TIGRFAM ID:
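
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the untranslated stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 864605
end_bp = 867703
gene_length_bp = 3099
protein_length_aa = 1032

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks print `True`: 867703 - 864605 + 1 = 3099 bp, and 3099 / 3 = 1033 codons, one of which is the stop codon.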


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGGATA TTATGCGTCC CATTCCGTTT GAGGAACTTT TGACGCGCAT ATTTGATGAA 
TACCAACAAC AACGCTCAAT CTTTGGTATT CCCGAGCAAC AGTTTTACTC ACCTGTAAAA
GGTAAAACTG TTAGCGTCTT CGGTGAAACC TGTGCCACTC CCGTCGGCCC TGCCGCTGGC
CCGCACACGC AGCTCGCGCA AAATATTGTC ACTTCCTGGC TGACTGGCGG ACGCTTCATC
GAACTAAAAA CCGTCCAAAT TCTTGACCGC CTGGAGCTGG AAAAGCCCTG TATCGATGCC
GAAGACGAGT GCTTTAACAC CGAATGGTCT ACCGAGTTTA CCCTGCTTAA AGCCTGGGAT
GAATACCTCA AAGCCTGGTT TGCCCTGCAC CTTCTCGAAG CGATGTTCCA GCCTTCTGAT
TCCGGTAAAT CGTTCATCTT TAATATGAGC GTCGGTTACA ACCTCGAAGG TATTAAGCAA
CCGCCGATGC AACAGTTCAT CGACAATATG ATGGACGCAT CTGACCATCC GAAATTCGCT
CAATATCGCG ATACGCTGAA TAAATTACTC CAGGATGACG CATTTTTAGC TCGCCACGGA
TTGCAGGAAA AACGCGAAAG CTTGCAAGCC TTACCCGCTC GCATCCCCAC CAGTATGGTG
CATGGCGTCA CCCTCTCCAC CATGCACGGC TGTCCTCCGC ATGAAATCGA AGCCATTTGC
CGCTACATGC TGGAAGAAAA AGGGCTCAAC ACCTTTGTGA AACTTAACCC GACCTTACTG
GGGTACGCGC GTGTTCGTGA GATCCTCGAT GTCTGCGGTT TCGGTTACAT AGGCTTAAAA
GAAGAGTCAT TTGATCACGA CCTCAAGCTG ACGCAAGCAC TGGAAATGCT GGAACGCCTG
ATGGCACTGG CAAAAGAAAA ATCACTCGGC TTTGGCGTAA AACTGACTAA CACTCTCGGC
ACCATCAACA ATAAAGGCGC ACTGCCTGGT GAAGAGATGT ATATGTCAGG CCGTGCGCTG
TTCCCGCTCT CCATCAATGT TGCAGCAGTT CTCTCTCGCG CCTTTGACGG CAAACTGCCC
ATTTCTTATT CCGGTGGTGC CAGTCAGCTG ACTATCCGCG ATATTTTTGA TACAGGTATT
CGCCCTATTA CTATGGCAAC CGACCTGCTG AAACCTGGCG GCTATCTGCG CTTAAGTGCC
TGCATGCGCG AGCTGGAAGG CTCCGACGCC TGGGGACTTG ACCATGTTGA CGTCGAACGA
CTGAACAGAC TGGCAGCAGA TGCGTTAACC ATGGAATACA CCCAGAAACA CTGGAAGCCA
GAAGAGCGTA TTGAAGTGGC AGAAGACCTG CCGCTGACCG ACTGCTACGT TGCCCCCTGT
GTTACTGCCT GCGCTATCAA GCAAGATATT CCGGAATACA TCCGTCTGCT TGGCGAACAC
CGCTATGCCG ACGCGCTGGA ACTCATCTAC CAACGCAACG CTCTGCCCGC CATTACCGGT
CATATTTGCG ATCACCAGTG CCAATACAAC TGTACCCGCC TGGATTACGA CAGTGCGCTG
AATATCCGCG AACTGAAAAA AGTCGCGCTG GAAAAAGGTT GGGATGAATA TAAGCAACGC
TGGCACAAAC CAGCCGGTTC TGGTTCACGC CATCCGGTTG CCGTGATTGG TGCAGGTCCG
GCGGGTCTGG CAGCAGGTTA CTTCCTTGCC AGAGCGGGCC ATCCGGTTAC GCTGTTTGAA
CGCGAAGCCA ATGCGGGCGG CGTGGTGAAA AATATCATTC CTCAGTTCCG TATTCCTGCA
GAGTTAATTC AGCACGATAT CGATTTTGTT GCCGCTCACG GCGTGAAATT TGAGTATGGC
TGCTCACCCG ATTTAACCAT TGAGCAGTTA AAAAATCAGG GCTTCCACTA TGTTCTGATT
GCCACCGGCA CTGATAAAAA TAGCGGTGTG AAACTGGCGG GCGACAACCA AAATGTCTGG
AAATCACTCC CCTTCCTGCG TGAATACAAC AAGGGTACAG CGCTCAAGCT GGGCAAACAT
GTGGTCGTTG TCGGGGCGGG TAACACCGCA ATGGACTGCG CTCGTGCGGC GTTACGCGTT
CCAGGCGTAG AAAAAGCAAC GATCGTTTAC CGTCGTTCAC TACAAGAGAT GCCCGCATGG
CGCGAAGAGT ATGAAGAAGC GTTGCACGAC GGCGTAGAGT TCCGTTTCCT GAATAATCCG
GAACGTTTCG ATGCTGATGG CACCTTAACC TTGCGCGTTA TGTCGCTTGG CGAACCGGAT
GAGAAAGGTC GTCGTCGTCC GGTTGAAACC AATGAAACAG TAACACTGCT TGTAGACAGC
CTGATCACCG CCATTGGTGA ACAGCAGGAT ACTGAAGCCC TGAATGCGAT GGGCGTGCCG
CTGGACAAAA ACGGCTGGCC AGACGTCGAC CATAATGGCG AAACTCGTCT GACTGACGTC
TTTATGATCG GCGACGTACA GCGCGGACCA TCCTCCATTG TCGCTGCTGT CGGAACCGCG
CGTCGGGCGA CCGATGCCAT CCTTAGTCGG GAAAATATCC GTTCCCACCA GAACGATAAA
TACTGGAACA ACGTCAATCC AGCGGAAATC TATCAACGTA AAGGCGATAT CTCTATCACG
CTGGTGAACA GTGACGATCG TGACGCGTTT GTCGCCCAGG AAGCCGCTCG CTGCCTCGAA
TGTAACTACG TTTGCAGCAA GTGTGTGGAT GTCTGCCCGA ACCGCGCCAA CGTCTCCATT
GCGGTCCCAG GCTTCCAGAA CCGTTTCCAG ACGCTGCACC TCGACGCTTA CTGTAACGAA
TGCGGCAACT GCGCTCAGTT CTGTCCGTGG AACGGTAAAC CGTACAAAGA CAAAATCACC
GTCTTCAGCC TGGCGCAAGA CTTTGATAAC AGCAGCAACC CAGGCTTCCT TGTGGAAGAT
TGCCGGGTAC GAGTACGTCT GAATAACCAA AGCTGGGTGT TAAACATCGA CAGCAAAGGT
CAGTTTAACA ACGTACCACC GGAGCTGAAC GATATGTGCC GCATCATCAG CCATGTCCAC
CAGCATCATC ATTATCTGCT GGGCCGCGTG GAGGTGTAA
 
Protein sequence
MGDIMRPIPF EELLTRIFDE YQQQRSIFGI PEQQFYSPVK GKTVSVFGET CATPVGPAAG 
PHTQLAQNIV TSWLTGGRFI ELKTVQILDR LELEKPCIDA EDECFNTEWS TEFTLLKAWD
EYLKAWFALH LLEAMFQPSD SGKSFIFNMS VGYNLEGIKQ PPMQQFIDNM MDASDHPKFA
QYRDTLNKLL QDDAFLARHG LQEKRESLQA LPARIPTSMV HGVTLSTMHG CPPHEIEAIC
RYMLEEKGLN TFVKLNPTLL GYARVREILD VCGFGYIGLK EESFDHDLKL TQALEMLERL
MALAKEKSLG FGVKLTNTLG TINNKGALPG EEMYMSGRAL FPLSINVAAV LSRAFDGKLP
ISYSGGASQL TIRDIFDTGI RPITMATDLL KPGGYLRLSA CMRELEGSDA WGLDHVDVER
LNRLAADALT MEYTQKHWKP EERIEVAEDL PLTDCYVAPC VTACAIKQDI PEYIRLLGEH
RYADALELIY QRNALPAITG HICDHQCQYN CTRLDYDSAL NIRELKKVAL EKGWDEYKQR
WHKPAGSGSR HPVAVIGAGP AGLAAGYFLA RAGHPVTLFE REANAGGVVK NIIPQFRIPA
ELIQHDIDFV AAHGVKFEYG CSPDLTIEQL KNQGFHYVLI ATGTDKNSGV KLAGDNQNVW
KSLPFLREYN KGTALKLGKH VVVVGAGNTA MDCARAALRV PGVEKATIVY RRSLQEMPAW
REEYEEALHD GVEFRFLNNP ERFDADGTLT LRVMSLGEPD EKGRRRPVET NETVTLLVDS
LITAIGEQQD TEALNAMGVP LDKNGWPDVD HNGETRLTDV FMIGDVQRGP SSIVAAVGTA
RRATDAILSR ENIRSHQNDK YWNNVNPAEI YQRKGDISIT LVNSDDRDAF VAQEAARCLE
CNYVCSKCVD VCPNRANVSI AVPGFQNRFQ TLHLDAYCNE CGNCAQFCPW NGKPYKDKIT
VFSLAQDFDN SSNPGFLVED CRVRVRLNNQ SWVLNIDSKG QFNNVPPELN DMCRIISHVH
QHHHYLLGRV EV
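
The gene and protein sequences are displayed in space-separated blocks of ten residues. As a rough sketch of how the two relate, the snippet below strips that formatting and translates the first and last rows of the gene sequence using the amino-acid assignments of translation table 11 (the table named in Gene Information). The helper names `clean` and `translate` are hypothetical, and only two rows are copied in to keep the example short.

```python
from itertools import product

# Amino acids for the 64 codons in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid assignments
# as the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record above.
# Every row before the last holds 60 bases, so the last row starts in frame.
first_row = "ATGGGGGATA TTATGCGTCC CATTCCGTTT GAGGAACTTT TGACGCGCAT ATTTGATGAA"
last_row = "CAGCATCATC ATTATCTGCT GGGCCGCGTG GAGGTGTAA"

print(translate(clean(first_row)))
print(translate(clean(last_row)))
```

The two printed strings, `MGDIMRPIPFEELLTRIFDE` and `QHHHYLLGRVEV`, correspond to the first 20 and last 12 residues of the protein sequence above. Pasting the full gene sequence into `clean` and `translate` would extend the same check to the whole record.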