Gene ECH74115_2114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2114 
Symbol 
ID6971824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2020053 
End bp2022332 
Gene Length2280 bp 
Protein Length759 aa 
Translation table11 
GC content50% 
IMG OID643386012 
Productputative oxidoreductase 
Protein accessionYP_002270501 
Protein GI209400692 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01701] oxidoreductase alpha (molybdopterin) subunit 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.131651 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAA AAATTGAATC CTACCAGGGT GCTGCCGGTG GTTGGGGTGC TGTTAAATCC 
GTAGCGAATG CAGTACGTAA GCAGATGGAT ATACGCCAGG ATGTTATTGC CATGTTTGAC
ATGAATAAGC CAGAGGGCTT TGACTGTCCG GGTTGTGCAT GGCCAGATCC TAAGCACAGT
GCGTCATTCG ACATTTGTGA AAACGGCGCA AAAGCAATCG CCTGGGAAGT CACGGATAAG
CAGGTAAATG CCTCTTTCTT TGCTCAGAAT ACGGTTCAAT CATTACTTAC CTGGGGAGAT
CACGAGCTTG AGGCTGCGGG ACGTCTCACT CAGCCTTTGA AATATGATGC TGTCAGCGAC
TGTTACAAGC CATTAAGCTG GCAACAAGCT TTCGACGAAA TTGGCGCACG CCTTCAAAGC
TATAGTGATC CCAATCAGGT TGAATTCTAT ACTTCGGGCC GCACTTCCAA TGAAGCTGCC
TTTCTTTATC AGCTTTTTGC CCGTGAATAC GGGAGCAATA ACTTTCCCGA CTGCTCCAAC
ATGTGCCATG AACCGACAAG CGTGGGTTTG GCAGCGAGTA TCGGTGTAGG TAAAGGGACC
GTGTTGCTGG AAGACTTTGA GAAGTGCGAT TTAGTCATTT GCATTGGGCA TAACCCTGGT
ACAAACCACC CTCGCATGCT GACTTCGTTG CGCGCTTTAG TGAAACGGGG AGCGAAAATG
ATCGCCATCA ATCCTCTACA GGAACGTGGC CTGGAGCGAT TTACCGCACC GCAAAACCCG
TTTGAAATGC TGACGAACTC TGAGACTCAG TTGGCCAGTG CCTACTATAA CGTGCGCATT
GGTGGTGATA TGGCGTTGCT CAAGGGGATG ATGCGCCTGT TAATTGAGCG CGATGATGCT
GCAAGCGCCG CAGGTCGGCC CTCATTGCTG GATGACGAAT TTATTCAAAC GCATACCGTC
GGCTTTGACG AGCTACGCCG TGACGTGCTC AATTCCGAGT GGAAAGATAT CGAACGTATT
TCTGGACTAA GTCAGACACA AATCGCCGAA CTGGCTGATG CATATGCCGC TGCCGAACGA
ACCATTATCT GTTACGGAAT GGGGATCACT CAGCACGAAC ATGGTACCCA GAATGTACAG
CAACTGGTCA ATCTGCTGTT GATGAAAGGT AACATTGGCA AGCCTGGTGC GGGTATCTGC
CCACTACGTG GCCACTCTAA TGTACAGGGC GACCGAACCG TCGGTATCAC CGAGAAACCG
TCTGTAGAGT TTCTGGCTCG CCTGGGTGAG CGCTATGGCT TCACCCCACC TCATGTACCT
GGACATGCTG CAATTGCCAG CATGCAAGCA ATATGTACGG GGCAGGCTCG AGCATTGATC
TGCATGGGGG GCAACTTTGC ACTGGCAATG CCAGATCGGG AAGCGAGCGC TGTACCGTTA
ACGCAATTAG ATTTGGCGGT ACACGTAGCC ACTAAGCTTA ACCGCTCTCA TCTGCTGACC
GCACGGCATA GCTATATTCT GCCGGTTCTG GGACGTAGCG AGATTGACAT GCAAAAAAGC
GGTGCGCAGG CGGTAACCGT TGAGGATTCA ATGTCGATGA TTCATGCCTC GCGTGGCGTG
TTAAAACCCG CCGGTGTAAT GCTGAAATCA GAGTGTGCTG TGGTCGCGGG AATCGCGCAG
GCAACACTAC CCCAGAGCGT GGTAGCCTGG GAGTATCTGG TGGAAGATTA TGATCGCATT
CGCAATGACA TTGAAGCTGT GCTGCCAGAG TTCGCCGACT ATAACCAACG CATCCGTCAT
CCCGGTGGTT TTCACCTGAT AAATGCAGCT GCTGAAAGGC GCTGGATGAC GCCGTCAGGT
AAGGCTAATT TCATTACCAG CAAAGGGCTG TTAGAAGATC CCTCTTCAGC GTTTAACAGT
AAGCTGGTCA TGGCGACAGT ACGCAGCCAC GATCAGTACA ACACGACGAT TTATGGTATG
GATGATCGCT ATCGAGGGGT ATTCGGTCAA CGAGATGTGG TCTTTATGAG TGCTAAACAA
GCTAAAATTT GCCGTGTAAA AAACGGCGAA AGAGTTAATC TTATTGCGCT TACGCCAGAC
GGTAAGCGCA GCTCACGCCG CATGGATAGA TTAAAAGTGG TCATTTACCC TATGGCTGAC
CGCTCACTGG TGACCTATTT TCCAGAATCG AATCACATGC TAACACTTGA TAACCACGAT
CCATTAAGTG GCATTCCTGG CTATAAAAGT ATTCCGGTTG AATTAGAACC ATCAAATTAA
 
Protein sequence
MKKKIESYQG AAGGWGAVKS VANAVRKQMD IRQDVIAMFD MNKPEGFDCP GCAWPDPKHS 
ASFDICENGA KAIAWEVTDK QVNASFFAQN TVQSLLTWGD HELEAAGRLT QPLKYDAVSD
CYKPLSWQQA FDEIGARLQS YSDPNQVEFY TSGRTSNEAA FLYQLFAREY GSNNFPDCSN
MCHEPTSVGL AASIGVGKGT VLLEDFEKCD LVICIGHNPG TNHPRMLTSL RALVKRGAKM
IAINPLQERG LERFTAPQNP FEMLTNSETQ LASAYYNVRI GGDMALLKGM MRLLIERDDA
ASAAGRPSLL DDEFIQTHTV GFDELRRDVL NSEWKDIERI SGLSQTQIAE LADAYAAAER
TIICYGMGIT QHEHGTQNVQ QLVNLLLMKG NIGKPGAGIC PLRGHSNVQG DRTVGITEKP
SVEFLARLGE RYGFTPPHVP GHAAIASMQA ICTGQARALI CMGGNFALAM PDREASAVPL
TQLDLAVHVA TKLNRSHLLT ARHSYILPVL GRSEIDMQKS GAQAVTVEDS MSMIHASRGV
LKPAGVMLKS ECAVVAGIAQ ATLPQSVVAW EYLVEDYDRI RNDIEAVLPE FADYNQRIRH
PGGFHLINAA AERRWMTPSG KANFITSKGL LEDPSSAFNS KLVMATVRSH DQYNTTIYGM
DDRYRGVFGQ RDVVFMSAKQ AKICRVKNGE RVNLIALTPD GKRSSRRMDR LKVVIYPMAD
RSLVTYFPES NHMLTLDNHD PLSGIPGYKS IPVELEPSN