Gene ECH74115_4171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4171 
Symbol 
ID6968193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3863863 
End bp3866733 
Gene Length2871 bp 
Protein Length956 aa 
Translation table11 
GC content53% 
IMG OID643387917 
Productputative selenate reductase subunit YgfN 
Protein accessionYP_002272356 
Protein GI209399134 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
[COG2080] Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs 
TIGRFAM ID[TIGR03313] probable selenate reductase, molybdenum-binding subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCATCC ACTTTACTTT AAATGGCGCG CCTCAGGAGC TAACCGTTAA TCCAGGCGAA 
AACGTGCAAA AGCTGTTGTT TAACATGGGA ATGCACTCTG TACGCAACAG TGATGATGGT
TTCGGGTTTG CCGGTTCTGA CGCAATCATC TTTAACGGTA ATATCGTTAA CGCGTCCTTG
CTTATTGCCG CACAGTTAGA GAAGGCAGAT ATTCGTACCG CAGAATCTCT GGGCAAATGG
AACGAGTTAA GTCTGGTTCA ACAGGCAATG GTTGATGTTG GCGTGGTGCA GTCTGGTTAT
AACGATCCAG CTGCAGCTCT GATTATCACC GATCTTCTCG ATCGCATCGC CGCACCTACC
CGCGAAGAGA TCGACGACGC GCTTTCTGGT TTGTTCAGCC GCGATGCTGG CTGGCAGCAA
TACTATCAGG TCATTGAACT GGCGGTTGCA CGTAAAAATA ATCCGCAGGC CACCATTGAT
ATCGCTCCGA CTTTCCGTGA CGACCTTGAA GTCATTGGCA AGCATTATCC TAAAACTGAT
GCCGCGAAAA TGGTGCAGGC GAAACCCTGC TATGTTGAAG ACCGCGTAAC GGCTGACGCC
TGCGTCATTA AAATGTTACG TAGCCCACAC GCTCACGCAC TGATTACTCA TCTGGATGTC
AGCAAAGCTG AAGCCTTACC GGGCGTCGTT CACGTTATTA CTCACCTGAA TTGCCCTGAT
ATCTACTATA CCCCGGGGGG TCAGAGCGCA CCGGAACCGT CACCGCTTGA CCGCCGTATG
TTCGGCAAAA AAATGCGTCA CGTCGGCGAT CGCGTTGCTG CGGTGGTCGC TGAAAGTGAA
GAAATTGCGC TCGAAGCATT GAAACTCATC GAGGTTGAAT ATGAAGTGCT TAAGCCGGTA
ATGTCGATCG ACGAAGCAAT GGCGGAAGAT GCGCCTGTCG TGCACGATGA ACCGGTGGTG
TATGTTGCTG GTGCGCCAGA TACTCTGGAA GACGATAACA GCCATGCAGC CCAGCGCGGC
GAGCATATGA TCATCAACTT CCCGATCGGT TCTCGCCCTC GCAAAAATAT CGCCGCTAGT
ATTCATGGTC ATATTGGCGA TATGGACAAA GGCTTTGCCG ATGCCGATGT GATCATTGAG
CGAACCTATA ACTCAACGCA GGCGCAGCAG TGCCCGACTG AAACACATAT CTGCTTTACC
CGGATGGACG GCGATCGTCT GGTTATCCAC GCCTCCACCC AGGTACCATG GCACTTACGC
CGCCAGGTCG CGCGCCTCGT GGACATGAAA CAGCATAAAG TTCATGTCAT TAAAGAGCGA
GTTGGCGGCG GTTTTGGTTC CAAACAGGAC ATCCTGCTGG AAGAAGTGTG CGCCTGGGCA
ACCTGCGTGA CCGGGCGTCC GGTACTGTTC CGCTACACCC GTGAAGAAGA GTTTATTGCT
AACACCTCTC GTCACGTCGC GAAAGTCACC GTCAAACTGG GAGCGAAAAA AGATGGTCGC
CTGACGGCAG TGAAGATGGA TTTCCGCGCC AACACTGGCC CTTACGGCAA CCACTCACTC
ACCGTACCGT GTAACGGACC GGCGCTGTCG CTGCCGTTAT ATCCGTGCGA TAACGTCGAT
TTCCAGGTCA CCACCTACTA CAGCAACATT TGCCCAAATG GTGCTTATCA GGGTTATGGC
GCACCGAAAG GTAACTTCGC TATCACCATG GCATTAGCGG AACTGGCTGA ACAGTTACAG
ATCGACCAAC TGGAAATTAT CGAACGTAAC CGGGTACACG AAGGGCAAGA GCTGAAAATT
CTCGGTGCAA TCGGTGAAGG TAAAGCGCCG ACCTCCGTTC CTTCCGCCGC CAGCTGCGCA
CTGGAAGAGA TCCTGCGTCA GGGGCGCGAG ATGATCCAAT GGTCTTCACC AAAACCACAA
AATGGTGACT GGCACATCGG TCGTGGTGTC GCCATTATCA TGCAGAAATC AGGGATCCCG
GATATCGATC AGGCTAACTG CATGATCAAA CTGGAATCAG ACGGTACCTT TATCGTTCAT
TCTGGCGGTG CGGATATTGG TACTGGTCTG GATACCGTGG TGACGAAACT GGCAGCAGAA
GTGCTGCACT GCCCACCGCA GGACGTGCAT GTTATCTCCG GTGATACCGA TCATGCGTTG
TTTGATAAAG GCGCATATGC CTCGTCCGGT ACTTGCTTCT CGGGTAACGC GGCGCGTTTG
GCAGCGGAAA ATCTGCGGGA GAAAATTCTG TTCCACGGCG CGCAAATGTT GGGTGAGCCA
GTGGCAGATG TTCAACTAGC AACGCCGGGC GTCGTGCGCG GCAAGAAAGG CGAAGTTAGT
TTCGGGGATA TTGCCCATAA AGGCGAAACC GGCACCGGCT TTGGTTCACT GGTGGGAACT
GGCAGTTATA TCACGCCTGA TTTCGCCTTC CCGTATGGCG CAAACTTCGC TGAAGTTGCC
GTCAACACGC GTACGGGTGA AATCCGCCTG GATAAATTCT ACGCCTTGCT GGACTGCGGT
ACACCGGTCA ATCCAGAGTT AGCGTTGGGA CAAATCTACG GTGCTACCCT GCGAGCTATC
GGCCACAGTA TGAGCGAAGA GATCATTTAT GACGCCGAAG GTCACCCGTT AACGCGTGAT
TTACGCAGCT ACGGCGCACC GAAAATTGGT GACATTCCGC GTGATTTCCG CGCAGTGCTG
GTGCCGAGCG ACGATAAAGT CGGCCCGTTC GGGGCGAAAT CGATCTCGGA AATCGGTGTA
AATGGCGCAG CTCCGGCGAT TGCTACCGCA ATTCACGATG CATGCGGCAT CTGGTTACGC
GAATGGCATT TCACACCGGA GAAAATACTC ACCGCGCTGG AAAAGATATA A
 
Protein sequence
MIIHFTLNGA PQELTVNPGE NVQKLLFNMG MHSVRNSDDG FGFAGSDAII FNGNIVNASL 
LIAAQLEKAD IRTAESLGKW NELSLVQQAM VDVGVVQSGY NDPAAALIIT DLLDRIAAPT
REEIDDALSG LFSRDAGWQQ YYQVIELAVA RKNNPQATID IAPTFRDDLE VIGKHYPKTD
AAKMVQAKPC YVEDRVTADA CVIKMLRSPH AHALITHLDV SKAEALPGVV HVITHLNCPD
IYYTPGGQSA PEPSPLDRRM FGKKMRHVGD RVAAVVAESE EIALEALKLI EVEYEVLKPV
MSIDEAMAED APVVHDEPVV YVAGAPDTLE DDNSHAAQRG EHMIINFPIG SRPRKNIAAS
IHGHIGDMDK GFADADVIIE RTYNSTQAQQ CPTETHICFT RMDGDRLVIH ASTQVPWHLR
RQVARLVDMK QHKVHVIKER VGGGFGSKQD ILLEEVCAWA TCVTGRPVLF RYTREEEFIA
NTSRHVAKVT VKLGAKKDGR LTAVKMDFRA NTGPYGNHSL TVPCNGPALS LPLYPCDNVD
FQVTTYYSNI CPNGAYQGYG APKGNFAITM ALAELAEQLQ IDQLEIIERN RVHEGQELKI
LGAIGEGKAP TSVPSAASCA LEEILRQGRE MIQWSSPKPQ NGDWHIGRGV AIIMQKSGIP
DIDQANCMIK LESDGTFIVH SGGADIGTGL DTVVTKLAAE VLHCPPQDVH VISGDTDHAL
FDKGAYASSG TCFSGNAARL AAENLREKIL FHGAQMLGEP VADVQLATPG VVRGKKGEVS
FGDIAHKGET GTGFGSLVGT GSYITPDFAF PYGANFAEVA VNTRTGEIRL DKFYALLDCG
TPVNPELALG QIYGATLRAI GHSMSEEIIY DAEGHPLTRD LRSYGAPKIG DIPRDFRAVL
VPSDDKVGPF GAKSISEIGV NGAAPAIATA IHDACGIWLR EWHFTPEKIL TALEKI