Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4171 |
Symbol | |
ID | 6968193 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3863863 |
End bp | 3866733 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643387917 |
Product | putative selenate reductase subunit YgfN |
Protein accession | YP_002272356 |
Protein GI | 209399134 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [COG2080] Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs |
TIGRFAM ID | [TIGR03313] probable selenate reductase, molybdenum-binding subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCATCC ACTTTACTTT AAATGGCGCG CCTCAGGAGC TAACCGTTAA TCCAGGCGAA AACGTGCAAA AGCTGTTGTT TAACATGGGA ATGCACTCTG TACGCAACAG TGATGATGGT TTCGGGTTTG CCGGTTCTGA CGCAATCATC TTTAACGGTA ATATCGTTAA CGCGTCCTTG CTTATTGCCG CACAGTTAGA GAAGGCAGAT ATTCGTACCG CAGAATCTCT GGGCAAATGG AACGAGTTAA GTCTGGTTCA ACAGGCAATG GTTGATGTTG GCGTGGTGCA GTCTGGTTAT AACGATCCAG CTGCAGCTCT GATTATCACC GATCTTCTCG ATCGCATCGC CGCACCTACC CGCGAAGAGA TCGACGACGC GCTTTCTGGT TTGTTCAGCC GCGATGCTGG CTGGCAGCAA TACTATCAGG TCATTGAACT GGCGGTTGCA CGTAAAAATA ATCCGCAGGC CACCATTGAT ATCGCTCCGA CTTTCCGTGA CGACCTTGAA GTCATTGGCA AGCATTATCC TAAAACTGAT GCCGCGAAAA TGGTGCAGGC GAAACCCTGC TATGTTGAAG ACCGCGTAAC GGCTGACGCC TGCGTCATTA AAATGTTACG TAGCCCACAC GCTCACGCAC TGATTACTCA TCTGGATGTC AGCAAAGCTG AAGCCTTACC GGGCGTCGTT CACGTTATTA CTCACCTGAA TTGCCCTGAT ATCTACTATA CCCCGGGGGG TCAGAGCGCA CCGGAACCGT CACCGCTTGA CCGCCGTATG TTCGGCAAAA AAATGCGTCA CGTCGGCGAT CGCGTTGCTG CGGTGGTCGC TGAAAGTGAA GAAATTGCGC TCGAAGCATT GAAACTCATC GAGGTTGAAT ATGAAGTGCT TAAGCCGGTA ATGTCGATCG ACGAAGCAAT GGCGGAAGAT GCGCCTGTCG TGCACGATGA ACCGGTGGTG TATGTTGCTG GTGCGCCAGA TACTCTGGAA GACGATAACA GCCATGCAGC CCAGCGCGGC GAGCATATGA TCATCAACTT CCCGATCGGT TCTCGCCCTC GCAAAAATAT CGCCGCTAGT ATTCATGGTC ATATTGGCGA TATGGACAAA GGCTTTGCCG ATGCCGATGT GATCATTGAG CGAACCTATA ACTCAACGCA GGCGCAGCAG TGCCCGACTG AAACACATAT CTGCTTTACC CGGATGGACG GCGATCGTCT GGTTATCCAC GCCTCCACCC AGGTACCATG GCACTTACGC CGCCAGGTCG CGCGCCTCGT GGACATGAAA CAGCATAAAG TTCATGTCAT TAAAGAGCGA GTTGGCGGCG GTTTTGGTTC CAAACAGGAC ATCCTGCTGG AAGAAGTGTG CGCCTGGGCA ACCTGCGTGA CCGGGCGTCC GGTACTGTTC CGCTACACCC GTGAAGAAGA GTTTATTGCT AACACCTCTC GTCACGTCGC GAAAGTCACC GTCAAACTGG GAGCGAAAAA AGATGGTCGC CTGACGGCAG TGAAGATGGA TTTCCGCGCC AACACTGGCC CTTACGGCAA CCACTCACTC ACCGTACCGT GTAACGGACC GGCGCTGTCG CTGCCGTTAT ATCCGTGCGA TAACGTCGAT TTCCAGGTCA CCACCTACTA CAGCAACATT TGCCCAAATG GTGCTTATCA GGGTTATGGC GCACCGAAAG GTAACTTCGC TATCACCATG GCATTAGCGG AACTGGCTGA ACAGTTACAG ATCGACCAAC TGGAAATTAT CGAACGTAAC CGGGTACACG AAGGGCAAGA GCTGAAAATT CTCGGTGCAA TCGGTGAAGG TAAAGCGCCG ACCTCCGTTC CTTCCGCCGC CAGCTGCGCA CTGGAAGAGA TCCTGCGTCA GGGGCGCGAG ATGATCCAAT GGTCTTCACC AAAACCACAA AATGGTGACT GGCACATCGG TCGTGGTGTC GCCATTATCA TGCAGAAATC AGGGATCCCG GATATCGATC AGGCTAACTG CATGATCAAA CTGGAATCAG ACGGTACCTT TATCGTTCAT TCTGGCGGTG CGGATATTGG TACTGGTCTG GATACCGTGG TGACGAAACT GGCAGCAGAA GTGCTGCACT GCCCACCGCA GGACGTGCAT GTTATCTCCG GTGATACCGA TCATGCGTTG TTTGATAAAG GCGCATATGC CTCGTCCGGT ACTTGCTTCT CGGGTAACGC GGCGCGTTTG GCAGCGGAAA ATCTGCGGGA GAAAATTCTG TTCCACGGCG CGCAAATGTT GGGTGAGCCA GTGGCAGATG TTCAACTAGC AACGCCGGGC GTCGTGCGCG GCAAGAAAGG CGAAGTTAGT TTCGGGGATA TTGCCCATAA AGGCGAAACC GGCACCGGCT TTGGTTCACT GGTGGGAACT GGCAGTTATA TCACGCCTGA TTTCGCCTTC CCGTATGGCG CAAACTTCGC TGAAGTTGCC GTCAACACGC GTACGGGTGA AATCCGCCTG GATAAATTCT ACGCCTTGCT GGACTGCGGT ACACCGGTCA ATCCAGAGTT AGCGTTGGGA CAAATCTACG GTGCTACCCT GCGAGCTATC GGCCACAGTA TGAGCGAAGA GATCATTTAT GACGCCGAAG GTCACCCGTT AACGCGTGAT TTACGCAGCT ACGGCGCACC GAAAATTGGT GACATTCCGC GTGATTTCCG CGCAGTGCTG GTGCCGAGCG ACGATAAAGT CGGCCCGTTC GGGGCGAAAT CGATCTCGGA AATCGGTGTA AATGGCGCAG CTCCGGCGAT TGCTACCGCA ATTCACGATG CATGCGGCAT CTGGTTACGC GAATGGCATT TCACACCGGA GAAAATACTC ACCGCGCTGG AAAAGATATA A
|
Protein sequence | MIIHFTLNGA PQELTVNPGE NVQKLLFNMG MHSVRNSDDG FGFAGSDAII FNGNIVNASL LIAAQLEKAD IRTAESLGKW NELSLVQQAM VDVGVVQSGY NDPAAALIIT DLLDRIAAPT REEIDDALSG LFSRDAGWQQ YYQVIELAVA RKNNPQATID IAPTFRDDLE VIGKHYPKTD AAKMVQAKPC YVEDRVTADA CVIKMLRSPH AHALITHLDV SKAEALPGVV HVITHLNCPD IYYTPGGQSA PEPSPLDRRM FGKKMRHVGD RVAAVVAESE EIALEALKLI EVEYEVLKPV MSIDEAMAED APVVHDEPVV YVAGAPDTLE DDNSHAAQRG EHMIINFPIG SRPRKNIAAS IHGHIGDMDK GFADADVIIE RTYNSTQAQQ CPTETHICFT RMDGDRLVIH ASTQVPWHLR RQVARLVDMK QHKVHVIKER VGGGFGSKQD ILLEEVCAWA TCVTGRPVLF RYTREEEFIA NTSRHVAKVT VKLGAKKDGR LTAVKMDFRA NTGPYGNHSL TVPCNGPALS LPLYPCDNVD FQVTTYYSNI CPNGAYQGYG APKGNFAITM ALAELAEQLQ IDQLEIIERN RVHEGQELKI LGAIGEGKAP TSVPSAASCA LEEILRQGRE MIQWSSPKPQ NGDWHIGRGV AIIMQKSGIP DIDQANCMIK LESDGTFIVH SGGADIGTGL DTVVTKLAAE VLHCPPQDVH VISGDTDHAL FDKGAYASSG TCFSGNAARL AAENLREKIL FHGAQMLGEP VADVQLATPG VVRGKKGEVS FGDIAHKGET GTGFGSLVGT GSYITPDFAF PYGANFAEVA VNTRTGEIRL DKFYALLDCG TPVNPELALG QIYGATLRAI GHSMSEEIIY DAEGHPLTRD LRSYGAPKIG DIPRDFRAVL VPSDDKVGPF GAKSISEIGV NGAAPAIATA IHDACGIWLR EWHFTPEKIL TALEKI
|
| |