Gene ECH74115_4327 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4327 
Symbol 
ID6967507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4002360 
End bp4004579 
Gene Length2220 bp 
Protein Length739 aa 
Translation table11 
GC content55% 
IMG OID643388054 
Producthypothetical protein 
Protein accessionYP_002272492 
Protein GI209399855 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCTA TCTCCCTGAT CCAACCGGAT CGCGACCTGT TCTCCTGGCC GCAGTACTGG 
GCCGCCTGTT TTGGACCGGC ACCGTTTTTG CCGATGTCTC GTGAAGAGAT GGATCAACTT
GGCTGGGATA GCTGCGACAT CATTTTGGTT ACTGGCGACG CGTATGTCGA TCACCCAAGC
TTCGGGATGG CGATTTGCGG TCGTATGCTG GAAGCACAGG GCTTTCGCGT CGGGATCATC
GCCCAGCCAG ACTGGAGCAG CAAAGACGAT TTCATGCGTC TGGGTAAACC GAATCTGTTT
TTCGGTGTTA CTGCTGGCAA CATGGATTCG ATGATCAACC GTTATACCGC CGATCGCCGT
TTACGTCATG ACGATGCCTA CACGCCGGAT AACGTCGCGG GTAAGCGCCC GGATCGCGCC
ACACTGGTTT ATACCCAGCG TTGTAAAGAG GCGTGGAAAG ATGTACCGGT GATCCTCGGT
GGTATTGAGG CTAGTCTGCG CCGTACCGCG CATTATGATT ACTGGTCCGA TACCGTGCGC
CGTTCCGTGC TGGTGGATTC GAAAGCCGAC ATGCTGATGT TTGGTAACGG TGAGCGTCCG
CTGGTGGAGG TGGCGCACCG TCTGGCGATG GGCGAGCCGA TTAGTGAAAT CCGCGATGTG
CGTAATACCG CGATTATCGT GAAAGAGGCG CTGCCTGGCT GGAGCGGCGT GGATTCCACC
CGTCTTGATA CCCCTGGAAA AATCGACCCA ATCCCGCATC CGTATGGTGA AGATTTGCCG
TGCGCGGATA ACAAACCGGT GGCACCGAAA AAGCAGGAAG CCAAAGCCGT AACCGTGCAG
CCACCGCGCC CGAAACCGTG GGAAAAAACC TACGTGTTGC TGCCTTCTTT CGAGAAAGTG
AAGGGCGATA AAGTGCTGTA CGCCCATGCT TCGCGTATTC TGCACCACGA AACTAACCCA
GGTTGCGCCC GCGCATTGAT GCAAAAACAC GGTGACCGCT ATGTGTGGAT CAACCCGCCT
GCTATCCCGC TTTCTACCGA AGAGATGGAC AGCGTTTTTG CGCTGCCGTA CAAGCGCGTG
CCACATCCGG CCTATGGCAA TGCCCGTATT CCGGCTTACG AAATGATCCG TTTTTCGGTC
AACATTATGC GCGGCTGCTT TGGCGGCTGC TCTTTCTGTT CTATCACCGA GCACGAAGGG
CGCATTATTC AGAGCCGTTC CGAAGATTCG ATCATTAATG AGATCGAAGC GATCCGCGAC
ACCGTTCCAG GTTTTACGGG CGTAATTTCC GATCTCGGTG GGCCAACTGC CAACATGTAT
ATGTTGCGCT GCAAATCGCC ACGCGCTGAA CAAACCTGCC GTCGTTTGTC CTGCGTTTAC
CCGGATATTT GTCCGCATAT GGACACTAAC CACGAACCGA CGATCAACCT CTATCGCCGC
GCTCGCGATC TGAAAGGCAT TAAAAAGATT CTGATTGCCT CAGGCGTACG TTATGACATC
GCCGTGGAAG ATCCGCGCTA TATCAAAGAA CTGGCGACCC ATCACGTCGG CGGTTATCTG
AAGATTGCTC CGGAACATAC CGAAGAAGGG CCGTTGTCGA AGATGATGAA GCCGGGCATG
GGCAGTTATG ACCGCTTTAA AGAGCTGTTC GATACCTACT CGAAACAGGC AGGTAAAGAG
CAGTATCTGA TCCCCTATTT CATCTCCGCG CACCCCGGTA CGCGTGATGA AGATATGGTG
AATTTGGCGC TGTGGCTGAA AAAGCATCGT TTCCGTCTCG ACCAGGTACA GAACTTCTAC
CCATCGCCAC TGGCTAACTC GACCACCATG TATTACACCG GGAAAAACCC GCTGGCGAAG
ATTGGTTATA AGAGTGAAGA CGTCTTCGTA CCGAAGGGCG ACAAACAGCG TCGTTTGCAT
AAAGCGTTGT TGCGTTACCA CGATCCGGCA AACTGGCCGT TAATCCGCCA GGCGCTGGAA
GCGATGGGCA AAAAGCATCT GATTGGCAGC CGTCGCGATT GCTTAGTGCC TGCGCCAACC
ATTGAAGAGA TGCGTGAGGC TCGTCGCCAG AACCGCAATA CCCGTCCGGC GTTGACGAAA
CATACGCCGA TGGCGACCCA GTGTCAGACG CCTGCTACGG CAAAAAAAGC GTCGTCTACG
CAATCTCGTC CGGTGAATGC TGGTGCGAAG AAACGCCCTA AAGCGGCGGT TGGACGTTAA
 
Protein sequence
MSSISLIQPD RDLFSWPQYW AACFGPAPFL PMSREEMDQL GWDSCDIILV TGDAYVDHPS 
FGMAICGRML EAQGFRVGII AQPDWSSKDD FMRLGKPNLF FGVTAGNMDS MINRYTADRR
LRHDDAYTPD NVAGKRPDRA TLVYTQRCKE AWKDVPVILG GIEASLRRTA HYDYWSDTVR
RSVLVDSKAD MLMFGNGERP LVEVAHRLAM GEPISEIRDV RNTAIIVKEA LPGWSGVDST
RLDTPGKIDP IPHPYGEDLP CADNKPVAPK KQEAKAVTVQ PPRPKPWEKT YVLLPSFEKV
KGDKVLYAHA SRILHHETNP GCARALMQKH GDRYVWINPP AIPLSTEEMD SVFALPYKRV
PHPAYGNARI PAYEMIRFSV NIMRGCFGGC SFCSITEHEG RIIQSRSEDS IINEIEAIRD
TVPGFTGVIS DLGGPTANMY MLRCKSPRAE QTCRRLSCVY PDICPHMDTN HEPTINLYRR
ARDLKGIKKI LIASGVRYDI AVEDPRYIKE LATHHVGGYL KIAPEHTEEG PLSKMMKPGM
GSYDRFKELF DTYSKQAGKE QYLIPYFISA HPGTRDEDMV NLALWLKKHR FRLDQVQNFY
PSPLANSTTM YYTGKNPLAK IGYKSEDVFV PKGDKQRRLH KALLRYHDPA NWPLIRQALE
AMGKKHLIGS RRDCLVPAPT IEEMREARRQ NRNTRPALTK HTPMATQCQT PATAKKASST
QSRPVNAGAK KRPKAAVGR