Gene Information |
Locus tag | ECH74115_0671 |
Symbol | entF |
ID | 6972207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 699088 |
End bp | 702969 |
Gene Length | 3882 bp |
Protein Length | 1293 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643384707 |
Product | enterobactin synthase subunit F |
Protein accession | YP_002269220 |
Protein GI | 209400326 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
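The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon, which encodes no residue


# Values taken from the record above.
start_bp, end_bp = 699088, 702969
length = gene_length_bp(start_bp, end_bp)

print(length)                     # 3882
print(protein_length_aa(length))  # 1293
```

Both results agree with the Gene Length (3882 bp) and Protein Length (1293 aa) fields.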
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGC ATTTACCTTT GGTCGCCGCA CAGCCCGGCA TCTGGATGGC AGAAAAACTG TCAGAATTAC CCTCCGCCTG GAGCGTGGCG CATTACGTTG AGTTAACCGG AGAGGTTGAT GCGCCATTAC TGGCCCGCGC GGTGGTTGCC GGATTAGCGC AAGCAGATAC GCTGCGGATG CGTTTTACGG AAGATAACGG CGAAGTCTGG CAATGGGTCG ATGATGCGCT GATATTCGAA CTGCCAGAAA TTATCGACCT GCGAACCAAT ATTGATCCGC ACGGTACTGC GCAGGCATTA ATGCAGGCGG ATTTGCAACA AGATTTGCGC GTCGATAGCG GTAAACCACT GGTCTTTCAC CAGCTGATAC AGGTGGCAGA TAACCGCTGG TACTGGTATC AGCGTTATCA CCATTTGCTG GTCGATGGCT TCAGTTTCCC GGCCATTACT CGGCAGATCG CCAACATTTA CTGCGCATTG CTGCGTGGCG AACAAACGCC TGCTTCGCCG TTTACGCCTT TCGCTGATGT AGTGGAAGAG TACCAGCAAT ACCGCGAAAG CGAAGCCTGG CAGCGTGATG CGGCATTCTG GGCGGAACAG CGTCGTCAAC TGCCGCCGCC CGCGTCACTT TCTCCGGCAC CTTTAGCGGG GCGCAGCGCT TCGGCAGATA TTCTACGCCT GAAACTGGAA TTTACCGACG GGGAATTCCG CCAGCTGGCT ACGCAACTTT CAGGTGTGCA GCGTACCGAT TTAGCCCTTG CGCTGGCAGC CTTTTGGCTG GGGCGATTGT GCAATCGCAT GGACTACGCC GCCGGATTTA TCTTTATGCG TCGACTGGGC TCGGCGGCGC TGACGGCTAC CGGACCCGTG CTCAACGTTT TACCGTTGGG TATTCACATT GCGGCACAAG AAACGCTGCC GGAACTGGCA ACCCGACTGG CAGCACAACT GAAAAAAATG CGTCGTCATC AACGTTACGA TGCCGAACAA ATTGTCCGTG ACAGCGGGCG AGCGGCAGGT GATGAACCGC TGTTTGGTCC GGTACTCAAT ATCAAGGTAT TTGATTACCA ACTGGATATT CCTGGTGTTC AGGCGCAAAC CCATACCCTG GCAACCGGTC CGGTTAATGA CCTTGAACTG GCCCTGTTCC CGGATGAACA CGGTGATTTG AGTATTGAGA TCCTCGCCAA TAAACAGCAT TACGATGAGC CAACGTTAAT CCAGCATGCT GAACGCCTGA AAATGCTGAT CGCCCAATTC GCTGCGGATC CGGCTCTGTT GTGCGGCGAT GTTGATATTA TGCTGCCAGG TGAGTATGCG CAGCTGGCGC AGATCAACGC CACTCAGGTT GAGATTCCAG AAACCACGCT TAGCGCGCTG GTGGCAGAAC AAGCGGCAAA AACACCGGAT GCTCCGGCGC TGGCAGATGC GCGTTACCAG TTCAGCTATC GGGAAATGCG CGAGCAGGTG GTGGCGCTGG CGAATCTGCT GCGTGAGCAC GGCGTTAAAC CAGGGGACAG CGTGGCGGTG GCATTACCGC GCTCGGTCTT TTTGACCCTG GCGCTACATG CGATTGTTGA AGCAGGTGCG GCCTGGTTAC CGCTGGATAC CGGTTATCCG GACGATCGCC TGAAAATGAT GCTGGAAGAT GCGCGTCCGT CGCTGTTAAT CACCACCGAC GATCAACTGC CGCGCTTTGC CGATGTTCCA GATTTAACCA ACCTTTGCTA TAACGCCCCG CTTACACCGC AGGGCAGTGC GCCGCTGCAA CTTTCACAAC CGCATCACAC GGCTTATATC 
ATCTTTACCT CTGGCTCCAC CGGCAGGCCG AAAGGGGTAA TGGTCGGGCA GACGGCTATC GTTAACCGCC TGTTGTGGAT GCAAAATCAT TATCCACTTA CAGGTGAAGA TGTCGTTGCC CAAAAAACGC CGTGCAGTTT TGATGTCTCG GTGTGGGAGT TTTTCTGGCC GTTTATTGCC GGGGCTAAAC TGGTGATGGC TGAACCGGAA GCGCACCGCG ACCCGCTCGC TATGCAGCAA TTCTTTGCCG AATATGGCGT AACGACCACG CACTTTGTGC CGTCGATGCT GGCGGCATTT GTTGCATCGC TGACGCCGCA AACCGCTCGC CAGAATTGCG CGACGTTGAA ACAGGTTTTC TGTAGTGGTG AGGCCTTACC GGCTGATTTA TGCCGCGAAT GGCAACAGTT AACGGGCGCG CCGTTGCATA ATCTATATGG CCCGACGGAA GCGGCGGTAG ATGTGAGTTG GTATCCGGCT TTTGGCGAGG AACTGGCACA GGTGCGCGGC AGCAGTGTGC CGATTGGTTA TCCGGTGTGG AATACGGGCT TGCGCATTCT CGATGCGATG ATGCATCCGG TGCCGCCGGG TGTGGCGGGA GATCTCTATC TCACCGGTAT TCAACTGGCG CAGGGGTATC TTGGACGACC CGATCTGACC GCCAGCCGCT TCATTGCCGA TCCTTTTGTC CCTGGTGAAC GGATGTACCG TACCGGAGAC GTTGCCCGCT GGCTGGATAA CGGCGCGGTG GAGTACCTCG GGCGCAGTGA CGATCAGCTA AAAATTCGCG GGCAGCGTAT CGAACTGGGC GAAATCGATC GCGTGATGCA GGCGCTGCCG GATGTCGAAC AAGCCGTTAC CCACGCCTGT GTGATTAACC AGGCGGCAGC CACCGGTGGT GATGCGCGTC AGTTGGTGGG CTATCTGGTG TCGCAATCAG GTCTGCCGTT GGATACCAGC GCATTACAGG CACAGCTTCG CGAAACATTG CCGCCGCATA TGGCGCCGGT CGTTCTGCTG CAACTTCCAC AGTTACCTCT TAGCGCCAAC GGCAAGCTGG ATCGCAAAGC CTTACCGTTG CCTGAACTTA AGGCACAAAC GCCGGGGCGT GCGCCGAAAG CGGGCAGTGA AACGATTATC GCTGCGGCAT TCGCGTCGTT GCTGGGTTGT GACGTGCAGG ATGCCGATGC TGATTTCTTC GCGCTTGGCG GTCATTCGCT ACTGGCAATG AAACTGGCAG CGCAGTTAAG TCGGCAGTTT GCCCGTCAGG TGACGCCGGG GCAGGTGATG GTCGCGTCAA CCGTCGCCAA ACTGGCAACG ATTATTGATG GTGAAGAGGA CAGCTCCCGG CGCATGGGAT TCGAAACCAT TCTGCCGTTG CGTGAAGGTA ATGGCCCGAC GCTGTTTTGT TTCCATCCGG CATCCGGTTT TGCCTGGCAG TTTAGCGTGC TCTCGCGTTA TCTCGATCCA CTATGGTCGA TTATCGGCAT TCAGTCGCCG CGCCCTCATG GCCCCATGCA GACAGCGACG AACCTGGATG AAGTCTGCGA AGCGCATCTG GCAACGTTAC TTGAACAACA ACCGCACGGC CCTTATTACC TGCTGGGGTA TTCCCTTGGC GGTACACTGG CGCAGGGCAT TGCGGCGCGA CTACGTGCCC GTGGCGAACA GGTGGCATTT CTTGGCTTGC TGGATACCTG GCCGCCAGAA ACGCAAAACT GGCAGGAAAA AGAAGCTAAT GGTCTGGACC CGGAAGTGCT GGCGGAGATT AACCGCGAGC GCGAGGCCTT CCTGGCGGCA CAGCAGGGAA 
GTACTTCAAC GGAGTTGTTT ACCACCATTG AAGGCAACTA CGCTGATGCT GTGCGCCTGC TGACGACTGC TCATAGCGTA CCGTTTGACG GAAAAGCGAC GCTGTTTGTT GCTGAACGTA CGCTTCAGGA AGGTATGAGC CCCGAACGCG CCTGGTCGCC GTGGATTGCG GAGCTGGATA TCTATCGTCA GGATTGTGCG CATGTGGATA TTATCTCTCC AGGGGCATTT GAAAAAATTG GGCCGATTAT TCGCGCAACG CTAAACAGAT AA
|
Protein sequence | MSQHLPLVAA QPGIWMAEKL SELPSAWSVA HYVELTGEVD APLLARAVVA GLAQADTLRM RFTEDNGEVW QWVDDALIFE LPEIIDLRTN IDPHGTAQAL MQADLQQDLR VDSGKPLVFH QLIQVADNRW YWYQRYHHLL VDGFSFPAIT RQIANIYCAL LRGEQTPASP FTPFADVVEE YQQYRESEAW QRDAAFWAEQ RRQLPPPASL SPAPLAGRSA SADILRLKLE FTDGEFRQLA TQLSGVQRTD LALALAAFWL GRLCNRMDYA AGFIFMRRLG SAALTATGPV LNVLPLGIHI AAQETLPELA TRLAAQLKKM RRHQRYDAEQ IVRDSGRAAG DEPLFGPVLN IKVFDYQLDI PGVQAQTHTL ATGPVNDLEL ALFPDEHGDL SIEILANKQH YDEPTLIQHA ERLKMLIAQF AADPALLCGD VDIMLPGEYA QLAQINATQV EIPETTLSAL VAEQAAKTPD APALADARYQ FSYREMREQV VALANLLREH GVKPGDSVAV ALPRSVFLTL ALHAIVEAGA AWLPLDTGYP DDRLKMMLED ARPSLLITTD DQLPRFADVP DLTNLCYNAP LTPQGSAPLQ LSQPHHTAYI IFTSGSTGRP KGVMVGQTAI VNRLLWMQNH YPLTGEDVVA QKTPCSFDVS VWEFFWPFIA GAKLVMAEPE AHRDPLAMQQ FFAEYGVTTT HFVPSMLAAF VASLTPQTAR QNCATLKQVF CSGEALPADL CREWQQLTGA PLHNLYGPTE AAVDVSWYPA FGEELAQVRG SSVPIGYPVW NTGLRILDAM MHPVPPGVAG DLYLTGIQLA QGYLGRPDLT ASRFIADPFV PGERMYRTGD VARWLDNGAV EYLGRSDDQL KIRGQRIELG EIDRVMQALP DVEQAVTHAC VINQAAATGG DARQLVGYLV SQSGLPLDTS ALQAQLRETL PPHMAPVVLL QLPQLPLSAN GKLDRKALPL PELKAQTPGR APKAGSETII AAAFASLLGC DVQDADADFF ALGGHSLLAM KLAAQLSRQF ARQVTPGQVM VASTVAKLAT IIDGEEDSSR RMGFETILPL REGNGPTLFC FHPASGFAWQ FSVLSRYLDP LWSIIGIQSP RPHGPMQTAT NLDEVCEAHL ATLLEQQPHG PYYLLGYSLG GTLAQGIAAR LRARGEQVAF LGLLDTWPPE TQNWQEKEAN GLDPEVLAEI NREREAFLAA QQGSTSTELF TTIEGNYADA VRLLTTAHSV PFDGKATLFV AERTLQEGMS PERAWSPWIA ELDIYRQDCA HVDIISPGAF EKIGPIIRAT LNR
|
| |
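The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. Below is a minimal sketch of how the gene sequence could be stripped of whitespace, translated, and checked against the protein sequence. It builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code, so the standard mapping is used here. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and only the first and last few codons of the record are used to keep the example short.

```python
from itertools import product

# Codon-to-amino-acid mapping in T, C, A, G order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and the final 15 bases of the gene sequence above.
head = "ATGAGCCAGC ATTTACCTTT GGTCGCCGCA"
tail = "ACG CTAAACAGAT AA"

print(translate(head))  # MSQHLPLVAA
print(translate(tail))  # TLNR
```

The two outputs match the first ten and last four residues of the protein sequence. Applying `gc_percent` to the full gene sequence would be one way to check the GC content field.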