Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3058 |
Symbol | entF |
ID | 6066145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3339198 |
End bp | 3343079 |
Gene Length | 3882 bp |
Protein Length | 1293 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641602474 |
Product | enterobactin synthase subunit F |
Protein accession | YP_001726009 |
Protein GI | 170021055 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.184112 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGC ATTTACCTTT GGTCGCCGCA CAGCCCGGCA TCTGGATGGC AGAAAAACTG TCAGAATTAC CCTCCGCCTG GAGCGTGGCG CATTACGTTG AGTTAACCGG AGAGGTTGAT TCGCCATTAC TGGCCCGCGC GGTGGTTGCC GGACTAGCGC AAGCAGATAC GCTGCGGATG CGTTTTACGG AAGATAACGG CGAAGTCTGG CAGTGGGTCG ATGATGCGCT GACGTTCGAA CTGCCAGAAA TTATCGACCT ACGAACCAAC ATTGATCCGC ACGGTACTGC GCAGGCATTA ATGCAGGCGG ATTTGCAACA AGATCTGCGC GTCGATAGCG GTAAACCACT GGTCTTTCAT CAGCTGATAC AGGTGGCGGA TAACCGCTGG TACTGGTATC AGCGTTATCA CCATTTGCTG GTCGATGGCT TCAGTTTCCC GGCCATTACC CGCCAGATCG CCAATATTTA CTGCACATGG CTGCGTGGCG AACCAACGCC TGCTTCGCCA TTTACGCCTT TCGCTGATGT AGTGGAAGAG TACCAGCAAT ACCGCGAAAG CGAAGCCTGG CAGCGTGATG CGGCATTCTG GGCAGAACAG CGTCGTCAAC TGCCGCCGCC CGCGTCACTT TCTCCGGCAC CTTTACCGGG GCGCAGCGCC TCGGCAGATA TTCTGCGCCT GAAACTGGAA TTTACCGACG GGGAATTCCG CCAGCTGGCT ACGCAACTTT CAGGTGTGCA GCGTACCGAT TTAGCCCTTG CGCTGGCAGC CTTGTGGCTG GGGCGATTGT GCAATCGTAT GGACTACGCC GCCGGATTTA TCTTTATGCG TCGACTGGGC TCGGCGGCGC TGACGGCTAC CGGACCCGTG CTCAACGTTT TGCCGTTGGG TATTCACATT GCGGCGCAAG AAACGCTGCC GGAACTGGCA ACCCGACTGG CAGCACAACT GAAAAAAATG CGTCGTCATC AACGTTACGA TGCCGAACAA ATTGTCCGTG ACAGCGGGCG AGTGGCAGGT GATGAACCGC TGTTTGGTCC GGTACTCAAT ATCAAGGTAT TTGATTACCA ACTGGATATT CCTGGTGTTC AGGCGCAAAC CCATACCCTG GCAACCGGTC CGGTTAATGA CCTTGAACTG GCCCTGTTCC CGGATGAACA CGGTGATTTG AGTATTGAGA TCCTCGCCAA TAAACAGCGT TACGCTGAGC CAACGTTAAT CCAGCATGCT GAACGCCTGA AAATGCTGAT TGCCCAGTTC GCCGCAGATC CGGCGCTGTT GTGCGGTGAT GTCGATATTA TGCTGCCAGG TGAATACGCG CAGCTGGCGC AGATCAACGC CACTCAGGTT GAGATTCCAG AAACCACGCT TAGCGCTCTG GTGGCAGAAC AAGCGGCAAA AACTCCGGAT GCTCCGGCGC TGGCAGATGC GCATTACCAG TTCAGCTATC GGGAAATGCG TGAGCAGGTG GTGGCGCTGG CGAATCTGCT GCGTGAGCGC GGCGTTAAAC CGGGCGACAG CGTAGCGGTG GCGCTACCGC GCTCGGTCTT TTTGACCCTG GCACTCCATG CGATTGTTGA AGCAGGTGCG GCCTGGCTAC CGCTGGATAC CGGTTATCCG GACGATCGCC TGAAAATGAT GCTAGAAGAT GCGCGTCCGT CGCTGTTAAT CACCACCGAC GATCAACTGC CGCGCTTTGC CGATGTTCCA GATTTAACCA ACCTTTGCTA TAACGCCCCG CTTACACCGC AGGGCAGTGC GCCGCTGCAA CTTTCACAAC CGCATCACAC GGCTTATATC ATCTTTACCT CTGGCTCCAC CGGCAGGCCG AAAGGGGTAA TGGTCGGGCA GACGGCTATC GTCAACCGCC TGCTTTGGAT GCAAAATCAT TATCCGCTTA CAGGCGAAGA TGTCGTTGCC CAAAAAACGC CGTGCAGTTT TGATGTCTCG GTGTGGGAGT TTTTCTGGCC GTTTATCGCA GGGGCAAAAC TGGTGATGGC TGAACCGGAA GCGCACCGCG ACCCGCTCGC TATGCAGCAA TTCTTTGCCG AATATGGCGT AACGACCACG CACTTTGTGC CGTCGATGCT GGCGGCGTTT GTTGCCTCGC TGACGCCGCA AACCGCTCGC CAGAGTTGCG CGACGTTGAA ACAGGTTTTC TGTAGTGGTG AGGCCTTACC GGCTGATTTA TGCCGCGAAT GGCAACAGTT AACTGGCGCG CCGTTGCATA ATCTATATGG CCCGACGGAA GCGGCGGTAG ATGTCAGCTG GTATCCGGCT TTTGGCGAGG AACTGGCACA GGTGCGCGGC AGCAGTGTGC CGATTGGTTA TCCGGTGTGG AACACGGGCC TGCGTATTCT TGATGCGATG ATGCATCCGG TGCCGCCGGG TGTGGCGGGA GATCTCTATC TCACTGGCAT TCAACTGGCG CAGGGGTATC TTGGACGACC CGATCTGACC GCCAGCCGCT TTATTGCCGA TCCTTTTGCT CCTGGTGAAC GGATGTACCG TACCGGAGAC GTTGCCCGCT GGCTGGATAA CGGCGCGGTG GAGTACCTCG GGCGCAGTGA CGATCAGCTA AAAATTCGCG GGCAGCGTAT TGAACTGGGC GAAATCGATC GCGTGATGCA GGCGCTGCCG GATGTCGAAC AAGCCGTTAC CCACGCCTGT GTGATTAACC AAGCGGCAGC CACTGGTGGT GATGCGCGTC AGTTGGTGGG CTATCTGGTA TCGCAATCGG GCCTGCCGTT GGATACCAGC GCATTGCAGG CGCAGCTTCG TGAAACATTG CCGCCGCATA TGGTGCCAGT CGTTCTGCTG CAACTGCCGC AGTTGCCACT TAGCGCCAAC GGCAAGCTGG ATCGCAAAGC CTTACCGTTG CCTGAACTGA AGGCACAAGC GCCAGGGCGT GCGCCGAAAG CGGGCAGTGA AACGATTATC GCCGCGGCAT TCTCGTCGTT GCTGGGGTGT GACGTGCAGG ATGCTGACGC TGATTTCTTC GCGCTTGGCG GTCATTCGCT ACTGGCAATG AAACTGGCAG CGCAGTTAAG TCGGCAGTTT GCCCGTCAGG TGACGCCGGG GCAGGTGATG GTTGCGTCAA CCGTCGCCAA ACTGGCAACG ATTATTGATG GTGAAGAGGA CAGCTCCCGG CGCATGGGAT TCGAAACCAT TCTGCCGTTG CGTGAAGGTA ATGGCCCGAC GCTGTTTTGT TTCCATCCGG CATCCGGTTT TGCCTGGCAG TTTAGCGTGC TCTCGCGTTA TATCGATCCA CAATGGTCGA TTATCGGCAT TCAGTCGCCG CGCCCTCATG GCCCCATGCA GACGGCGACG AACCTGGATG AAGTCTGCGA AGCGCATCTG GCAACGTTAC TTGAACAACA ACCGCGCGGC CCTTATTACC TGCTGGGGTA TTCGCTGGGC GGTACGCTGG CGCAGGGTAT TGCGGCGCGG CTGCGTGCCC GTGGCGAACA GGTGGCATTT CTTGGCTTGC TGGATACCTG GCCGCCAGAA ACGCAAAACT GGCAGGAAAA AGAAGCTAAT GGTCTGGACC CGGAAGTGCT GGCGGAGATT AACCGCGAAC GCGAGGCCTT CCTGGCGGCA CAGCAGGGAA GTACTTCAAC GGAGTTGTTT ACCACCATTG AAGGCAACTA CGCTGATGCT GTGCGCCTGC TGACGACTGC TCATAGCGTA CCGTTTGATG GTAAAGCGAC GCTGTTTGTT GCTGAACGCA CGCTTCAGGA AGGTATGAGC CCCGAACGCG CCTGGTCGCC GTGGATAGCC GAGCTGGATA TCTATCGTCA GGATTGTGCG CATGTGGATA TTATCTCTCC AGGGGCGTTT GAAAAAATGG GGCCGATTAT TCGCGCAACG CTAAACAGGT AA
|
Protein sequence | MSQHLPLVAA QPGIWMAEKL SELPSAWSVA HYVELTGEVD SPLLARAVVA GLAQADTLRM RFTEDNGEVW QWVDDALTFE LPEIIDLRTN IDPHGTAQAL MQADLQQDLR VDSGKPLVFH QLIQVADNRW YWYQRYHHLL VDGFSFPAIT RQIANIYCTW LRGEPTPASP FTPFADVVEE YQQYRESEAW QRDAAFWAEQ RRQLPPPASL SPAPLPGRSA SADILRLKLE FTDGEFRQLA TQLSGVQRTD LALALAALWL GRLCNRMDYA AGFIFMRRLG SAALTATGPV LNVLPLGIHI AAQETLPELA TRLAAQLKKM RRHQRYDAEQ IVRDSGRVAG DEPLFGPVLN IKVFDYQLDI PGVQAQTHTL ATGPVNDLEL ALFPDEHGDL SIEILANKQR YAEPTLIQHA ERLKMLIAQF AADPALLCGD VDIMLPGEYA QLAQINATQV EIPETTLSAL VAEQAAKTPD APALADAHYQ FSYREMREQV VALANLLRER GVKPGDSVAV ALPRSVFLTL ALHAIVEAGA AWLPLDTGYP DDRLKMMLED ARPSLLITTD DQLPRFADVP DLTNLCYNAP LTPQGSAPLQ LSQPHHTAYI IFTSGSTGRP KGVMVGQTAI VNRLLWMQNH YPLTGEDVVA QKTPCSFDVS VWEFFWPFIA GAKLVMAEPE AHRDPLAMQQ FFAEYGVTTT HFVPSMLAAF VASLTPQTAR QSCATLKQVF CSGEALPADL CREWQQLTGA PLHNLYGPTE AAVDVSWYPA FGEELAQVRG SSVPIGYPVW NTGLRILDAM MHPVPPGVAG DLYLTGIQLA QGYLGRPDLT ASRFIADPFA PGERMYRTGD VARWLDNGAV EYLGRSDDQL KIRGQRIELG EIDRVMQALP DVEQAVTHAC VINQAAATGG DARQLVGYLV SQSGLPLDTS ALQAQLRETL PPHMVPVVLL QLPQLPLSAN GKLDRKALPL PELKAQAPGR APKAGSETII AAAFSSLLGC DVQDADADFF ALGGHSLLAM KLAAQLSRQF ARQVTPGQVM VASTVAKLAT IIDGEEDSSR RMGFETILPL REGNGPTLFC FHPASGFAWQ FSVLSRYIDP QWSIIGIQSP RPHGPMQTAT NLDEVCEAHL ATLLEQQPRG PYYLLGYSLG GTLAQGIAAR LRARGEQVAF LGLLDTWPPE TQNWQEKEAN GLDPEVLAEI NREREAFLAA QQGSTSTELF TTIEGNYADA VRLLTTAHSV PFDGKATLFV AERTLQEGMS PERAWSPWIA ELDIYRQDCA HVDIISPGAF EKMGPIIRAT LNR
|
| |