Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0645 |
Symbol | entE |
ID | 5591300 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 662151 |
End bp | 663761 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640919826 |
Product | enterobactin synthase subunit E |
Protein accession | YP_001457408 |
Protein GI | 157160090 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1021] Peptide arylation enzymes |
TIGRFAM ID | [TIGR02275] 2,3-dihydroxybenzoate-AMP ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 69 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATTC CATTCACCCG CTGGCCGGAA GAGTTTGCCC GTCGCTATCG GGAAAAAGGC TACTGGCAGG ATTTGCCGCT GACCGACATT CTGACGCGAC ATGCTGCGAG TGACAGCATC GCGGTTATCG ACGGCGAGCG ACAGTTGAGT TATCGGGAGC TGAATCAGGC GGCGGATAAC CTCGCGTGTA GTTTACGCCG TCAGGGCATT AAACCTGGTG AAACCGCGCT GGTACAACTG GGTAACGTCG CTGAATTGTA TATTACCTTT TTCGCGCTGC TGAAACTGGG CGTTGCGCCG GTGCTGGCGT TGTTCAGCCA TCAGCGTAGT GAACTGAACG CCTATGCCAG CCAGATTGAA CCCGCATTGC TGATTGCCGA TCGCCAACAT GCGCTGTTTA GCGGGGATGA TTTCCTCAAT ACTTTCGTCA CAGAACATTC CTCCATTCGC GTGGTGCAAC TGCTCAACGA CAGCGGTGAG CATAACTTGC AGGATGCGAT TAACCATCCG GCTGAGGATT TTACTGCCAC GCCATCACCT GCTGATGAAG TGGCCTATTT CCAGCTTTCC GGTGGCACCA CCGGCACACC GAAACTGATC CCGCGCACTC ATAACGACTA CTACTACAGC GTGCGTCGTA GCGTCGAGAT TTGTCAGTTC ACACAACAGA CACGCTACCT GTGCGCGATC CCGGCGGCTC ATAACTACGC CATGAGTTCG CCAGGATCGC TGGGCGTCTT TCTTGCCGGA GGAACGGTTG TTCTGGCGGC CGATCCCAGC GCCACGCTCT GTTTCCCATT GATTGAAAAA CATCAGGTTA ACGTTACCGC GCTGGTGCCA CCCGCAGTCA GCCTGTGGTT GCAGGCGCTG ATCGAAGGCG AAAGCCGGGC GCAGCTTGCC TCGCTGAAAC TGTTACAGGT CGGCGGCGCA CGTCTTTCTG CCACCCTTGC GGCGCGTATT CCCGCTGAGA TTGGCTGTCA GTTGCAGCAG GTGTTTGGCA TGGCGGAAGG GCTGGTGAAC TACACCCGAC TTGATGATAG CGCGGAGAAA ATTATCCATA CCCAGGGTTA CCCAATGTGT CCGGATGACG AAGTATGGGT TGCCGATGCC GAAGGAAATC CACTGCCGCA AGGGGAAGTC GGACGCCTGA TGACGCGCGG GCCGTACACC TTCCGCGGCT ATTACAAAAG TCCACAGCAC AATGCCAGCG CCTTTGATGC CAACGGTTTT TACTGTTCCG GCGATCTGAT CTCTATTGAT CCAGAGGGTT ACATCACCGT GCAGGGGCGC GAGAAAGATC AGATTAACCG TGGCGGCGAG AAGATCGCTG CCGAAGAGAT CGAAAACCTG CTGCTGCGCC ACCCGGCGGT GATCTACGCC GCACTGGTGA GCATGGAAGA TGAGCTGATG GGCGAAAAAA GCTGCGCTTA TCTGGTGGTA AAAGAGCCGC TGCGCGCGGT GCAGGTGCGT CGTTTCCTGC GTGAACAGGG TATTGCCGAA TTTAAATTAC CGGATCGCGT GGAGTGTGTG GATTCACTTC CGCTGACGGC GGTCGGGAAA GTCGATAAAA AACAATTACG TCAGTGGCTG GCGTCACGCG CATCAGCCTG A
|
Protein sequence | MSIPFTRWPE EFARRYREKG YWQDLPLTDI LTRHAASDSI AVIDGERQLS YRELNQAADN LACSLRRQGI KPGETALVQL GNVAELYITF FALLKLGVAP VLALFSHQRS ELNAYASQIE PALLIADRQH ALFSGDDFLN TFVTEHSSIR VVQLLNDSGE HNLQDAINHP AEDFTATPSP ADEVAYFQLS GGTTGTPKLI PRTHNDYYYS VRRSVEICQF TQQTRYLCAI PAAHNYAMSS PGSLGVFLAG GTVVLAADPS ATLCFPLIEK HQVNVTALVP PAVSLWLQAL IEGESRAQLA SLKLLQVGGA RLSATLAARI PAEIGCQLQQ VFGMAEGLVN YTRLDDSAEK IIHTQGYPMC PDDEVWVADA EGNPLPQGEV GRLMTRGPYT FRGYYKSPQH NASAFDANGF YCSGDLISID PEGYITVQGR EKDQINRGGE KIAAEEIENL LLRHPAVIYA ALVSMEDELM GEKSCAYLVV KEPLRAVQVR RFLREQGIAE FKLPDRVECV DSLPLTAVGK VDKKQLRQWL ASRASA
|
| |