Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_21210 |
Symbol | entE |
ID | 7761046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2117050 |
End bp | 2118696 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643805016 |
Product | enterobactin synthetase component E (2,3-dihydroxybenzoate-AMP ligase) |
Protein accession | YP_002799297 |
Protein GI | 226944224 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1021] Peptide arylation enzymes |
TIGRFAM ID | [TIGR02275] 2,3-dihydroxybenzoate-AMP ligase |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0419167 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCT CGACCCCGAC CTCTTCTTCC GCCGAAGGCG CGGACTTCAC GCCCTGGCCC GCCGAATTCG CCGCGCGCTA CCGGGCGGCC GGCTACTGGC GCGGCGAACC GCTGGACAGC CTGCTGCGCG CCGGCGCCGA CCGCCATGCC GGACGCACCG CGCTGGTCTG CGGCGAGCGG CGCTGGACCT ACGCCGAACT GGACGCCCGC GTCGACCGGG TGGCGGCCGG GCTGGTCGGA CAAGGCATCG CCGCCGGCGA CCGCGTCGTG GTGCAGTTGC CCAACATCGC CGAATTCGTG ATGGTCATCT TCGCCCTGCT GCGCCTGGGC GCCCTGCCGG TCTTCGCCCT GCCGGCCCAC CGCCGGGCCG AAATCGGCTA CTTCTGCGCC TTCGCCGAAG CCAAGGGGCT GGTCATCCGG GATCGCCACG CCGGTTTCGA CTATCGCCAG ATGGCCCGCG ACATCCGCGA CGAAGCAGCG ACCCTGAGCA CCGTCGTGGT GGTCGGCGAG GCCGAGGAAT TCATCCCCTT CGAGCGGCTC GACGCCGAGC CGCTGCCGCT GCCGGAACCC AAGGCCGACA CGCTGGCCTT CCTGCAACTG TCGGGCGGCA GCACGGGACG GCCGAAGATG ATCCCGCGCA CCCACGACGA CTATTTCTAC AGCGTGCGGG CCAGCGCCGA GATCTGCGGC CTCGGCCCGG ACACCGTGTT CCTCTGCGCC CTGCCGGCGG CCCACAACTT CGCGATGAGT TCGCCGGGCA TCCTGGGCGT CCTCTACGCC GGCGGCAGCG TGGTGCTGGC GCCCGATCCC AGTCCCGACA CCTGTTTCGC CCTGATCGCC CGCGAGCGGG TCGACATGAC CGCGCTGGTG CCCTCCGTGG CGCTGGCCTG GATGGAGGCC GCGCCGGCCC GGCAGGCCGA ACTGGCCAGC CTGAAGGTGC TGCAGGTCGG CGGTTCGCGC CTCAGCGACG AAGCCGCGCA ACGGGTCGAC AGCCTGCTCG GCTGCAAGCT GCAACAGGTG TTCGGCATGG CCGAGGGACT GGTCAACTAC ACCCGGTTCG ACGATCCCCA GGAGCTGATC GTCGGCACCC AGGGCCGCCC CATCTCCCCG GACGACGAAG TGCGCATCGT CGACGACGAG GACCGCGACG TGCCGCCGGG CGAAACCGGG CACCTGATCA CCCGTGGCCC CTACACCATT CGCGGCTACT TCCGCGCCGA TGTGCACAAC GCCCGCTCCT TCACCCGCGA CGGCTTCTAC CGCACCGGCG ATGTGGCCCG CCGCCTGCCC AGCGGGCACC TGATCGTCGA GGGCCGCGAC AAGGACCAGA TCAACCGCGG TGGCGACAAG GTGGCCGCCG AGGAAGTGGA AAACCACCTG CTGGCGCATC CCGCCGTGCT GGATGTCGCC GTGGTCGCGA TGCCCGACGC CTTCCTCGGC GAGCGCACCT GCGCCTTCAT CGTGCCGCGC GGCGAAGCGC CCCGGCCGCT GGAGATCAAC CGCTTCATGC GCGAACGCGG CGTCGCCGGC TACAAGGTGC CGGACCGCAT CGAGTTCGTC GACCAGTTGC CCAAGACCGG CGTCGGCAAG ATCGACAAGC GCGCCCTGCG CGAACGCATC GCGGCACGCC TGCAGGCCAC GGCCTGA
|
Protein sequence | MSTSTPTSSS AEGADFTPWP AEFAARYRAA GYWRGEPLDS LLRAGADRHA GRTALVCGER RWTYAELDAR VDRVAAGLVG QGIAAGDRVV VQLPNIAEFV MVIFALLRLG ALPVFALPAH RRAEIGYFCA FAEAKGLVIR DRHAGFDYRQ MARDIRDEAA TLSTVVVVGE AEEFIPFERL DAEPLPLPEP KADTLAFLQL SGGSTGRPKM IPRTHDDYFY SVRASAEICG LGPDTVFLCA LPAAHNFAMS SPGILGVLYA GGSVVLAPDP SPDTCFALIA RERVDMTALV PSVALAWMEA APARQAELAS LKVLQVGGSR LSDEAAQRVD SLLGCKLQQV FGMAEGLVNY TRFDDPQELI VGTQGRPISP DDEVRIVDDE DRDVPPGETG HLITRGPYTI RGYFRADVHN ARSFTRDGFY RTGDVARRLP SGHLIVEGRD KDQINRGGDK VAAEEVENHL LAHPAVLDVA VVAMPDAFLG ERTCAFIVPR GEAPRPLEIN RFMRERGVAG YKVPDRIEFV DQLPKTGVGK IDKRALRERI AARLQATA
|
| |