Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_1463 |
Symbol | trpE |
ID | 5590594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 1455149 |
End bp | 1456711 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640925157 |
Product | anthranilate synthase component I |
Protein accession | YP_001462562 |
Protein GI | 157157578 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00565] anthranilate synthase component I, proteobacterial subset |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.40703 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAACAC AAAAACCGAC TCTCGAACTG CTAACCTGCG AAGGCGCTTA TCGCGACAAC CCGACTGCGC TTTTTCACCA GTTGTGTGGG AATCGTCCGG CAACGCTGCT GCTGGAATCC GCAGATATCG ACAGCAAAGA TGATTTAAAA AGCCTGCTGC TGGCAGACAG TGCGCTGCGC ATTACAGCTT TAGGTGACAC TGTCACAATC CAGGCATTTT CCGGCAACGG CGAAGCCCTG CTGACACTAC TGGATAACGC CCTGCCTGCG GGTGTGGAAA ATGAACAATT ACCAAACTGC CGTGTGCTGC GCTTCCCCCC TGTCAGTCCA CTGCTGGATG AAGACGCTCG CTTATGCTCC CTTTCGGTTT TTGACGCTTT CCGTTTATTG CAGAATCTGT TGAATGTACC GAAGGAAGAA CGAGAAGCAA TGTTCTTCGG CGGCCTGTTC TCTTATGACC TTGTGGCGGG ATTTGAAGAT TTACCGCAAC TGTCAGCGGA AAATAACTGC CCTGATTTCT GTTTTTATCT CGCTGAAACG CTGATGGTGA TTGACCATCA GAAAAAAAGC ACCCGTATTC AGGCCAGCCT GTTTGCTCCG AATGAAGAAG AAAAACAACG TCTCACTGCT CGCCTGAACG AACTACGTCA GCAACTGACC GAAGCCGCGC CGCCGCTGCC GGTGGTTTCC GTGCCGCATA TGCGTTGTGA ATGTAATCAG AGCGATGAAG AGTTCGGTGG CGTAGTGCGT TTGTTGCAAA AAGCGATTCG CGCTGGAGAA ATTTTCCAGG TGGTGCCATC TCGCCGTTTT TCTCTGCCCT GCCCGTCACC GCTGGCGGCC TATTACGTGC TGAAAAAGAG TAATCCCAGC CCGTACATGT TTTTTATGCA GGATAATGAT TTCACCCTGT TTGGCGCGTC GCCGGAAAGT TCGCTCAAGT ATGACGCCAC CAGCCGCCAG ATTGAGATCT ACCCGATTGC CGGAACACGT CCACGCGGTC GTCGTGCCGA TGGCTCGCTG GACAGAGACC TCGACAGCCG CATCGAACTG GAAATGCGTA CCGATCATAA AGAGCTTTCT GAACATCTGA TGCTGGTGGA TCTTGCCCGT AATGATCTGG CACGCATTTG CACCCCCGGC AGCCGCTACG TCGCCGACCT CACCAAAGTT GACCGTTACT CTTACGTGAT GCACCTGGTC TCCCGCGTGG TCGGTGAGCT GCGCCACGAT CTCGACGCCC TACACGCTTA CCGCGCCTGT ATGAATATGG GGACGTTAAG CGGTGCGCCG AAAGTGCGCG CCATGCAGTT AATTGCCGAG GCGGAAGGTC GTCGCCGCGG CAGCTACGGC GGCGCGGTAG GTTATTTCAC CGCGCACGGC GATCTCGACA CCTGCATTGT GATCCGCTCG GCGCTGGTGG AAAACGGTAT CGCCACCGTG CAAGCCGGTG CTGGCGTAGT CCTTGATTCT ATTCCGCAGT CGGAAGCCGA CGAAACCCGT AATAAAGCCC GCGCTGTACT GCGCGCTATT GCCACCGCGC ATCATGCACA GGAGACTTTC TGA
|
Protein sequence | MQTQKPTLEL LTCEGAYRDN PTALFHQLCG NRPATLLLES ADIDSKDDLK SLLLADSALR ITALGDTVTI QAFSGNGEAL LTLLDNALPA GVENEQLPNC RVLRFPPVSP LLDEDARLCS LSVFDAFRLL QNLLNVPKEE REAMFFGGLF SYDLVAGFED LPQLSAENNC PDFCFYLAET LMVIDHQKKS TRIQASLFAP NEEEKQRLTA RLNELRQQLT EAAPPLPVVS VPHMRCECNQ SDEEFGGVVR LLQKAIRAGE IFQVVPSRRF SLPCPSPLAA YYVLKKSNPS PYMFFMQDND FTLFGASPES SLKYDATSRQ IEIYPIAGTR PRGRRADGSL DRDLDSRIEL EMRTDHKELS EHLMLVDLAR NDLARICTPG SRYVADLTKV DRYSYVMHLV SRVVGELRHD LDALHAYRAC MNMGTLSGAP KVRAMQLIAE AEGRRRGSYG GAVGYFTAHG DLDTCIVIRS ALVENGIATV QAGAGVVLDS IPQSEADETR NKARAVLRAI ATAHHAQETF
|
| |