Gene Information |
Locus tag | EcSMS35_1868 |
Symbol | trpE |
ID | 6145204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1891466 |
End bp | 1893028 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641616744 |
Product | anthranilate synthase component I |
Protein accession | YP_001743922 |
Protein GI | 170680165 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00565] anthranilate synthase component I, proteobacterial subset |
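
The length fields in this record follow from the coordinates. A minimal sketch of that arithmetic in Python, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Start bp and End bp as listed in this record.
length_bp = gene_length(1891466, 1893028)
print(length_bp, protein_length(length_bp))
```

With the Start bp and End bp values from this record, the sketch gives 1563 bp and 520 aa, matching the Gene Length and Protein Length fields.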
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00000000112472 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGCAAACAC AAAAACCGAC TCTCGAACTG CTAACTTGCA AAGGCGCTTA TCGCGACAAC CCGACTGCGC TTTTTCACCA GTTGTGTGGG GATCGTCCGG CAACGCTGCT GCTGGAATCC GCAGATATCG ACAGCAAAGA TGATTTAAAA AGTCTATTGC TAGTGGACAG TGCGCTGCGC ATTACAGCAT TAGGTGACAC TGTCACTATT CAGGCACTTT CCGGCAATGG CGAAGCCCTG TTGGCATTAC TGGATAACGC CCTGCCTGCG GGTGTGGAAA ATGAACAATC ACCAAACTGC CGCGTGCTGC GCTTCCCTCC TGTCAGTCCA TTGCTGGATG AAGACGCCCG CTTATGCTCC CTTTCGGTTT TTGACGCTTT CCGCTTATTG CAGAATCTGT TGAATGTACC GAAGGAAGAA CGGGAAGCGA TGTTCTTTGG TGGCCTGTTC TCTTACGACC TGGTCGCAGG ATTTGAAGAT TTACCGCAAC TGGCAGCGGA AAATAACTGC CCTGATTTCT GTTTTTATCT CGCTGAAACG CTGATGGTGA TTGACCATCA GAAAAAAAGC ACCCGTATTC AGGCCAGCCT GTTTGCTCCC AATGAAAAAG AAAAACAACG TCTCACTGCT CGCCTGAACG AACTTCGCCA GCAACTGACC GAAACCGCGC CACCGCTGCC GGTGGTTTCC GTGCCGCATA TGCGTTGTGA ATGTAATCAG AGCGATGAAG AGTTCGGTGG CGTGGTGCGT TCGTTGCAAA AAGCGATTCG CGCCGGAGAA ATTTTCCAGG TGGTGCCATC TCGCCGTTTC TCTCTACCCT GCCCGTCACC GCTGGCGGCT TATTACGTGC TGAAAAAGAG TAATCCCAGC CCGTACATGT TTTTTATGCA GGATAATGAT TTCACCCTGT TTGGCGCGTC GCCGGAAAGT TCGCTCAAGT ATGACGCCAC CAGCCGCCAG ATTGAGATCT ACCCGATTGC CGGAACACGC CCACGCGGTC GTCGTGCCGA TGGTTCACTG GACAGAGACC TCGACAGCCG CATCGAACTC GAAATGCGTA CCGACCATAA AGAGCTTTCT GAACATCTGA TGCTGGTGGA TCTCGCCCGT AATGACCTGG CTCGCATTTG CACACCCGGC AGCCGCTACG TTGCCGATCT CACCAAAGTT GACCGTTACT CTTACGTGAT GCACCTGGTC TCCCGCGTTG TTGGTGAGCT GCGCCACGAT CTCGACGCCC TGCACGCTTA CCGCGCCTGT ATGAATATGG GGACGTTAAG CGGTGCGCCG AAAGTACGCG CTATGCAGTT AATTGCCGAA GCAGAAGGTC GTCGACGCGG CAGCTACGGC GGCGCGGTAG GTTATTTTAC CGCGCATGGC GATCTCGACA CCTGCATTGT GATCCGCTCG GCGCTGGTGG AAAACGGTAT CGCCACCGTG CAAGCCGGTG CTGGCGTAGT CCTTGATTCT GTTCCGCAGT CGGAAGCCGA CGAAACCCGT AATAAAGCCC GCGCTGTACT GCGCGCTATT GCCACCGCGC ATCATGCACA GGAGACCTTC TGA
Protein sequence | MQTQKPTLEL LTCKGAYRDN PTALFHQLCG DRPATLLLES ADIDSKDDLK SLLLVDSALR ITALGDTVTI QALSGNGEAL LALLDNALPA GVENEQSPNC RVLRFPPVSP LLDEDARLCS LSVFDAFRLL QNLLNVPKEE REAMFFGGLF SYDLVAGFED LPQLAAENNC PDFCFYLAET LMVIDHQKKS TRIQASLFAP NEKEKQRLTA RLNELRQQLT ETAPPLPVVS VPHMRCECNQ SDEEFGGVVR SLQKAIRAGE IFQVVPSRRF SLPCPSPLAA YYVLKKSNPS PYMFFMQDND FTLFGASPES SLKYDATSRQ IEIYPIAGTR PRGRRADGSL DRDLDSRIEL EMRTDHKELS EHLMLVDLAR NDLARICTPG SRYVADLTKV DRYSYVMHLV SRVVGELRHD LDALHAYRAC MNMGTLSGAP KVRAMQLIAE AEGRRRGSYG GAVGYFTAHG DLDTCIVIRS ALVENGIATV QAGAGVVLDS VPQSEADETR NKARAVLRAI ATAHHAQETF
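
The sequences in this record are displayed in space-separated blocks of 10 characters. A minimal sketch, in Python, of how such a string might be normalized before further analysis and how a GC fraction could be computed from it. The helper names are hypothetical, and the (G + C) / length definition is an assumption that may differ from how the database derived its GC content field:

```python
def strip_blocks(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases; assumes an unambiguous A/C/G/T sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks of the gene sequence in this record, used only as a demo.
demo = strip_blocks("ATGCAAACAC AAAAACCGAC")
print(len(demo), gc_fraction(demo))
```

Applied to the full gene sequence, the cleaned string should have the 1563 bp listed under Gene Length, which gives a quick consistency check on the copied text.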