Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_01248 |
Symbol | trpE |
ID | 8112855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 1306382 |
End bp | 1307944 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644847499 |
Product | hypothetical protein |
Protein accession | YP_002999072 |
Protein GI | 251784768 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00565] anthranilate synthase component I, proteobacterial subset |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAACAC AAAAACCGAC TCTCGAACAG CTAACCTGCG AAGGCGCTTA TCGCGACAAT CCCACCGCGC TTTTTCACCA GTTGTGTGGG GATCGTCCGG CAACGCTGCT GCTGGAATCC GCAGATATCG ACAGCAAAGA TGATTTAAAA AGCCTGCTGC TGGTAGACAG TGCGCTGCGC ATTACAGCTT TAGGTGACAC TGTCACAATC CAGGCACTTT CCGGCAACGG CGAAGCCCTG CTGGCACTAC TGGATAACGC CCTGCCTGCG GGTGTGGAAA GTGAACAATC ACCAAACTGC CGTGTGCTGC GCTTCCCCCC TGTCAGTCCA CTGCTGGATG AAGACGCCCG CTTATGCTCC CTTTCGGTTT TTGACGCTTT CCGTTTATTG CAGAATCTGT TGAATGTACC GAAGGAAGAA CGAGAAGCCA TGTTCTTCGG CGGCCTGTTC TCTTATGACC TTGTGGCGGG ATTTGAAGAT TTACCGCAAC TGTCAGCGGA AAATAACTGC CCTGATTTCT GTTTTTATCT CGCTGAAACG CTGATGGTGA TTGACCATCA GAAAAAAAGC ACCCGTATTC AGGCCAGCCT GTTTGCTCCG AATGAAGAAG AAAAACAACG TCTCACTGCT CGCCTGAACG AACTACGTCA GCAACTGACC GAAGCCGCGC CGCCGCTGCC AGTGGTTTCC GTGCCGCATA TGCGTTGTGA ATGTAATCAG AGCGATGAAG AGTTCGGTGG CGTAGTGCGT TTGTTGCAAA AAGCGATTCG CGCTGGAGAA ATTTTCCAGG TGGTGCCATC TCGCCGTTTC TCTCTGCCCT GCCCGTCACC GCTGGCGGCC TATTACGTGC TGAAAAAGAG TAATCCCAGC CCGTACATGT TTTTTATGCA GGATAATGAT TTCACCCTAT TTGGCGCGTC GCCGGAAAGC TCGCTCAAGT ATGATGCCAC CAGCCGCCAG ATTGAGATCT ACCCGATTGC CGGAACACGC CCACGCGGTC GTCGCGCCGA TGGTTCACTG GACAGAGATC TCGACAGCCG TATTGAACTG GAAATGCGTA CCGATCATAA AGAGCTGTCT GAACATCTGA TGCTGGTTGA TCTCGCCCGT AATGATCTGG CACGCATTTG CACCCCCGGC AGCCGCTACG TCGCCGATCT CACCAAAGTT GACCGTTATT CCTATGTGAT GCACCTCGTC TCTCGCGTAG TCGGCGAACT GCGTCACGAT CTTGACGCCC TGCACGCTTA TCGCGCCTGT ATGAATATGG GGACGTTAAG CGGTGCGCCG AAAGTACGCG CTATGCAGTT AATTGCCGAG GCGGAAGGTC GTCGCCGCGG CAGCTACGGC GGCGCGGTAG GTTATTTCAC CGCGCATGGC GATCTCGACA CCTGCATTGT GATCCGCTCG GCGCTGGTGG AAAACGGTAT CGCCACCGTG CAAGCGGGTG CTGGTGTAGT CCTTGATTCT GTTCCGCAGT CGGAAGCCGA CGAAACCCGT AACAAAGCCC GCGCTGTACT GCGCGCTATT GCCACCGCGC ATCATGCACA GGAGACTTTC TGA
|
Protein sequence | MQTQKPTLEQ LTCEGAYRDN PTALFHQLCG DRPATLLLES ADIDSKDDLK SLLLVDSALR ITALGDTVTI QALSGNGEAL LALLDNALPA GVESEQSPNC RVLRFPPVSP LLDEDARLCS LSVFDAFRLL QNLLNVPKEE REAMFFGGLF SYDLVAGFED LPQLSAENNC PDFCFYLAET LMVIDHQKKS TRIQASLFAP NEEEKQRLTA RLNELRQQLT EAAPPLPVVS VPHMRCECNQ SDEEFGGVVR LLQKAIRAGE IFQVVPSRRF SLPCPSPLAA YYVLKKSNPS PYMFFMQDND FTLFGASPES SLKYDATSRQ IEIYPIAGTR PRGRRADGSL DRDLDSRIEL EMRTDHKELS EHLMLVDLAR NDLARICTPG SRYVADLTKV DRYSYVMHLV SRVVGELRHD LDALHAYRAC MNMGTLSGAP KVRAMQLIAE AEGRRRGSYG GAVGYFTAHG DLDTCIVIRS ALVENGIATV QAGAGVVLDS VPQSEADETR NKARAVLRAI ATAHHAQETF
|
| |