Gene Information |
Locus tag | SbBS512_E1489 |
Symbol | trpE |
ID | 6271937 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 1358253 |
End bp | 1359815 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641725589 |
Product | anthranilate synthase component I |
Protein accession | YP_001880095 |
Protein GI | 187732406 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00565] anthranilate synthase component I, proteobacterial subset |
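
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 1358253
END_BP = 1359815


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1563
print(protein_length(length_bp))  # 520
```

Under those assumptions the results agree with the Gene Length (1563 bp) and Protein Length (520 aa) fields.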
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAACAC AAAAACCGAC TCTCGAACTG CTAACCTGCG AAGGCGCTTA TCGCGACAAC CCGACTGCGC TTTTTCACCA GTTGTGTGGG GATCGTCCGG CAACGCTGCT GCTGGAATCC GCAGATATCG ACAGCAAAGA TGATTTAAAA AGCCTGCTAC TGGTAGACAG TGCGCTGCGC ATTACAGCAT TAGGTGACAC TGTCACAATT CAGGCACTTT CCGGCAACGG CGAAGCCCTG CTGACACTAC TGGATAACGC CCTGCCTGCG GGTGTGGAAA ATGAACAATC ACCAAACTGC CGTGTGCTGC GCTTCCCCCC TGTCAGTCCA CTGCTGGATG AAGACGCTCG CTTATGCTCC CTTTCGGTTT TTGACGCTTT CCGTTTATTG CAGAATCTGT TGAATGTACC GAAGGAAGAA CGAGAAGCAA TGTTCTTCGG CGGCCTGTTC TCTTATGACC TTGTGGCGGG ATTTGAAGAT TTACCGCAAC TGTCAGCGGA AAATAACTGC CCTGATTTCT GTTTTTATCT CGCTGAAACG CTGATGGTGA TTGACCATCA GAAAAAAAAC ACCCGCATTC AGGCCAGCCT GTTTGCTCCG AATGAAGAAG AAAAACAACG TCTCACTGCT CGCCTGAACG ATCTTCGCCA GCAGCTGACC GAAGCCGCGC CGCCGCTGCC GGTGGTTTCC GTGCCGCATA TGCGTTGTGA ATGTAACCAG AGCGATGAAG AGTTCGGTGG CGTGGTGCGT TTGTTGCAAA AAGCGATTCG CACCGGAGAA ATTTTCCAGG TGGTGCCGTC TCGCCGTTTC TCTCTGCCCT GCCCGTCACC GCTGGCGGCC TATTACGTGC TGAAAAAGAG TAATCCCAGC CCGTACATGT TTTTTATGCA GGATAATGAT TTCACCCTGT TTGGCGCGTC GCCGGAAAGT TCGCTCAAGT ATGACGCCAC CAGCCGCCAG ATTGAGATCT ACCCGATTGC CGGAACACGC CCGCGCGGTC GTCGCGCCGA TGGTTCACTG GACAGAGACC TCGACAGCCG CATCGAACTG GAAATGCGTA CCGATCATAA AGAGCTTTCT GAACATCTGA TGCTGGTGGA TCTCGCCCGT AATGATCTGG CACGCATTTG CACCCCCGGC AGCCGCTACG TCGCCGATCT TACCAAAGTT GACCGTTACT CTTACGTGAT GCACCTGGTC TCCCGCGTGG TCGGTGAGCT GCGCCACGAT CTCGACGCCC TGCACGCTTA CCGCGCCTGT ATGAATATGG GGACGTTAAG CGGTGCGCCG AAAGTACGCG CCATGCAGTT AATTGCCGAG GCGGAAGGTC GTCGCCGCGG CAGCTACGGC GGCGCGGTAG GTTATTTCAC CGCGCATGGC GATCTCGACA CCTGCATTGT GATCCGCTCA GCGCTGGTGG AAAACGGTAT CGCCACCGTG CAAGCCGGTG CTGGCATAGT CCTTGATTCT GTTCCGCAGT CGGAAGCCGA CGAAACCCGT AATAAAGCCC GCGCTGTACT GCGCGCTATT GCCACCGCGC ATCATGCACA GGAAACTTTC TGA
|
Protein sequence | MQTQKPTLEL LTCEGAYRDN PTALFHQLCG DRPATLLLES ADIDSKDDLK SLLLVDSALR ITALGDTVTI QALSGNGEAL LTLLDNALPA GVENEQSPNC RVLRFPPVSP LLDEDARLCS LSVFDAFRLL QNLLNVPKEE REAMFFGGLF SYDLVAGFED LPQLSAENNC PDFCFYLAET LMVIDHQKKN TRIQASLFAP NEEEKQRLTA RLNDLRQQLT EAAPPLPVVS VPHMRCECNQ SDEEFGGVVR LLQKAIRTGE IFQVVPSRRF SLPCPSPLAA YYVLKKSNPS PYMFFMQDND FTLFGASPES SLKYDATSRQ IEIYPIAGTR PRGRRADGSL DRDLDSRIEL EMRTDHKELS EHLMLVDLAR NDLARICTPG SRYVADLTKV DRYSYVMHLV SRVVGELRHD LDALHAYRAC MNMGTLSGAP KVRAMQLIAE AEGRRRGSYG GAVGYFTAHG DLDTCIVIRS ALVENGIATV QAGAGIVLDS VPQSEADETR NKARAVLRAI ATAHHAQETF
|
| |
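
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling, that strips the spacing, computes GC content, and translates a coding sequence. It assumes the sequence contains only A, C, G and T, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons). The helper names are hypothetical.

```python
# Codon table built from the conventional TCAG ordering.
# Amino-acid assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
gene_prefix = "ATGCAAACAC AAAAACCGAC TCTCGAACTG"
print(translate(gene_prefix))  # MQTQKPTLEL
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field, and `len(translate(...))` can be compared with the Protein Length field.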