Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A1855 |
Symbol | trpE |
ID | 6516718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | + |
Start bp | 1795929 |
End bp | 1797491 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642746951 |
Product | anthranilate synthase component I |
Protein accession | YP_002114754 |
Protein GI | 194735813 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00565] anthranilate synthase component I, proteobacterial subset |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.945253 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.19487 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACAC TAAAACCCAC GCTCGAACTG TTGACCTGCG ATGCCGCCTA CCGGGAAAAC CCAACGGCGC TTTTTCACCA GGTCTGCGGC GATCGCCCGG CAACGCTGCT GCTGGAATCC GCGGATATCG ACAGTAAAGA TGATTTAAAA AGCCTGCTGC TGGTAGATAG CGCGCTGCGC ATTACCGCTT TAGGTGACAC TGTCACCATT CAGGCGTTAT CTGATAATGG CGCCTCGTTA TTGCCGCTAC TGGATACCGC CCTGCCCGCT GGCGTGGAGA ACGAAGTCCT GCCTGCCGGT CGCGTTCTAC GCTTCCCGCC CGTCAGCCCA TTATTAGATG AAGACGCCCG TTTATGCTCT CTGTCGGTAT TTGATGCGTT CCGCCTATTA CAGGGGGTGG TGAACATACC AACGCAAGAG CGGGAGGCTA TGTTTTTCGG CGGTCTGTTT GCCTACGACC TGGTCGCTGG CTTTGAAGCG CTGCCACACC TTGAGGCTGG CAATAACTGC CCGGACTACT GCTTTTATTT AGCGGAAACG CTAATGGTGA TAGATCATCA GAAAAAAAGC ACCCGTATTC AGGCCAGTCT GTTCACCGCC AGTGACCGGG AAAAACAGCG CCTGAACGCC CGCCTGGCGT ACCTTAGCCA ACAGTTAACC CAGCCTGCGC CGCCGTTGCC GGTGACGCCG GTGCCGGACA TGCGCTGTGA ATGCAATCAG AGCGATGACG CGTTCGGCGC GGTGGTACGC CAGTTGCAAA AAGCCATCCG CGCGGGCGAG ATATTTCAGG TGGTGCCGTC GCGCCGCTTT TCACTGCCCT GCCCGTCGCC GTTGGCTGCC TACTACGTGC TGAAAAAAAG CAATCCCAGC CCGTATATGT TCTTTATGCA GGATAATGAT TTCACGCTTT TCGGCGCGTC GCCGGAAAGC TCGCTGAAAT ATGACGCCAC CAGCCGTCAG ATTGAGATTT ATCCCATCGC GGGTACCCGT CCACGCGGTC GCCGCGCCGA TGGTACGCTG GACAGAGATC TCGACAGCCG TATTGAGCTG GACATGCGTA CCGACCATAA AGAGCTTTCC GAACATCTGA TGCTGGTCGA TCTGGCGCGC AATGACCTGG CGCGCATCTG TACGCCGGGC AGTCGCTACG TTGCCGATCT GACAAAAGTT GACCGCTATT CGTACGTGAT GCATCTGGTT TCCCGGGTGG TGGGCGAACT GCGTCACGAT CTCGACGCTC TGCACGCCTA TCGCGCCTGC ATGAACATGG GCACCCTGAG CGGCGCGCCG AAAGTACGCG CCATGCAGTT GATTGCCGAT GCGGAAGGAC AGCGCCGCGG CAGCTATGGC GGCGCTGTCG GTTACTTCAC CGCCCACGGC GATCTGGACA CCTGTATTGT TATCCGCTCC GCGCTGGTGG AGAACGGTAT CGCCACCGTA CAGGCGGGCG CCGGAATCGT GCTGGACTCT GTTCCGCAGT CTGAAGCCGA TGAAACCCGT AATAAAGCGC GCGCCGTATT GCGTGCTATC GCCACCGCGC ATCATGCACA GGAGACCTTC TGA
|
Protein sequence | MQTLKPTLEL LTCDAAYREN PTALFHQVCG DRPATLLLES ADIDSKDDLK SLLLVDSALR ITALGDTVTI QALSDNGASL LPLLDTALPA GVENEVLPAG RVLRFPPVSP LLDEDARLCS LSVFDAFRLL QGVVNIPTQE REAMFFGGLF AYDLVAGFEA LPHLEAGNNC PDYCFYLAET LMVIDHQKKS TRIQASLFTA SDREKQRLNA RLAYLSQQLT QPAPPLPVTP VPDMRCECNQ SDDAFGAVVR QLQKAIRAGE IFQVVPSRRF SLPCPSPLAA YYVLKKSNPS PYMFFMQDND FTLFGASPES SLKYDATSRQ IEIYPIAGTR PRGRRADGTL DRDLDSRIEL DMRTDHKELS EHLMLVDLAR NDLARICTPG SRYVADLTKV DRYSYVMHLV SRVVGELRHD LDALHAYRAC MNMGTLSGAP KVRAMQLIAD AEGQRRGSYG GAVGYFTAHG DLDTCIVIRS ALVENGIATV QAGAGIVLDS VPQSEADETR NKARAVLRAI ATAHHAQETF
|
| |