Gene Information |
Locus tag | SNSL254_A1850 |
Symbol | trpE |
ID | 6483396 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 1814899 |
End bp | 1816461 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642737224 |
Product | anthranilate synthase component I |
Protein accession | YP_002040976 |
Protein GI | 194442664 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00565] anthranilate synthase component I, proteobacterial subset |
|
|
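The length fields in the table above are consistent with the listed coordinates. As a minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon (the function names here are illustrative and not part of the database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS that includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1814899, 1816461)
    print(length, protein_length(length))  # prints: 1563 520
```

With the Start bp and End bp values from this record, the sketch reproduces the Gene Length (1563 bp) and Protein Length (520 aa) fields.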
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.174926 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.00154375 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAAACAC CAAAACCCAC GCTCGAACTG TTGACCTGCG ATGCCGCCTA CCGGGAAAAC CCAACGGCGC TTTTTCACCA GGTCTGCGGC GATCGCCCGG CAACGCTGCT GCTGGAATCC GCGGATATCG ACAGTAAAGA CGATTTAAAA AGCCTGCTGC TGGTAGATAG CGCGCTGCGC ATTACCGCTT TAGGTGACAC TGTCACTATT CAGGCGTTAT CTGATAATGG CGCCTCGTTA TTGCCGCTAC TGGATACCGC CCTGCCCGCT GGCGTGGAGA ACGAAGTCCT GCCTGCCGGT CGCATTCTAC GCTTTCCGCC CGTCAGCCCA TTATTAGATG AAGACGCCCG TTTATGCTCT CTGTCGGTAT TTGATGCGTT CCGCCTGTTA CAGGGAGTGG TGAACATACC GACGCAAGAG CGGGAGGCTA TGTTTTTCGG CGGTCTGTTT GCCTACGACC TGGTCGCTGG CTTTGAAGCG CTGCCACACC TTGAGGCTGG CAATAACTGC CCGGACTACT GCTTTTATTT AGCGGAAACG CTGATGGTGA TAGATCATCA GAAAAAAAGC ACCCGTATTC AGGCCAGTCT GTTCACCGCC AGCGACCGGG AAAAACAGCG CCTCAACGCC CGCCTGGCGT ACCTTAGCCA ACAGTTAACC CAGCCTGCGC CGCCGTTGCC GGTGACGCCG GTGCCGGACA TGCGCTGTGA ATGCAATCAG AGCGATGACG CGTTCGGCGC GGTGGTACGC CAGTTGCAAA AAGCCATCCG CGCGGGCGAG ATATTTCAGG TGGTGCCGTC GCGCCGCTTT TCACTGCCCT GCCCGTCGCC GTTGGCTGCC TACTACGTGC TGAAAAAGAG CAACCCCAGC CCGTATATGT TCTTTATGCA GGATAATGAT TTCACGCTTT TCGGCGCGTC GCCGGAAAGC TCGCTGAAAT ATGACGCCGC CAGCCGTCAG ATTGAGATTT ACCCCATCGC GGGTACCCGT CCACGCGGTC GCCGCGCCGA TGGTACGCTG GACAGAGATC TCGACAGCCG TATTGAGCTG GACATGCGTA CCGACCATAA AGAGCTTTCC GAACATCTGA TGCTGGTCGA TCTGGCGCGC AATGACCTGG CGCGCATCTG TACGCCGGGC AGTCGCTACG TTGCCGATCT GACCAAAGTT GACCGCTATT CGTACGTGAT GCATCTGGTT TCCCGGGTAG TGGGCGAACT GCGTCACGAT CTCGACGCGC TGCACGCCTA TCGCGCCTGC ATGAACATGG GCACCCTGAG CGGCGCGCCG AAAGTACGCG CCATGCAGTT GATTGCCGAT GCGGAAGGAC AGCGCCGCGG CAGCTATGGC GGCGCGGTCG GTTACTTCAC CGCCCACGGC GATCTGGACA CCTGTATTGT TATCCGCTCC GCGCTGGTGG AGAACGGTAT CGCCACCGTA CAGGCGGGCG CCGGAATCGT GCTGGACTCT GTTCCGCAGT CTGAAGCCGA TGAAACCCGT AATAAAGCGC GCGCCGTATT GCGTGCTATC GCCACCGCGC ATCATGCACA GGAGACCTTC TGA
|
Protein sequence | MQTPKPTLEL LTCDAAYREN PTALFHQVCG DRPATLLLES ADIDSKDDLK SLLLVDSALR ITALGDTVTI QALSDNGASL LPLLDTALPA GVENEVLPAG RILRFPPVSP LLDEDARLCS LSVFDAFRLL QGVVNIPTQE REAMFFGGLF AYDLVAGFEA LPHLEAGNNC PDYCFYLAET LMVIDHQKKS TRIQASLFTA SDREKQRLNA RLAYLSQQLT QPAPPLPVTP VPDMRCECNQ SDDAFGAVVR QLQKAIRAGE IFQVVPSRRF SLPCPSPLAA YYVLKKSNPS PYMFFMQDND FTLFGASPES SLKYDAASRQ IEIYPIAGTR PRGRRADGTL DRDLDSRIEL DMRTDHKELS EHLMLVDLAR NDLARICTPG SRYVADLTKV DRYSYVMHLV SRVVGELRHD LDALHAYRAC MNMGTLSGAP KVRAMQLIAD AEGQRRGSYG GAVGYFTAHG DLDTCIVIRS ALVENGIATV QAGAGIVLDS VPQSEADETR NKARAVLRAI ATAHHAQETF
|
| |
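The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction. It assumes GC content is the plain fraction of G and C bases; how the database derives its GC content field is not stated in this record, and the helper names are hypothetical.

```python
def normalize_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    seq = normalize_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Only the first two blocks of the gene sequence, to keep the example short.
    first_blocks = "ATGCAAACAC CAAAACCCAC"
    seq = normalize_sequence(first_blocks)
    print(len(seq), gc_fraction(seq))  # prints: 20 0.45
```

Passing the full Gene sequence field to `gc_fraction` would give a value to compare against the GC content field.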