Gene SeAg_B1423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B1423 
SymboltrpE 
ID6793954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp1382897 
End bp1384459 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content58% 
IMG OID642775668 
Productanthranilate synthase component I 
Protein accessionYP_002146304 
Protein GI197251861 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00565] anthranilate synthase component I, proteobacterial subset 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAACAC CAAAACCCAC GCTCGAACTA TTGACCTGCG ATGCCGCCTA CCGGGAAAAC 
CCAACGGCGC TTTTTCATCA GGTCTGCGGC GATCGCCCGG CAACGCTGCT GCTGGAATCC
GCTGATATCG ACAGTAAAGA CGATTTAAAA AGCCTGCTGC TGGTAGATAG CGCGCTGCGC
ATTACCGCTT TAGGTGACAC TGTCACTATT CAGGCGTTAT CTGATAATGG CGCCTCGTTA
TTGCCGCTAC TGGATACCGC CCTGCCCGCT GGCGCGGAGA ACGAAGTCCT GCCTGCCGGT
CGCATTCTAC GCTTTCCGCC CGTCAGCCCA TTATTAGATG AAGACGCCCG TTTATGCTCT
CTGTCGGTAT TTGATGCGTT CCGTCTGTTA CAGGGAGTGG TGAACATACC GACGCAAGAG
CGGGAGGCTA TGTTTTTCGG CGGTCTGTTT GCCTACGACC TGGTCGCTGG CTTTGAAGCG
CTGCCACACC TTGAGGCTGG CAATAACTGC CCGGACTACT GCTTTTATTT AGCGGAAACG
CTGATGGTGA TAGATCATCA GAAAAAAAGC ACCCGTATTC AGGCCAGTCT GTTCACCGCA
AGCGACCGGG AAAAACAGCG CCTCAACGCC CGCCTGGCGT ACCTTAGCCA ACAGTTAACC
CAGCCTGCGC CGCCGTTGCC GGTGACGCCG GTGCCGGACA TGCGCTGTGA ATGCAATCAG
AGCGATGACG CGTTCGGCGC GGTGGTACGC CAGTTGCAAA AAGCCATCCG CGCGGGGGAG
ATATTTCAGG TGGTGCCGTC GCGCCGCTTT TCACTGCCCT GCCCGTCGCC GTTGGCTGCC
TACTATGTGC TGAAAAAGAG CAACCCCAGC CCATATATGT TCTTTATGCA GGATAATGAT
TTCACGCTTT TCGGCGCGTC GCCGGAAAGC TCGCTGAAAT ATGACGCCGC CAGCCGTCAG
ATTGAGATTT ATCCCATCGC GGGTACCCGT CCACGCGGTC GCCGCGCCGA TGGTACGCTG
GACAGAGATC TCGACAGCCG TATTGAGCTG GACATGCGTA CCGACCATAA AGAGCTTTCC
GAACATCTGA TGCTGGTCGA TCTGGCGCGC AATGACCTGG CGCGCATCTG TACGCCGGGC
AGTCGCTACG TTGCCGATCT GACAAAAGTT GACCGCTATT CGTACGTGAT GCATCTGGTT
TCCCGGGTGG TGGGCGAACT GCGTCACGAT CTCGACGCGC TGCACGCCTA TCGCGCCTGC
ATGAACATGG GCACCCTGAG CGGCGCGCCG AAAGTACGCG CCATGCAGTT GATTGCCGAT
GCGGAAGGAC AGCGCCGCGG CAGCTATGGC GGCGCGGTCG GTTACTTCAC CGCCCACGGC
GATCTGGACA CCTGTATTGT TATCCGCTCC GCGCTGGTGG AGAACGGTAT CGCCACCGTA
CAGGCGGGCG CCGGAATTGT GCTGGACTCG GTTCCGCAGT CTGAAGCCGA TGAAACCCGT
AATAAAGCGC GCGCCGTATT GCGTGCTATC GCCACCGCGC ATCATGCACA GGAGACCTTC
TGA
 
Protein sequence
MQTPKPTLEL LTCDAAYREN PTALFHQVCG DRPATLLLES ADIDSKDDLK SLLLVDSALR 
ITALGDTVTI QALSDNGASL LPLLDTALPA GAENEVLPAG RILRFPPVSP LLDEDARLCS
LSVFDAFRLL QGVVNIPTQE REAMFFGGLF AYDLVAGFEA LPHLEAGNNC PDYCFYLAET
LMVIDHQKKS TRIQASLFTA SDREKQRLNA RLAYLSQQLT QPAPPLPVTP VPDMRCECNQ
SDDAFGAVVR QLQKAIRAGE IFQVVPSRRF SLPCPSPLAA YYVLKKSNPS PYMFFMQDND
FTLFGASPES SLKYDAASRQ IEIYPIAGTR PRGRRADGTL DRDLDSRIEL DMRTDHKELS
EHLMLVDLAR NDLARICTPG SRYVADLTKV DRYSYVMHLV SRVVGELRHD LDALHAYRAC
MNMGTLSGAP KVRAMQLIAD AEGQRRGSYG GAVGYFTAHG DLDTCIVIRS ALVENGIATV
QAGAGIVLDS VPQSEADETR NKARAVLRAI ATAHHAQETF