Gene SeSA_A1855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A1855 
SymboltrpE 
ID6516718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp1795929 
End bp1797491 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content58% 
IMG OID642746951 
Productanthranilate synthase component I 
Protein accessionYP_002114754 
Protein GI194735813 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00565] anthranilate synthase component I, proteobacterial subset 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.945253 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.19487 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAACAC TAAAACCCAC GCTCGAACTG TTGACCTGCG ATGCCGCCTA CCGGGAAAAC 
CCAACGGCGC TTTTTCACCA GGTCTGCGGC GATCGCCCGG CAACGCTGCT GCTGGAATCC
GCGGATATCG ACAGTAAAGA TGATTTAAAA AGCCTGCTGC TGGTAGATAG CGCGCTGCGC
ATTACCGCTT TAGGTGACAC TGTCACCATT CAGGCGTTAT CTGATAATGG CGCCTCGTTA
TTGCCGCTAC TGGATACCGC CCTGCCCGCT GGCGTGGAGA ACGAAGTCCT GCCTGCCGGT
CGCGTTCTAC GCTTCCCGCC CGTCAGCCCA TTATTAGATG AAGACGCCCG TTTATGCTCT
CTGTCGGTAT TTGATGCGTT CCGCCTATTA CAGGGGGTGG TGAACATACC AACGCAAGAG
CGGGAGGCTA TGTTTTTCGG CGGTCTGTTT GCCTACGACC TGGTCGCTGG CTTTGAAGCG
CTGCCACACC TTGAGGCTGG CAATAACTGC CCGGACTACT GCTTTTATTT AGCGGAAACG
CTAATGGTGA TAGATCATCA GAAAAAAAGC ACCCGTATTC AGGCCAGTCT GTTCACCGCC
AGTGACCGGG AAAAACAGCG CCTGAACGCC CGCCTGGCGT ACCTTAGCCA ACAGTTAACC
CAGCCTGCGC CGCCGTTGCC GGTGACGCCG GTGCCGGACA TGCGCTGTGA ATGCAATCAG
AGCGATGACG CGTTCGGCGC GGTGGTACGC CAGTTGCAAA AAGCCATCCG CGCGGGCGAG
ATATTTCAGG TGGTGCCGTC GCGCCGCTTT TCACTGCCCT GCCCGTCGCC GTTGGCTGCC
TACTACGTGC TGAAAAAAAG CAATCCCAGC CCGTATATGT TCTTTATGCA GGATAATGAT
TTCACGCTTT TCGGCGCGTC GCCGGAAAGC TCGCTGAAAT ATGACGCCAC CAGCCGTCAG
ATTGAGATTT ATCCCATCGC GGGTACCCGT CCACGCGGTC GCCGCGCCGA TGGTACGCTG
GACAGAGATC TCGACAGCCG TATTGAGCTG GACATGCGTA CCGACCATAA AGAGCTTTCC
GAACATCTGA TGCTGGTCGA TCTGGCGCGC AATGACCTGG CGCGCATCTG TACGCCGGGC
AGTCGCTACG TTGCCGATCT GACAAAAGTT GACCGCTATT CGTACGTGAT GCATCTGGTT
TCCCGGGTGG TGGGCGAACT GCGTCACGAT CTCGACGCTC TGCACGCCTA TCGCGCCTGC
ATGAACATGG GCACCCTGAG CGGCGCGCCG AAAGTACGCG CCATGCAGTT GATTGCCGAT
GCGGAAGGAC AGCGCCGCGG CAGCTATGGC GGCGCTGTCG GTTACTTCAC CGCCCACGGC
GATCTGGACA CCTGTATTGT TATCCGCTCC GCGCTGGTGG AGAACGGTAT CGCCACCGTA
CAGGCGGGCG CCGGAATCGT GCTGGACTCT GTTCCGCAGT CTGAAGCCGA TGAAACCCGT
AATAAAGCGC GCGCCGTATT GCGTGCTATC GCCACCGCGC ATCATGCACA GGAGACCTTC
TGA
 
Protein sequence
MQTLKPTLEL LTCDAAYREN PTALFHQVCG DRPATLLLES ADIDSKDDLK SLLLVDSALR 
ITALGDTVTI QALSDNGASL LPLLDTALPA GVENEVLPAG RVLRFPPVSP LLDEDARLCS
LSVFDAFRLL QGVVNIPTQE REAMFFGGLF AYDLVAGFEA LPHLEAGNNC PDYCFYLAET
LMVIDHQKKS TRIQASLFTA SDREKQRLNA RLAYLSQQLT QPAPPLPVTP VPDMRCECNQ
SDDAFGAVVR QLQKAIRAGE IFQVVPSRRF SLPCPSPLAA YYVLKKSNPS PYMFFMQDND
FTLFGASPES SLKYDATSRQ IEIYPIAGTR PRGRRADGTL DRDLDSRIEL DMRTDHKELS
EHLMLVDLAR NDLARICTPG SRYVADLTKV DRYSYVMHLV SRVVGELRHD LDALHAYRAC
MNMGTLSGAP KVRAMQLIAD AEGQRRGSYG GAVGYFTAHG DLDTCIVIRS ALVENGIATV
QAGAGIVLDS VPQSEADETR NKARAVLRAI ATAHHAQETF