Gene SNSL254_A1850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1850 
SymboltrpE 
ID6483396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1814899 
End bp1816461 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content58% 
IMG OID642737224 
Productanthranilate synthase component I 
Protein accessionYP_002040976 
Protein GI194442664 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00565] anthranilate synthase component I, proteobacterial subset 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.174926 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.00154375 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAAACAC CAAAACCCAC GCTCGAACTG TTGACCTGCG ATGCCGCCTA CCGGGAAAAC 
CCAACGGCGC TTTTTCACCA GGTCTGCGGC GATCGCCCGG CAACGCTGCT GCTGGAATCC
GCGGATATCG ACAGTAAAGA CGATTTAAAA AGCCTGCTGC TGGTAGATAG CGCGCTGCGC
ATTACCGCTT TAGGTGACAC TGTCACTATT CAGGCGTTAT CTGATAATGG CGCCTCGTTA
TTGCCGCTAC TGGATACCGC CCTGCCCGCT GGCGTGGAGA ACGAAGTCCT GCCTGCCGGT
CGCATTCTAC GCTTTCCGCC CGTCAGCCCA TTATTAGATG AAGACGCCCG TTTATGCTCT
CTGTCGGTAT TTGATGCGTT CCGCCTGTTA CAGGGAGTGG TGAACATACC GACGCAAGAG
CGGGAGGCTA TGTTTTTCGG CGGTCTGTTT GCCTACGACC TGGTCGCTGG CTTTGAAGCG
CTGCCACACC TTGAGGCTGG CAATAACTGC CCGGACTACT GCTTTTATTT AGCGGAAACG
CTGATGGTGA TAGATCATCA GAAAAAAAGC ACCCGTATTC AGGCCAGTCT GTTCACCGCC
AGCGACCGGG AAAAACAGCG CCTCAACGCC CGCCTGGCGT ACCTTAGCCA ACAGTTAACC
CAGCCTGCGC CGCCGTTGCC GGTGACGCCG GTGCCGGACA TGCGCTGTGA ATGCAATCAG
AGCGATGACG CGTTCGGCGC GGTGGTACGC CAGTTGCAAA AAGCCATCCG CGCGGGCGAG
ATATTTCAGG TGGTGCCGTC GCGCCGCTTT TCACTGCCCT GCCCGTCGCC GTTGGCTGCC
TACTACGTGC TGAAAAAGAG CAACCCCAGC CCGTATATGT TCTTTATGCA GGATAATGAT
TTCACGCTTT TCGGCGCGTC GCCGGAAAGC TCGCTGAAAT ATGACGCCGC CAGCCGTCAG
ATTGAGATTT ACCCCATCGC GGGTACCCGT CCACGCGGTC GCCGCGCCGA TGGTACGCTG
GACAGAGATC TCGACAGCCG TATTGAGCTG GACATGCGTA CCGACCATAA AGAGCTTTCC
GAACATCTGA TGCTGGTCGA TCTGGCGCGC AATGACCTGG CGCGCATCTG TACGCCGGGC
AGTCGCTACG TTGCCGATCT GACCAAAGTT GACCGCTATT CGTACGTGAT GCATCTGGTT
TCCCGGGTAG TGGGCGAACT GCGTCACGAT CTCGACGCGC TGCACGCCTA TCGCGCCTGC
ATGAACATGG GCACCCTGAG CGGCGCGCCG AAAGTACGCG CCATGCAGTT GATTGCCGAT
GCGGAAGGAC AGCGCCGCGG CAGCTATGGC GGCGCGGTCG GTTACTTCAC CGCCCACGGC
GATCTGGACA CCTGTATTGT TATCCGCTCC GCGCTGGTGG AGAACGGTAT CGCCACCGTA
CAGGCGGGCG CCGGAATCGT GCTGGACTCT GTTCCGCAGT CTGAAGCCGA TGAAACCCGT
AATAAAGCGC GCGCCGTATT GCGTGCTATC GCCACCGCGC ATCATGCACA GGAGACCTTC
TGA
 
Protein sequence
MQTPKPTLEL LTCDAAYREN PTALFHQVCG DRPATLLLES ADIDSKDDLK SLLLVDSALR 
ITALGDTVTI QALSDNGASL LPLLDTALPA GVENEVLPAG RILRFPPVSP LLDEDARLCS
LSVFDAFRLL QGVVNIPTQE REAMFFGGLF AYDLVAGFEA LPHLEAGNNC PDYCFYLAET
LMVIDHQKKS TRIQASLFTA SDREKQRLNA RLAYLSQQLT QPAPPLPVTP VPDMRCECNQ
SDDAFGAVVR QLQKAIRAGE IFQVVPSRRF SLPCPSPLAA YYVLKKSNPS PYMFFMQDND
FTLFGASPES SLKYDAASRQ IEIYPIAGTR PRGRRADGTL DRDLDSRIEL DMRTDHKELS
EHLMLVDLAR NDLARICTPG SRYVADLTKV DRYSYVMHLV SRVVGELRHD LDALHAYRAC
MNMGTLSGAP KVRAMQLIAD AEGQRRGSYG GAVGYFTAHG DLDTCIVIRS ALVENGIATV
QAGAGIVLDS VPQSEADETR NKARAVLRAI ATAHHAQETF