Gene SeD_A1605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1605 
SymboltrpE 
ID6873403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1548495 
End bp1550057 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content58% 
IMG OID642784751 
Productanthranilate synthase component I 
Protein accessionYP_002215419 
Protein GI198244516 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00565] anthranilate synthase component I, proteobacterial subset 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.443711 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.000400444 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAAACAC CAAAACCCAC GCTCGAACTG TTGACCTGCG ATGCCGCCTA TCGGGAAAAC 
CCAACGGCGC TTTTTCACCA GGTCTGCGGC GATCGCCCGG CAACGCTGCT GCTGGAATCC
GCGGATATCG ACAGTAAAGA TGATTTAAAA AGCCTGCTGC TGGTAGATAG CGCGCTGCGC
ATTACCGCTT TAGGTGACAC TGTCACCATT CAGGCGTTAT CTGATAATGG CGCCTCGTTA
TTGCCGCTAC TGGATACCGC CCTGCCCGCT GGCGTGGAGA ACGAAGTCCT GCCTGCCGGT
CGCGTTCTAC GCTTCCCGCC CGTCAGCCCA TTATTAGATG AAGACGCCCG TTTATGCTCT
CTGTCGGTAT TTGATGCGTT CCGTCTGTTA CAGGGAGTGG TGAACATACC GACGCAAGAG
CGGGAGGCTA TGTTTTTCGG CGGTCTGTTT GCCTACGACC TGGTCGCTGG CTTTGAAGCG
CTGCCACACC TTGAGGCTGG CAATAACTGC CCGGACTACT GCTTTTATTT AGCGGAAACG
CTGATGGTCA TAGATCATCA GAAAAAAAGC ACCCGTATTC AGGCCAGTCT GTTCACCGCA
AGCGACCGGG AAAAACAGCG CCTGAACGCC CGCCTGGCTT ACCTTAGCCA ACAGTTAACC
CAGCCTGCGC CGCCGTTGCC GGTCACGCCG GTGCCGGACA TGCGCTGCGA ATGCAATCAG
AGCGATGACG CGTTCGGCGC GGTGGTACGC CAGTTGCAAA AAGCCATCCG CGCGGGGGAG
ATATTTCAGG TGGTGCCGTC GCGCCGCTTT TCACTGCCCT GCCCGTCGCC GTTGGCTGCC
TACTACGTGC TGAAAAAGAG CAACCCCAGC CCGTATATGT TCTTTATGCA GGATAATGAT
TTCACGCTTT TCGGCGCGTC GCCGGAAAGC TCGCTGAAAT ATGACGCCGC CAGTCGTCAG
ATTGAGATTT ACCCCATCGC GGGTACCCGT CCACGCGGTC GCCGCGCCGA TGGTACGCTG
GACAGAGATC TCGACAGCCG TATTGAGCTG GACATGCGTA CCGACCATAA AGAGCTTTCC
GAACATCTGA TGCTGGTCGA TCTGGCGCGC AATGACCTGG CGCGCATCTG TACGCCGGGC
AGTCGCTACG TTGCCGATCT GACCAAAGTT GACCGCTATT CGTACGTGAT GCATCTGGTT
TCCCGGGTAG TGGGCGAACT GCGTCACGAT CTCGACGCGC TGCACGCCTA TCGCGCCTGC
ATGAACATGG GCACCCTGAG CGGCGCGCCG AAAGTACGCG CCATGCAGTT GATTGCCGAT
GCGGAAGGAC AGCGCCGCGG CAGCTATGGC GGCGCGGTCG GTTACTTCAC CGCCCACGGC
GATCTGGACA CCTGTATTGT TATCCGCTCC GCGCTGGTGG AGAACGGTAT CGCCACCGTA
CAGGCGGGCG CCGGAATCGT GCTGGACTCT GTTCCGCAGT CTGAAGCCGA TGAAACCCGT
AATAAAGCGC GCGCCGTATT GCGTGCTATC GCCACCGCGC ATCATGCACA GGAGACCTTC
TGA
 
Protein sequence
MQTPKPTLEL LTCDAAYREN PTALFHQVCG DRPATLLLES ADIDSKDDLK SLLLVDSALR 
ITALGDTVTI QALSDNGASL LPLLDTALPA GVENEVLPAG RVLRFPPVSP LLDEDARLCS
LSVFDAFRLL QGVVNIPTQE REAMFFGGLF AYDLVAGFEA LPHLEAGNNC PDYCFYLAET
LMVIDHQKKS TRIQASLFTA SDREKQRLNA RLAYLSQQLT QPAPPLPVTP VPDMRCECNQ
SDDAFGAVVR QLQKAIRAGE IFQVVPSRRF SLPCPSPLAA YYVLKKSNPS PYMFFMQDND
FTLFGASPES SLKYDAASRQ IEIYPIAGTR PRGRRADGTL DRDLDSRIEL DMRTDHKELS
EHLMLVDLAR NDLARICTPG SRYVADLTKV DRYSYVMHLV SRVVGELRHD LDALHAYRAC
MNMGTLSGAP KVRAMQLIAD AEGQRRGSYG GAVGYFTAHG DLDTCIVIRS ALVENGIATV
QAGAGIVLDS VPQSEADETR NKARAVLRAI ATAHHAQETF