Gene SbBS512_E1489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1489 
SymboltrpE 
ID6271937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1358253 
End bp1359815 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content55% 
IMG OID641725589 
Productanthranilate synthase component I 
Protein accessionYP_001880095 
Protein GI187732406 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00565] anthranilate synthase component I, proteobacterial subset 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAACAC AAAAACCGAC TCTCGAACTG CTAACCTGCG AAGGCGCTTA TCGCGACAAC 
CCGACTGCGC TTTTTCACCA GTTGTGTGGG GATCGTCCGG CAACGCTGCT GCTGGAATCC
GCAGATATCG ACAGCAAAGA TGATTTAAAA AGCCTGCTAC TGGTAGACAG TGCGCTGCGC
ATTACAGCAT TAGGTGACAC TGTCACAATT CAGGCACTTT CCGGCAACGG CGAAGCCCTG
CTGACACTAC TGGATAACGC CCTGCCTGCG GGTGTGGAAA ATGAACAATC ACCAAACTGC
CGTGTGCTGC GCTTCCCCCC TGTCAGTCCA CTGCTGGATG AAGACGCTCG CTTATGCTCC
CTTTCGGTTT TTGACGCTTT CCGTTTATTG CAGAATCTGT TGAATGTACC GAAGGAAGAA
CGAGAAGCAA TGTTCTTCGG CGGCCTGTTC TCTTATGACC TTGTGGCGGG ATTTGAAGAT
TTACCGCAAC TGTCAGCGGA AAATAACTGC CCTGATTTCT GTTTTTATCT CGCTGAAACG
CTGATGGTGA TTGACCATCA GAAAAAAAAC ACCCGCATTC AGGCCAGCCT GTTTGCTCCG
AATGAAGAAG AAAAACAACG TCTCACTGCT CGCCTGAACG ATCTTCGCCA GCAGCTGACC
GAAGCCGCGC CGCCGCTGCC GGTGGTTTCC GTGCCGCATA TGCGTTGTGA ATGTAACCAG
AGCGATGAAG AGTTCGGTGG CGTGGTGCGT TTGTTGCAAA AAGCGATTCG CACCGGAGAA
ATTTTCCAGG TGGTGCCGTC TCGCCGTTTC TCTCTGCCCT GCCCGTCACC GCTGGCGGCC
TATTACGTGC TGAAAAAGAG TAATCCCAGC CCGTACATGT TTTTTATGCA GGATAATGAT
TTCACCCTGT TTGGCGCGTC GCCGGAAAGT TCGCTCAAGT ATGACGCCAC CAGCCGCCAG
ATTGAGATCT ACCCGATTGC CGGAACACGC CCGCGCGGTC GTCGCGCCGA TGGTTCACTG
GACAGAGACC TCGACAGCCG CATCGAACTG GAAATGCGTA CCGATCATAA AGAGCTTTCT
GAACATCTGA TGCTGGTGGA TCTCGCCCGT AATGATCTGG CACGCATTTG CACCCCCGGC
AGCCGCTACG TCGCCGATCT TACCAAAGTT GACCGTTACT CTTACGTGAT GCACCTGGTC
TCCCGCGTGG TCGGTGAGCT GCGCCACGAT CTCGACGCCC TGCACGCTTA CCGCGCCTGT
ATGAATATGG GGACGTTAAG CGGTGCGCCG AAAGTACGCG CCATGCAGTT AATTGCCGAG
GCGGAAGGTC GTCGCCGCGG CAGCTACGGC GGCGCGGTAG GTTATTTCAC CGCGCATGGC
GATCTCGACA CCTGCATTGT GATCCGCTCA GCGCTGGTGG AAAACGGTAT CGCCACCGTG
CAAGCCGGTG CTGGCATAGT CCTTGATTCT GTTCCGCAGT CGGAAGCCGA CGAAACCCGT
AATAAAGCCC GCGCTGTACT GCGCGCTATT GCCACCGCGC ATCATGCACA GGAAACTTTC
TGA
 
Protein sequence
MQTQKPTLEL LTCEGAYRDN PTALFHQLCG DRPATLLLES ADIDSKDDLK SLLLVDSALR 
ITALGDTVTI QALSGNGEAL LTLLDNALPA GVENEQSPNC RVLRFPPVSP LLDEDARLCS
LSVFDAFRLL QNLLNVPKEE REAMFFGGLF SYDLVAGFED LPQLSAENNC PDFCFYLAET
LMVIDHQKKN TRIQASLFAP NEEEKQRLTA RLNDLRQQLT EAAPPLPVVS VPHMRCECNQ
SDEEFGGVVR LLQKAIRTGE IFQVVPSRRF SLPCPSPLAA YYVLKKSNPS PYMFFMQDND
FTLFGASPES SLKYDATSRQ IEIYPIAGTR PRGRRADGSL DRDLDSRIEL EMRTDHKELS
EHLMLVDLAR NDLARICTPG SRYVADLTKV DRYSYVMHLV SRVVGELRHD LDALHAYRAC
MNMGTLSGAP KVRAMQLIAE AEGRRRGSYG GAVGYFTAHG DLDTCIVIRS ALVENGIATV
QAGAGIVLDS VPQSEADETR NKARAVLRAI ATAHHAQETF