Gene EcE24377A_1463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_1463 
SymboltrpE 
ID5590594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp1455149 
End bp1456711 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content55% 
IMG OID640925157 
Productanthranilate synthase component I 
Protein accessionYP_001462562 
Protein GI157157578 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00565] anthranilate synthase component I, proteobacterial subset 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.40703 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAACAC AAAAACCGAC TCTCGAACTG CTAACCTGCG AAGGCGCTTA TCGCGACAAC 
CCGACTGCGC TTTTTCACCA GTTGTGTGGG AATCGTCCGG CAACGCTGCT GCTGGAATCC
GCAGATATCG ACAGCAAAGA TGATTTAAAA AGCCTGCTGC TGGCAGACAG TGCGCTGCGC
ATTACAGCTT TAGGTGACAC TGTCACAATC CAGGCATTTT CCGGCAACGG CGAAGCCCTG
CTGACACTAC TGGATAACGC CCTGCCTGCG GGTGTGGAAA ATGAACAATT ACCAAACTGC
CGTGTGCTGC GCTTCCCCCC TGTCAGTCCA CTGCTGGATG AAGACGCTCG CTTATGCTCC
CTTTCGGTTT TTGACGCTTT CCGTTTATTG CAGAATCTGT TGAATGTACC GAAGGAAGAA
CGAGAAGCAA TGTTCTTCGG CGGCCTGTTC TCTTATGACC TTGTGGCGGG ATTTGAAGAT
TTACCGCAAC TGTCAGCGGA AAATAACTGC CCTGATTTCT GTTTTTATCT CGCTGAAACG
CTGATGGTGA TTGACCATCA GAAAAAAAGC ACCCGTATTC AGGCCAGCCT GTTTGCTCCG
AATGAAGAAG AAAAACAACG TCTCACTGCT CGCCTGAACG AACTACGTCA GCAACTGACC
GAAGCCGCGC CGCCGCTGCC GGTGGTTTCC GTGCCGCATA TGCGTTGTGA ATGTAATCAG
AGCGATGAAG AGTTCGGTGG CGTAGTGCGT TTGTTGCAAA AAGCGATTCG CGCTGGAGAA
ATTTTCCAGG TGGTGCCATC TCGCCGTTTT TCTCTGCCCT GCCCGTCACC GCTGGCGGCC
TATTACGTGC TGAAAAAGAG TAATCCCAGC CCGTACATGT TTTTTATGCA GGATAATGAT
TTCACCCTGT TTGGCGCGTC GCCGGAAAGT TCGCTCAAGT ATGACGCCAC CAGCCGCCAG
ATTGAGATCT ACCCGATTGC CGGAACACGT CCACGCGGTC GTCGTGCCGA TGGCTCGCTG
GACAGAGACC TCGACAGCCG CATCGAACTG GAAATGCGTA CCGATCATAA AGAGCTTTCT
GAACATCTGA TGCTGGTGGA TCTTGCCCGT AATGATCTGG CACGCATTTG CACCCCCGGC
AGCCGCTACG TCGCCGACCT CACCAAAGTT GACCGTTACT CTTACGTGAT GCACCTGGTC
TCCCGCGTGG TCGGTGAGCT GCGCCACGAT CTCGACGCCC TACACGCTTA CCGCGCCTGT
ATGAATATGG GGACGTTAAG CGGTGCGCCG AAAGTGCGCG CCATGCAGTT AATTGCCGAG
GCGGAAGGTC GTCGCCGCGG CAGCTACGGC GGCGCGGTAG GTTATTTCAC CGCGCACGGC
GATCTCGACA CCTGCATTGT GATCCGCTCG GCGCTGGTGG AAAACGGTAT CGCCACCGTG
CAAGCCGGTG CTGGCGTAGT CCTTGATTCT ATTCCGCAGT CGGAAGCCGA CGAAACCCGT
AATAAAGCCC GCGCTGTACT GCGCGCTATT GCCACCGCGC ATCATGCACA GGAGACTTTC
TGA
 
Protein sequence
MQTQKPTLEL LTCEGAYRDN PTALFHQLCG NRPATLLLES ADIDSKDDLK SLLLADSALR 
ITALGDTVTI QAFSGNGEAL LTLLDNALPA GVENEQLPNC RVLRFPPVSP LLDEDARLCS
LSVFDAFRLL QNLLNVPKEE REAMFFGGLF SYDLVAGFED LPQLSAENNC PDFCFYLAET
LMVIDHQKKS TRIQASLFAP NEEEKQRLTA RLNELRQQLT EAAPPLPVVS VPHMRCECNQ
SDEEFGGVVR LLQKAIRAGE IFQVVPSRRF SLPCPSPLAA YYVLKKSNPS PYMFFMQDND
FTLFGASPES SLKYDATSRQ IEIYPIAGTR PRGRRADGSL DRDLDSRIEL EMRTDHKELS
EHLMLVDLAR NDLARICTPG SRYVADLTKV DRYSYVMHLV SRVVGELRHD LDALHAYRAC
MNMGTLSGAP KVRAMQLIAE AEGRRRGSYG GAVGYFTAHG DLDTCIVIRS ALVENGIATV
QAGAGVVLDS IPQSEADETR NKARAVLRAI ATAHHAQETF