Gene B21_01248 details

Gene Information

Locus tag: B21_01248
Symbol: trpE
ID: 8112855
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 1306382
End bp: 1307944
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 54%
IMG OID: 644847499
Product: hypothetical protein
Protein accession: YP_002999072
Protein GI: 251784768
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00565] anthranilate synthase component I, proteobacterial subset
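
The length fields in this record are consistent with the start and end coordinates. A minimal Python sketch of that arithmetic follows. It assumes the coordinates are 1-based and inclusive, and that the stop codon counts toward the gene length but not the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1306382, 1307944)
print(length, protein_length_aa(length))  # 1563 520
```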


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAACAC AAAAACCGAC TCTCGAACAG CTAACCTGCG AAGGCGCTTA TCGCGACAAT 
CCCACCGCGC TTTTTCACCA GTTGTGTGGG GATCGTCCGG CAACGCTGCT GCTGGAATCC
GCAGATATCG ACAGCAAAGA TGATTTAAAA AGCCTGCTGC TGGTAGACAG TGCGCTGCGC
ATTACAGCTT TAGGTGACAC TGTCACAATC CAGGCACTTT CCGGCAACGG CGAAGCCCTG
CTGGCACTAC TGGATAACGC CCTGCCTGCG GGTGTGGAAA GTGAACAATC ACCAAACTGC
CGTGTGCTGC GCTTCCCCCC TGTCAGTCCA CTGCTGGATG AAGACGCCCG CTTATGCTCC
CTTTCGGTTT TTGACGCTTT CCGTTTATTG CAGAATCTGT TGAATGTACC GAAGGAAGAA
CGAGAAGCCA TGTTCTTCGG CGGCCTGTTC TCTTATGACC TTGTGGCGGG ATTTGAAGAT
TTACCGCAAC TGTCAGCGGA AAATAACTGC CCTGATTTCT GTTTTTATCT CGCTGAAACG
CTGATGGTGA TTGACCATCA GAAAAAAAGC ACCCGTATTC AGGCCAGCCT GTTTGCTCCG
AATGAAGAAG AAAAACAACG TCTCACTGCT CGCCTGAACG AACTACGTCA GCAACTGACC
GAAGCCGCGC CGCCGCTGCC AGTGGTTTCC GTGCCGCATA TGCGTTGTGA ATGTAATCAG
AGCGATGAAG AGTTCGGTGG CGTAGTGCGT TTGTTGCAAA AAGCGATTCG CGCTGGAGAA
ATTTTCCAGG TGGTGCCATC TCGCCGTTTC TCTCTGCCCT GCCCGTCACC GCTGGCGGCC
TATTACGTGC TGAAAAAGAG TAATCCCAGC CCGTACATGT TTTTTATGCA GGATAATGAT
TTCACCCTAT TTGGCGCGTC GCCGGAAAGC TCGCTCAAGT ATGATGCCAC CAGCCGCCAG
ATTGAGATCT ACCCGATTGC CGGAACACGC CCACGCGGTC GTCGCGCCGA TGGTTCACTG
GACAGAGATC TCGACAGCCG TATTGAACTG GAAATGCGTA CCGATCATAA AGAGCTGTCT
GAACATCTGA TGCTGGTTGA TCTCGCCCGT AATGATCTGG CACGCATTTG CACCCCCGGC
AGCCGCTACG TCGCCGATCT CACCAAAGTT GACCGTTATT CCTATGTGAT GCACCTCGTC
TCTCGCGTAG TCGGCGAACT GCGTCACGAT CTTGACGCCC TGCACGCTTA TCGCGCCTGT
ATGAATATGG GGACGTTAAG CGGTGCGCCG AAAGTACGCG CTATGCAGTT AATTGCCGAG
GCGGAAGGTC GTCGCCGCGG CAGCTACGGC GGCGCGGTAG GTTATTTCAC CGCGCATGGC
GATCTCGACA CCTGCATTGT GATCCGCTCG GCGCTGGTGG AAAACGGTAT CGCCACCGTG
CAAGCGGGTG CTGGTGTAGT CCTTGATTCT GTTCCGCAGT CGGAAGCCGA CGAAACCCGT
AACAAAGCCC GCGCTGTACT GCGCGCTATT GCCACCGCGC ATCATGCACA GGAGACTTTC
TGA
 
Protein sequence
MQTQKPTLEQ LTCEGAYRDN PTALFHQLCG DRPATLLLES ADIDSKDDLK SLLLVDSALR 
ITALGDTVTI QALSGNGEAL LALLDNALPA GVESEQSPNC RVLRFPPVSP LLDEDARLCS
LSVFDAFRLL QNLLNVPKEE REAMFFGGLF SYDLVAGFED LPQLSAENNC PDFCFYLAET
LMVIDHQKKS TRIQASLFAP NEEEKQRLTA RLNELRQQLT EAAPPLPVVS VPHMRCECNQ
SDEEFGGVVR LLQKAIRAGE IFQVVPSRRF SLPCPSPLAA YYVLKKSNPS PYMFFMQDND
FTLFGASPES SLKYDATSRQ IEIYPIAGTR PRGRRADGSL DRDLDSRIEL EMRTDHKELS
EHLMLVDLAR NDLARICTPG SRYVADLTKV DRYSYVMHLV SRVVGELRHD LDALHAYRAC
MNMGTLSGAP KVRAMQLIAE AEGRRRGSYG GAVGYFTAHG DLDTCIVIRS ALVENGIATV
QAGAGVVLDS VPQSEADETR NKARAVLRAI ATAHHAQETF
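
Both sequences above are printed in space-separated blocks of 10 characters. A minimal Python sketch for joining the blocks and running basic checks on the coding sequence follows. It assumes the stop codons of translation table 11 (TAA, TAG, TGA). The helper names `join_blocks` and `check_cds` are hypothetical.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def check_cds(seq: str) -> list:
    """Return a list of problems found in a coding sequence (empty if none)."""
    problems = []
    if len(seq) % 3 != 0:
        problems.append("length is not a multiple of 3")
    if seq[-3:] not in STOP_CODONS:
        problems.append("sequence does not end with a stop codon")
    return problems


# Toy input in the same layout as the record (not the full gene).
cds = join_blocks("ATGCAAACAC AA\nTGA")
print(cds)             # ATGCAAACACAATGA
print(check_cds(cds))  # []
```

Pasting the full gene sequence from this record into `join_blocks` and comparing `len(...)` with the Gene Length field is a quick way to confirm that the copy is complete.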