Gene ECH74115_1893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1893 
SymboltrpB 
ID6971209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1785256 
End bp1786449 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content55% 
IMG OID643385827 
Producttryptophan synthase subunit beta 
Protein accessionYP_002270316 
Protein GI209396478 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.964383 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.00187983 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAACAT TACTTAACCC CTATTTTGGT GAGTTTGGCG GCATGTACGT GCCACAAATC 
CTGATGCCTG CTCTGCGCCA GCTGGAAGAA GCTTTTGTCA GCGCGCAAAA AGATCCTGAA
TTTCAGGCTC AGTTCAACGA CCTGCTGAAA AACTATGCCG GGCGTCCAAC CGCGCTGACC
AAATGCCAGA ACATTACGGC TGGCACGAAC ACCACGCTGT ATCTGAAGCG CGAAGATTTG
CTGCACGGCG GCGCGCATAA AACTAACCAG GTGCTCGGTC AGGCTTTACT GGCGAAGCGG
ATGGGTAAAA CCGAAATCAT CGCCGAAACC GGGGCCGGTC AGCATGGCGT GGCGTCGGCC
CTTGCCAGCG CCCTGCTCGG CCTGAAATGC CGTATTTATA TGGGGGCCAA AGACGTTGAA
CGCCAGTCGC CTAACGTTTT TCGTATGCGT TTAATGGGTG CGGAAGTGAT CCCGGTGCAT
AGCGGTTCCG CGACGCTGAA AGATGCCTGT AACGAGGCGC TGCGCGACTG GTCCGGCAGT
TATGAAACCG CGCACTATAT GCTGGGCACC GCAGCTGGCC CACATCCTTA TCCGACCATT
GTCCGTGAGT TTCAGCGGAT GATTGGCGAA GAAACCAAAG CACAGATTCT GGAAAGAGAA
GGTCGCCTGC CGGATGCCGT TATCGCCTGT GTTGGCGGCG GTTCGAATGC CATCGGCATG
TTTGCAGATT TCATCAACGA AACCAACGTC GGCCTGATTG GTGTGGAGCC TGGTGGTCAC
GGTATCGAAA CTGGCGAGCA CGGCGCACCG CTAAAACATG GTCGCGTGGG TATCTATTTC
GGTATGAAAG CGCCGATGAT GCAAACCGAA GACGGGCAGA TTGAAGAATC TTACTCCATC
TCCGCCGGAC TCGATTTTCC GTCCGTCGGC CCACAACACG CGTATCTTAA CAGCACTGGA
CGCGCTGATT ATGTGTCAAT TACCGATGAT GAAGCACTGG AAGCCTTCAA AACGCTGTGC
CTGCACGAAG GGATTATCCC GGCGCTGGAA TCCTCCCACG CCCTGGCCCA TGCGCTGAAA
ATGATGCGCG AAAACCCGGA AAAAGAGCAG CTACTGGTGG TTAACCTTTC CGGTCGCGGC
GATAAAGACA TCTTCACCGT TCACGATATT TTGAAAGCAC GAGGGGAAAT CTGA
 
Protein sequence
MTTLLNPYFG EFGGMYVPQI LMPALRQLEE AFVSAQKDPE FQAQFNDLLK NYAGRPTALT 
KCQNITAGTN TTLYLKREDL LHGGAHKTNQ VLGQALLAKR MGKTEIIAET GAGQHGVASA
LASALLGLKC RIYMGAKDVE RQSPNVFRMR LMGAEVIPVH SGSATLKDAC NEALRDWSGS
YETAHYMLGT AAGPHPYPTI VREFQRMIGE ETKAQILERE GRLPDAVIAC VGGGSNAIGM
FADFINETNV GLIGVEPGGH GIETGEHGAP LKHGRVGIYF GMKAPMMQTE DGQIEESYSI
SAGLDFPSVG PQHAYLNSTG RADYVSITDD EALEAFKTLC LHEGIIPALE SSHALAHALK
MMRENPEKEQ LLVVNLSGRG DKDIFTVHDI LKARGEI