Gene EcE24377A_1460 details


Gene Information

Locus tag: EcE24377A_1460
Symbol: trpB
ID: 5588370
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 1450987
End bp: 1452180
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 54%
IMG OID: 640925153
Product: tryptophan synthase subunit beta
Protein accession: YP_001462558
Protein GI: 157157226
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0133] Tryptophan synthase beta chain
TIGRFAM ID: [TIGR00263] tryptophan synthase, beta subunit
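
The coordinate and length fields above agree with one another, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1450987
end_bp = 1452180

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1194 397
```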


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00292681
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAACAT TACTTAACCC CTATTTTGGT GAGTTTGGCG GCATGTACGT GCCACAAATC 
CTGATGCCTG CTCTGCGCCA GCTGGAAGAA GCTTTTGTCA GTGCGCAAAA AGATCCTGAA
TTTCAGGCTC AGTTCAACGA CCTGCTGAAA AACTATGCCG GGCGTCCAAC CGCGCTGACC
AAATGCCAGA ACATTACAGC CGGGACGAAC ACCACTCTGT ATCTCAAGCG TGAAGATTTG
CTGCACGGCG GCGCGCATAA AACTAACCAG GTGCTCGGTC AGGCTTTACT GGCGAAGCGG
ATGGGTAAAA CCGAAATCAT CGCCGAAACC GGTGCCGGTC AGCATGGCGT GGCGTCGGCC
CTTGCCAGCG CCCTGCTCGG CCTGAAATGC CGTATTTATA TGGGTGCCAA AGACGTAGAA
CGCCAGTCGC CTAACGTTTT TCGTATGCGC TTAATGGGTG CGGAAGTGAT CCCGGTGCAT
AGCGGTTCCG CGACGCTGAA AGATGCCTGT AACGAGGCGC TGCGCGACTG GTCCGGTAGT
TACGAAACCG CGCACTATAT GCTGGGCACC GCAGCTGGCC CGCATCCTTA TCCGACCATT
GTGCGTGAGT TTCAGCGGAT GATTGGCGAA GAAACCAAAG CGCAGATTCT GGAAAGAGAA
GGTCGCCTGC CGGATGCCGT TATCGCCTGT GTTGGCGGCG GTTCGAATGC CATCGGCATG
TTTGCTGATT TCATCAATGA AACCAACGTC GGCCTGATTG GTGTGGAGCC AGGTGGTCAC
GGTATCGAAA CTGGCGAGCA CGGCGCACCG TTAAAACATG GTCGCGTGGG CATCTATTTC
GGTATGAAAG CGCCGATGAT GCAAACCGAA GACGGGCAGA TTGAAGAATC TTACTCCATC
TCCGCCGGAC TGGATTTCCC GTCTGTCGGC CCACAACACG CGTATCTTAA CAGCACTGGA
CGCGCTGATT ACGTGTCTAT TACCGATGAT GAAGCCCTTG AAGCCTTCAA AACGCTGTGC
CTGCACGAAG GGATCATCCC GGCGCTGGAA TCCTCCCACG CCCTGGCCCA TGCGTTGAAA
ATGATGCGCG AAAACCCGGA TAAAGAGCAG CTACTGGTGG TTAACCTTTC CGGTCGCGGC
GATAAAGACA TCTTCACCGT TCACGATATT TTGAAAGCAC GAGGGGAAAT CTGA
 
Protein sequence
MTTLLNPYFG EFGGMYVPQI LMPALRQLEE AFVSAQKDPE FQAQFNDLLK NYAGRPTALT 
KCQNITAGTN TTLYLKREDL LHGGAHKTNQ VLGQALLAKR MGKTEIIAET GAGQHGVASA
LASALLGLKC RIYMGAKDVE RQSPNVFRMR LMGAEVIPVH SGSATLKDAC NEALRDWSGS
YETAHYMLGT AAGPHPYPTI VREFQRMIGE ETKAQILERE GRLPDAVIAC VGGGSNAIGM
FADFINETNV GLIGVEPGGH GIETGEHGAP LKHGRVGIYF GMKAPMMQTE DGQIEESYSI
SAGLDFPSVG PQHAYLNSTG RADYVSITDD EALEAFKTLC LHEGIIPALE SSHALAHALK
MMRENPDKEQ LLVVNLSGRG DKDIFTVHDI LKARGEI
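
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is one way to illustrate that relationship in Python; it is not the tool used to produce this record. It assumes the 10-character blocks are purely cosmetic and that the sequence is read in frame from its first base. The helper names `strip_blocks` and `translate` are hypothetical. Only the first line of each sequence is used here to keep the example short.

```python
from itertools import product

# Amino-acid assignments of NCBI translation table 11, with codons
# enumerated in T, C, A, G order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_blocks(text):
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(text.split())


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = strip_blocks(dna).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the matching start of the protein.
first_dna_line = "ATGACAACAT TACTTAACCC CTATTTTGGT GAGTTTGGCG GCATGTACGT GCCACAAATC"
first_residues = "MTTLLNPYFG EFGGMYVPQI"

print(translate(first_dna_line) == strip_blocks(first_residues))
```

Passing the full gene sequence to `translate` in the same way would be expected to give the full protein sequence, since the final codon of the gene (TGA) is a stop codon.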