Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1871 |
Symbol | trpB |
ID | 6143055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1895997 |
End bp | 1897190 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641616747 |
Product | tryptophan synthase subunit beta |
Protein accession | YP_001743925 |
Protein GI | 170683411 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0133] Tryptophan synthase beta chain |
TIGRFAM ID | [TIGR00263] tryptophan synthase, beta subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.000000282354 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAACAT TACTTAACCC GTATTTTGGT GAGTTTGGCG GCATGTACGT GCCACAAATC CTGATGCCTG CTCTGCGCCA GCTGGAAGAA GCTTTTGTCA GTGCGCAAAA AGATCCTGAA TTTCAGGCTC AGTTCAACGA CCTGCTGAAA AACTATGCCG GGCGTCCAAC CGCGCTGACC AAATGCCAGA ACATTACAGC CGGGACGAAC ACCACGCTGT ATCTGAAGCG CGAAGATTTG CTGCACGGCG GCGCGCATAA AACTAACCAG GTGCTGGGAC AGGCGTTGCT GGCGAAGCGG ATGGGTAAAA CGGAAATCAT CGCCGAAACC GGAGCCGGTC AGCATGGCGT GGCGTCGGCC CTTGCCAGCG CCCTGCTCGG CCTGAAATGC CGTATTTATA TGGGTGCCAA AGACGTTGAA CGCCAGTCGC CAAACGTTTT TCGTATGCGC TTAATGGGTG CGGAAGTGAT CCCGGTACAT AGCGGTTCCG CGACGCTGAA AGATGCCTGT AACGAGGCGC TGCGCGACTG GTCCGGCAGT TATGAAACCG CGCACTATAT GCTGGGCACC GCAGCTGGCC CACATCCTTA TCCGACCATT GTCCGTGAGT TTCAGCGAAT GATTGGCGAA GAAACCAAAG CACAGATTCT GGAAAGAGAA GGTCGCCTGC CGGATGCGGT TATCGCCTGT GTTGGCGGCG GTTCGAATGC CATCGGCATG TTTGCAGATT TCATCAACGA AACCGACGTC GGCCTGATTG GTGTGGAGCC TGGCGGCCAC GGTATCGAAA CTGGCGAGCA CGGCGCACCG TTAAAACATG GTCGCGTGGG CATCTATTTC GGTATGAAAG CGCCGATGAT GCAAACCGAA GACGGGCAAA TTGAAGAGTC TTACTCCATT TCTGCCGGGC TGGATTTCCC GTCCGTCGGC CCGCAACATG CGTATCTCAA CAGCACTGGA CGCGCTGATT ACGTGTCTAT TACCGACGAT GAAGCCCTGG AAGCCTTTAA AACGCTTTGC CTGCATGAAG GGATCATCCC GGCGCTGGAA TCCTCCCACG CCCTGGCCCA TGCGCTGAAA ATGATGCGCG AAAATCCGGA AAAAGAGCAG CTATTGGTGG TTAACCTTTC CGGTCGCGGC GATAAAGACA TCTTCACCGT TCACGATATT TTGAAAGCAC GAGGGGAAAT CTGA
|
Protein sequence | MTTLLNPYFG EFGGMYVPQI LMPALRQLEE AFVSAQKDPE FQAQFNDLLK NYAGRPTALT KCQNITAGTN TTLYLKREDL LHGGAHKTNQ VLGQALLAKR MGKTEIIAET GAGQHGVASA LASALLGLKC RIYMGAKDVE RQSPNVFRMR LMGAEVIPVH SGSATLKDAC NEALRDWSGS YETAHYMLGT AAGPHPYPTI VREFQRMIGE ETKAQILERE GRLPDAVIAC VGGGSNAIGM FADFINETDV GLIGVEPGGH GIETGEHGAP LKHGRVGIYF GMKAPMMQTE DGQIEESYSI SAGLDFPSVG PQHAYLNSTG RADYVSITDD EALEAFKTLC LHEGIIPALE SSHALAHALK MMRENPEKEQ LLVVNLSGRG DKDIFTVHDI LKARGEI
|
| |