Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1894 |
Symbol | trpC |
ID | 6967389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1786461 |
End bp | 1787819 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643385828 |
Product | bifunctional indole-3-glycerol phosphate synthase/phosphoribosylanthranilate isomerase |
Protein accession | YP_002270317 |
Protein GI | 209395881 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0134] Indole-3-glycerol phosphate synthase [COG0135] Phosphoribosylanthranilate isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.890724 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.00124685 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAAACCG TTTTAGCGAA AATCGTCGCA GACAAGGCGA TTTGGGTAGA AGCCCGCAAA CAGCAGCAAC CGCTGGCCAG TTTTCAGAAT GAGGTTCAGC CGAGCACGCG ACATTTTTAT GATGCGCTAC AGGGTGCGCG CACGGCGTTT ATTCTGGAGT GCAAGAAAGC GTCGCCGTCA AAAGGCGTGA TCTGTGATGA TTTCGATCCA GCACGCATTG CCGCCATTTA TAAACATTAC GCTTCGGCAA TTTCGGTGCT GACTGATGAG AAATATTTTC AGGGGAGCTT TGATTTCCTC CCCATCGTCA GCCAAATCGC CCCGCAGCCG ATTTTATGTA AAGACTTCAT TATCGACCCT TACCAGATCT ATCTGGCGCG CTATTACCAG GCCGATGCCT GCTTATTAAT GCTTTCAGTA CTGGATGACG AACAATATCG CCAGCTTGCC GCCGTCGCCC ACAGTCTGAA GATGGGTGTG CTGACCGAAG TCAGTAATGA AGAGGAACTG GAGCGCGCCA TTGCATTGGG GGCAAAGGTC GTTGGCATCA ACAACCGCGA TCTGCGTGAT TTGTCGATTG ATCTCAACCG TACCCGCGAG CTTGCGCCGA AACTGGGGCA CAACGTGACG GTAATCAGCG AATCAGGCAT CAATACTTAC GCTCAGGTGC GCGAGTTAAG CCACTTCGCT GACGGTTTTC TAATTGGTTC GGCGTTGATG GCCCATGACG ATTTGCACGC CGCCGTGCGT CGGGTGTTGC TGGGTGAGAA TAAAGTATGT GGCCTGACGC GTGGGCAAGA TGCTAAAGCA GCGTACGACG CGGGCGCGAT TTACGGTGGG TTGATTTTTG TTGCGACATC ACCGCGTTGC GTCAACGTTG AACAGGCGCA GGAAGTGATG GCTGCGGCAC CGTTGCAGTA TGTTGGCGTG TTCCGCAATC ACGATATTGC CGATGTGCTG GACAAAGCTA AGGTGTTATC ACTGGTGGCA GTGCAACTGC ATGGTAATGA AGATCAGCTG TATATCGATA CGCTGCGTGA AGCTCTGCCA GCACACGTTG CCATCTGGAA AGCATTAAGC GTCGGTGAAA CCCTGCCCGC CCGCGAGCTT CAGCACGTTG ATAAATATGT TTTAGACAAC GGCCAGGGTG GAAGCGGGCA ACGTTTCGAC TGGTCACTAT TAAATGGTCA ATCGCTTGGC AACGTTCTGC TGGCGGGGGG CTTAGGCGCA GATAACTGCG TGGAAGCGGC ACAAACCGGC TGCGCCGGAC TTGATTTTAA TTCTGCTGTA GAGTCGCAAC CGGGCATCAA AGACGCACGT CTTTTGGCCT CGGTTTTCCA GACGCTGCGC GCATATTAA
|
Protein sequence | MQTVLAKIVA DKAIWVEARK QQQPLASFQN EVQPSTRHFY DALQGARTAF ILECKKASPS KGVICDDFDP ARIAAIYKHY ASAISVLTDE KYFQGSFDFL PIVSQIAPQP ILCKDFIIDP YQIYLARYYQ ADACLLMLSV LDDEQYRQLA AVAHSLKMGV LTEVSNEEEL ERAIALGAKV VGINNRDLRD LSIDLNRTRE LAPKLGHNVT VISESGINTY AQVRELSHFA DGFLIGSALM AHDDLHAAVR RVLLGENKVC GLTRGQDAKA AYDAGAIYGG LIFVATSPRC VNVEQAQEVM AAAPLQYVGV FRNHDIADVL DKAKVLSLVA VQLHGNEDQL YIDTLREALP AHVAIWKALS VGETLPAREL QHVDKYVLDN GQGGSGQRFD WSLLNGQSLG NVLLAGGLGA DNCVEAAQTG CAGLDFNSAV ESQPGIKDAR LLASVFQTLR AY
|
| |