Gene Information |
Locus tag | ECD_01236 |
Symbol | trpC |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | - |
Start bp | 1304019 |
End bp | 1305377 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | bifunctional indole-3-glycerol phosphate synthase/phosphoribosylanthranilate isomerase |
Protein accession | ACT43126 |
Protein GI | 253977456 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
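The length fields in the Gene Information table can be cross-checked from the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the convention that reproduces the listed 1359 bp) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(1304019, 1305377)
print(length)                     # 1359
print(protein_length_aa(length))  # 452
```

Both values agree with the Gene Length and Protein Length fields of this record.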
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAACCG TTTTAGCGAA AATCGTCGCA GACAAGGCGA TTTGGGTAGA AACCCGCAAA CAGCAGCAAC CGCTGGCCAG TTTTCAGAAT GAGGTTCAGC CGAGCACGCG ACATTTTTAT GATGCGCTAC AGGGTGCGCG CACGGCGTTT ATTCTGGAGT GCAAGAAAGC GTCGCCGTCA AAAGGCGTGA TCCGTGATGA TTTCGATCCA GCACGCATTG CCGCCATTTA TAAACATTAC GCTTCGGCAA TTTCGGTGCT GACTGATGAG AAATATTTTC AGGGGAGCTT TGATTTCCTC CCCATCGTCA GCCAAATCGC CCCGCAGCCG ATTTTATGTA AAGACTTCAT TATCGATCCT TACCAGATCT ATCTGGCGCG CTATTACCAG GCTGATGCCT GCTTATTAAT GCTTTCAGTA CTGGATGACG AACAATATCG CCAGCTTGCC GCCGTCGCCC ACAGTCTGGA GATGGGTGTG CTGACCGAAG TCAGTAATGA AGAGGAACTG GAGCGTGCCA TTGCATTGGG GGCAAAGGTC GTTGGCATCA ACAACCGCGA TCTGCGTGAT TTGTCGATTG ATCTCAACCG TACCCGCGAG CTTGCGCCGA AACTGGGGCA CAACGTGACG GTAATCAGCG AATCCGGCAT CAATACTTAC GCTCAGGTGC GCGAGTTAAG CCACTTCGCT AACGGTTTTC TGATTGGTTC GGCGTTGATG GCCCATGACG ATTTGCACGC CGCCGTGCGC CGGGTGTTGC TGGGTGAGAA TAAAGTATGT GGCCTGACGC GTGGGCAAGA TGCTAAAGCA GCTTATGACG CGGGCGCGAT TTACGGTGGG TTGATTTTTG TTGCGACATC ACCGCGTTGC GTCAACGTTG AACAGGCGCA GGAAGTGATG GCTGCGGCAC CGTTGCAGTA TGTTGGCGTG TTCCGCAATC ACGATATTGC CGATGTGGTG GACAAAGCTA AGGTGTTATC GCTGGCGGCA GTGCAACTGC ATGGTAATGA AGAACAGCTG TATATCGATA CGCTGCGTGA AGCTCTGCCA GCACATGTTG CCATCTGGAA AGCATTAAGC GTCGGTGAAA CCCTGCCCGC CCGCGAGTTT CAGCACGTTG ATAAATATGT TTTAGACAAC GGCCAGGGTG GAAGCGGGCA ACGTTTTGAC TGGTCACTAT TAAATGGTCA ATCGCTTGGC AACGTTCTGC TGGCGGGGGG CTTAGGCGCA GATAACTGCG TGGAAGCGGC ACAAACCGGC TGCGCCGGAC TTGATTTTAA TTCTGCTGTA GAGTCGCAAC CGGGCATCAA AGACGCACGT CTTTTGGCCT CGGTTTTCCA GACGCTGCGC GCATATTAA
|
Protein sequence | MQTVLAKIVA DKAIWVETRK QQQPLASFQN EVQPSTRHFY DALQGARTAF ILECKKASPS KGVIRDDFDP ARIAAIYKHY ASAISVLTDE KYFQGSFDFL PIVSQIAPQP ILCKDFIIDP YQIYLARYYQ ADACLLMLSV LDDEQYRQLA AVAHSLEMGV LTEVSNEEEL ERAIALGAKV VGINNRDLRD LSIDLNRTRE LAPKLGHNVT VISESGINTY AQVRELSHFA NGFLIGSALM AHDDLHAAVR RVLLGENKVC GLTRGQDAKA AYDAGAIYGG LIFVATSPRC VNVEQAQEVM AAAPLQYVGV FRNHDIADVV DKAKVLSLAA VQLHGNEEQL YIDTLREALP AHVAIWKALS VGETLPAREF QHVDKYVLDN GQGGSGQRFD WSLLNGQSLG NVLLAGGLGA DNCVEAAQTG CAGLDFNSAV ESQPGIKDAR LLASVFQTLR AY
|
| |
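The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to join such blocks into a contiguous string and run basic sanity checks before downstream use. It assumes the gene sequence is given in coding orientation and uses the three stop codons of translation table 11; the helper names and the toy input are hypothetical, and the toy input is not taken from this record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def join_blocks(blocked: str) -> str:
    """Remove the block spacing and return one uppercase sequence string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) >= 6 and len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy input for illustration only.
toy = join_blocks("ATGGCG TAA")
print(toy, looks_like_complete_cds(toy))
```

Passing the full Gene sequence field through `join_blocks` and then `gc_percent` is one way to compare against the GC content reported in the Gene Information table.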