Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1371 |
Symbol | trpC |
ID | 5592556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1365484 |
End bp | 1366845 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640920526 |
Product | bifunctional indole-3-glycerol phosphate synthase/phosphoribosylanthranilate isomerase |
Protein accession | YP_001458085 |
Protein GI | 157160767 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0134] Indole-3-glycerol phosphate synthase [COG0135] Phosphoribosylanthranilate isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCAAA CCGTTTTAGC GAAAATCGTC GCAGACAAGG CGATTTGGGT AGAAGCCCGC AAACAGCAGC AACCGCTGGC CAGTTTTCAG AATGAGGTTC AGCCGAGCAC GCGACATTTT TATGATGCGC TACAGGGTGC GCGCACGGCG TTTATTCTGG AGTGCAAGAA AGCGTCGCCG TCAAAAGGCG TGATCCGTGA TGATTTCGAT CCAGCACGCA TTGCCGCCAT TTATAAACAT TACGCTTCGG CAATTTCGGT GCTGACTGAT GAGAAATATT TTCAGGGGAG CTTTGATTTC CTCCCCATCG TCAGCCAAAT CGCCCCGCAG CCGATTTTAT GTAAAGACTT CATTATCGAC CCTTACCAGA TCTATCTGGC GCGCTATTAC CAGGCCGATG CCTGCTTATT AATGCTTTCA GTACTGGATG ACGACCAATA TCGCCAGCTT GCCGCCGTCG CTCACAGTCT GGAGATGGGG GTGCTGACCG AAGTCAGTAA TGAAGAGGAA CAGGAGCGCG CCATTGCATT GGGAGCAAAG GTCGTTGGCA TCAACAACCG CGATCTGCGT GATTTGTCGA TTGATCTCAA CCGTACCCGC GAGCTTGCGC CGAAACTGGG GCACAACGTG ACGGTAATCA GCGAATCCGG CATCAATACT TACGCTCAGG TGCGCGAGTT AAGCCACTTC GCTAACGGTT TTCTGATTGG TTCGGCGTTG ATGGCCCATG ACGATTTGCA CGCCGCCGTG CGCCGGGTGT TGCTGGGTGA GAATAAAGTA TGCGGCCTGA CGCGTGGGCA AGATGCTAAA GCAGCTTATG ACGCGGGCGC GATTTACGGT GGGTTGATTT TTGTTGCGAC ATCACCGCGT TGCGTCAACG TTGAACAGGC GCAGGAAGTG ATGGCTGCGG CACCGTTGCA GTATGTTGGC GTGTTCCGCA ATCACGATAT TGCCGATGTG GTGGACAAAG CTAAGGTGTT ATCGCTGGCG GCAGTGCAAC TGCATGGTAA TGAAGAACAG CTGTATATCG ATACGCTGCG TGAAGCTCTG CCAGCACATG TTGCCATCTG GAAAGCATTA AGCGTCGGTG AAACCCTGCC CGCCCGCGAG TTTCAGCACG TTGATAAATA TGTTTTAGAC AACGGCCAGG GTGGAAGCGG GCAACGTTTT GACTGGTCAC TATTAAATGG TCAATCGCTT GGCAACGTTC TGCTGGCGGG GGGCTTAGGC GCAGATAACT GCGTGGAAGC GGCACAAACC GGCTGCGCCG GACTTGATTT TAATTCTGCT GTAGAGTCGC AACCGGGCAT CAAAGACGCA CGTCTTTTGG CCTCGGTTTT CCAGACGCTG CGCGCATATT AA
|
Protein sequence | MMQTVLAKIV ADKAIWVEAR KQQQPLASFQ NEVQPSTRHF YDALQGARTA FILECKKASP SKGVIRDDFD PARIAAIYKH YASAISVLTD EKYFQGSFDF LPIVSQIAPQ PILCKDFIID PYQIYLARYY QADACLLMLS VLDDDQYRQL AAVAHSLEMG VLTEVSNEEE QERAIALGAK VVGINNRDLR DLSIDLNRTR ELAPKLGHNV TVISESGINT YAQVRELSHF ANGFLIGSAL MAHDDLHAAV RRVLLGENKV CGLTRGQDAK AAYDAGAIYG GLIFVATSPR CVNVEQAQEV MAAAPLQYVG VFRNHDIADV VDKAKVLSLA AVQLHGNEEQ LYIDTLREAL PAHVAIWKAL SVGETLPARE FQHVDKYVLD NGQGGSGQRF DWSLLNGQSL GNVLLAGGLG ADNCVEAAQT GCAGLDFNSA VESQPGIKDA RLLASVFQTL RAY
|
| |