Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5139 |
Symbol | tnaA |
ID | 6972188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4780534 |
End bp | 4781949 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643388810 |
Product | tryptophanase |
Protein accession | YP_002273236 |
Protein GI | 209397514 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3033] Tryptophanase |
TIGRFAM ID | [TIGR02617] tryptophanase, leader peptide-associated |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.685998 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.24086 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAACT TTAAACATCT CCCTGAACCG TTCCGCATTC GTGTTATTGA GCCAGTAAAA CGTACCACTC GCGCTTATCG TGAAGAGGCA ATTATTAAAT CCGGTATGAA CCCGTTCCTG CTGGATAGCG AAGATGTGTT TATCGATTTA CTGACCGACA GCGGCACCGG GGCGGTGACG CAGAGTATGC AGGCCGCGAT GATGCGCGGC GACGAAGCTT ACAGCGGCAG CCGCAGCTAC TATGCGTTAG CCGAGTCAGT GAAAAATATC TTTGGTTATC AATACACCAT TCCGACTCAC CAGGGCCGTG GTGCAGAGCA AATCTATATT CCAGTACTGA TTAAAAAGCG CGAGCAGGAA AAAGGCCTGG ATCGCAGCAA AATGGTGGCG TTCTCTAACT ATTTCTTTGA TACCACGCAG GGCCATAGCC AGATTAACGG CTGTACCGTG CGTAACGTCT ATATCAAAGA AGCCTTCGAT ACGGGCGTGC GTTACGACTT TAAAGGCAAC TTTGACCTTG AGGGATTAGA ACGCGGTATT GAAGAAGTTG GTCCGAATAA CGTGCCGTAT ATCGTTGCAA CCATCACCAG TAACTCCGCA GGCGGTCAGC CGGTTTCACT GGCAAACTTA AAAGCGATGT ACAGCATCGC GAAGAAATAT GATATTCCGG TGGTAATGGA CTCCGCGCGC TTTGCTGAAA ACGCCTATTT CATCAAGCAG CGTGAAGCAG AATACAAAGA CTGGACCATC GAGCAGATCA CCCGCGAAAC CTACAAATAT GCCGATATGC TGGCGATGTC CGCCAAGAAA GATGCGATGG TGCCGATGGG CGGCTTGCTG TGCATGAAAG ACGACAGTTT CTTTGATGTG TACACCGAGT GCAGAACCCT TTGCGTAGTT CAGGAAGGCT TCCCGACGTA TGGCGGCCTG GAAGGCGGCG CGATGGAGCG TCTGGCCGTA GGTCTGTATG ACGGCATGAA TCTGGACTGG CTGGCTTATC GTATTGCGCA GGTGCAGTAT CTGGTCGATG GTCTGGAAGA GATTGGCGTT GTCTGCCAGC AGGCGGGCGG TCACGCGGCA TTCGTTGATG CCGGTAAACT GCTTCCGCAT ATCCCGGCAG ATCAGTTCCC GGCACAGGCG CTGGCCTGCG AGCTGTATAA AGTCGCCGGT ATCCGCGCGG TAGAAATTGG CTCTTTCCTG TTAGGCCGCG ATCCGAAAAC CGGTAAACAA CTGCCATGCC CGGCTGAACT GCTGCGTTTA ACCATTCCGC GCGCAACATA TACTCAAACA CATATGGACT TCATTATTGA AGCCTTCAAA CATGTGAAAG AGAACGCGTC GAATATTAAA GGGTTAACCT TTACCTACGA ACCAAAAGTA TTGCGTCACT TCACCGCAAA ACTGAAAGAA GTTTAA
|
Protein sequence | MENFKHLPEP FRIRVIEPVK RTTRAYREEA IIKSGMNPFL LDSEDVFIDL LTDSGTGAVT QSMQAAMMRG DEAYSGSRSY YALAESVKNI FGYQYTIPTH QGRGAEQIYI PVLIKKREQE KGLDRSKMVA FSNYFFDTTQ GHSQINGCTV RNVYIKEAFD TGVRYDFKGN FDLEGLERGI EEVGPNNVPY IVATITSNSA GGQPVSLANL KAMYSIAKKY DIPVVMDSAR FAENAYFIKQ REAEYKDWTI EQITRETYKY ADMLAMSAKK DAMVPMGGLL CMKDDSFFDV YTECRTLCVV QEGFPTYGGL EGGAMERLAV GLYDGMNLDW LAYRIAQVQY LVDGLEEIGV VCQQAGGHAA FVDAGKLLPH IPADQFPAQA LACELYKVAG IRAVEIGSFL LGRDPKTGKQ LPCPAELLRL TIPRATYTQT HMDFIIEAFK HVKENASNIK GLTFTYEPKV LRHFTAKLKE V
|
| |