Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3922 |
Symbol | tnaA |
ID | 5592656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 3915901 |
End bp | 3917316 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640923030 |
Product | tryptophanase |
Protein accession | YP_001460507 |
Protein GI | 157163189 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3033] Tryptophanase |
TIGRFAM ID | [TIGR02617] tryptophanase, leader peptide-associated |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAACT TTAAACATCT CCCTGAACCG TTCCGCATTC GTGTTATTGA GCCAGTAAAA CGTACCACTC GCGCTTATCG TGAAGAGGCA ATTATTAAAT CCGGTATGAA CCCGTTCCTG CTGGATAGCG AAGATGTGTT TATCGATTTA CTGACCGACA GCGGCACCGG GGCGGTGACG CAGAGTATGC AGGCCGCGAT GATGCGCGGC GACGAAGCTT ACAGCGGCAG CCGCAGCTAC TATGCGTTAG CCGAGTCAGT AAAAAATATC TTTGGTTATC AATACACTAT TCCAACTCAC CAGGGCCGTG GTGCAGAACA AATCTATATT CCGGTACTGA TTAAAAAACG CGAGCAGGAA AAAGGCCTGG ATCGCAGCAA AATGGTGGCG TTCTCTAACT ATTTCTTTGA TACCACGCAG GGCCATAGCC AGATTAACGG CTGTACCGTG CGTAACGTCT ATATCAAAGA AGCCTTCGAT ACTGGCGTGC GTTACGACTT TAAAGGTAAC TTTGACCTCG AAGGATTAGA ACGCGGTATT GAAGAAGTTG GCCCGAATAA CGTGCCGTAT ATCGTTGCAA CCATCACCAG TAACTCTGCA GGTGGTCAGC CGGTTTCACT GGCAAACTTA AAAGCGATGT ACAGCATCGC GAAGAAATAC GATATTCCGG TGGTAATGGA CTCCGCACGC TTTGCTGAAA ACGCCTATTT CATCAAGCAG CGTGAAGCAG AATACAAAGA CTGGACCATC GAGCAGATCA CCCGCGAAAC CTACAAATAT GCCGATATGC TGGCGATGTC CGCCAAGAAA GATGCGATGG TGCCGATGGG CGGCTTGCTG TGCATGAAAG ACGACAGCTT CTTTGATGTG TACACCGAGT GCAGAACCCT TTGCGTGGTG CAGGAAGGCT TCCCGACATA TGGCGGCCTG GAAGGCGGCG CGATGGAGCG TCTGGCGGTA GGTCTGTATG ACGGCATGAA TCTCGACTGG CTGGCTTATC GTATCGCGCA GGTACAGTAT CTGGTCGATG GTCTGGAAGA GATTGACGTT GTCTGCCAGC AGGCGGGCGG TCACGCGGCA TTCGTTGATG CCGGTAAACT GCTGCCGCAT ATCCCGGCAG ACCAGTTCCC GGCACAGGCG CTGGCGTGCG AGCTGTATAA AGTCGCCGGT ATCCGTGCGG TAGAAATTGG CTCTTTCCTG TTAGGCCGCG ATCCGAAAAC CGGTAAACAA CTGCCATGCC CGGCTGAACT GCTGCGTTTA ACCATTCCGC GCGCAACATA TACTCAAACA CATATGGACT TCATTATTGA AGCCTTTAAA CATGTGAAAG AGAACGCGGC GAATATTAAA GGATTAACCT TTACCTACGA ACCAAAAGTA TTGCGTCACT TCACCGCAAA ACTGAAAGAA GTTTAA
|
Protein sequence | MENFKHLPEP FRIRVIEPVK RTTRAYREEA IIKSGMNPFL LDSEDVFIDL LTDSGTGAVT QSMQAAMMRG DEAYSGSRSY YALAESVKNI FGYQYTIPTH QGRGAEQIYI PVLIKKREQE KGLDRSKMVA FSNYFFDTTQ GHSQINGCTV RNVYIKEAFD TGVRYDFKGN FDLEGLERGI EEVGPNNVPY IVATITSNSA GGQPVSLANL KAMYSIAKKY DIPVVMDSAR FAENAYFIKQ REAEYKDWTI EQITRETYKY ADMLAMSAKK DAMVPMGGLL CMKDDSFFDV YTECRTLCVV QEGFPTYGGL EGGAMERLAV GLYDGMNLDW LAYRIAQVQY LVDGLEEIDV VCQQAGGHAA FVDAGKLLPH IPADQFPAQA LACELYKVAG IRAVEIGSFL LGRDPKTGKQ LPCPAELLRL TIPRATYTQT HMDFIIEAFK HVKENAANIK GLTFTYEPKV LRHFTAKLKE V
|
| |