Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4075 |
Symbol | tnaA |
ID | 6145087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4165397 |
End bp | 4166812 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641618900 |
Product | tryptophanase |
Protein accession | YP_001746038 |
Protein GI | 170681687 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3033] Tryptophanase |
TIGRFAM ID | [TIGR02617] tryptophanase, leader peptide-associated |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.420944 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAACT TTAAACATCT CCCTGAACCA TTCCGCATTC GTGTTATTGA GCCAGTAAAA CGTACGACTC GCGCTTACCG TGAAGAGGCA ATTATTAAAT CCGGTATGAA CCCGTTCCTG CTGGATAGCG AAGATGTGTT TATCGATTTA CTGACCGACA GCGGCACCGG GGCGGTGACG CAGAGCATGC AGGCCGCGAT GATGCGCGGC GACGAAGCCT ACAGCGGCAG CCGCAGCTAC TATGCGTTAG CCGAGTCAGT GAAAAATATC TTTGGTTATC AATACACCAT TCCGACTCAC CAGGGCCGTG GCGCAGAACA AATCTATATT CCGGTACTGA TTAAAAAACG CGAGCAGGAA AAAGGCCTGG ATCGCAGCAA AATGGTGGCG TTCTCTAACT ATTTCTTTGA TACCACGCAG GGCCATAGCC AGATTAACGG CTGTACCGTG CGTAACGTCT ATATCAAAGA AGCCTTCGAT ACTGGCGTGC GTTACGACTT TAAAGGCAAC TTTGACCTTG AGGGATTAGA ACGCGGTATT GAAGAAGTTG GCCCGAATAA CGTGCCGTAT ATCGTTGCAA CCATCACCAG TAACTCCGCA GGTGGTCAGC CGGTTTCACT GGCAAACTTA AAAGCGATGT ACAGCATCGC GAAGAAATAC GATATTCCGG TGGTCATGGA CTCCGCACGC TTTGCCGAAA ACGCCTATTT CATCAAGCAG CGTGAAGCAG AATACAAAGA CTGGACCATC GAGCAGATCA CCCGCGAAAC CTACAAATAT GCCGATATGC TGGCGATGTC CGCCAAGAAA GATGCAATGG TGCCGATGGG CGGCCTGCTG TGCGTGAAAG ACGACAGCTT CTTTGATGTG TACACCGAGT GCAGAACCCT TTGCGTGGTA CAGGAAGGCT TCCCGACATA TGGCGGCCTG GAAGGCGGCG CGATGGAGCG TCTGGCGGTA GGTCTGTATG ACGGCATGAA TCTGGACTGG CTGGCTTATC GTATCGCGCA GGTGCAGTAT CTGGTCGATG GTCTGGAAGA GATTGGCGTT GTCTGCCAGC AGGCGGGCGG TCACGCTGCA TTCGTTGATG CCGGTAAACT GCTGCCGCAT ATCCCGGCAG ATCAGTTCCC GGCACAGGCG CTGGCCTGCG AGCTGTATAA AGTCGCCGGT ATCCGTGCGG TAGAAATTGG CTCTTTCCTG TTAGGCCGCG ATCCGAAAAC CGGTAAACAA CTGCCATGCC CGGCTGAACT GCTGCGTTTA ACCATTCCGC GCGCAACATA TACTCAAACA CATATGGACT TCATTATTGA AGCCTTCAAA CATGTGAAAG AGAACGCGGC GAATATTAAA GGGTTAACCT TTACCTACGA ACCGAAAGTA TTGCGTCACT TCACCGCAAA ACTGAAAGAA GTTTAA
|
Protein sequence | MENFKHLPEP FRIRVIEPVK RTTRAYREEA IIKSGMNPFL LDSEDVFIDL LTDSGTGAVT QSMQAAMMRG DEAYSGSRSY YALAESVKNI FGYQYTIPTH QGRGAEQIYI PVLIKKREQE KGLDRSKMVA FSNYFFDTTQ GHSQINGCTV RNVYIKEAFD TGVRYDFKGN FDLEGLERGI EEVGPNNVPY IVATITSNSA GGQPVSLANL KAMYSIAKKY DIPVVMDSAR FAENAYFIKQ REAEYKDWTI EQITRETYKY ADMLAMSAKK DAMVPMGGLL CVKDDSFFDV YTECRTLCVV QEGFPTYGGL EGGAMERLAV GLYDGMNLDW LAYRIAQVQY LVDGLEEIGV VCQQAGGHAA FVDAGKLLPH IPADQFPAQA LACELYKVAG IRAVEIGSFL LGRDPKTGKQ LPCPAELLRL TIPRATYTQT HMDFIIEAFK HVKENAANIK GLTFTYEPKV LRHFTAKLKE V
|
| |