Gene Information |
Locus tag | ECH74115_4561 |
Symbol | tldD |
ID | 6971208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4227584 |
End bp | 4229029 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643388272 |
Product | protease TldD |
Protein accession | YP_002272707 |
Protein GI | 209398830 |
COG category | [R] General function prediction only |
COG ID | [COG0312] Predicted Zn-dependent proteases and their inactivated homologs |
TIGRFAM ID | |
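
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends in a single stop codon. Neither assumption is stated in the record, but both agree with the values listed. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4227584
end_bp = 4229029
gene_length_bp = 1446
protein_length_aa = 481

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon.
codons = span_bp // 3
residues = codons - 1

print(span_bp == gene_length_bp)      # coordinate span matches Gene Length
print(span_bp % 3 == 0)               # CDS length is a whole number of codons
print(residues == protein_length_aa)  # codon count matches Protein Length
```

Under these assumptions all three checks print `True`: 4229029 - 4227584 + 1 = 1446 bp, and 1446 / 3 - 1 = 481 aa.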
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.657779 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTCTTA ACCTGGTAAG TGAACAATTG CTAGCGGCGA ACGGCCTGAA ACATCAGGAC TTGTTCGTGA TCCTCGGTCA ACTGGCCGAA CGTCGCCTTG ATTATGGCGA TCTCTATTTT CAGTCGAGCT ATCACGAATC CTGGGTTTTA GAAGACCGCA TTATTAAAGA TGGTTCTTAC AACATCGATC AGGGCGTTGG TGTGCGTGCA ATCAGCGGTG AAAAAACCGG ATTTGCTTAC GCTGACCAAA TCAGCCTGCT GGCGCTGGAA CAGAGTGCGC AAGCGGCGCG CACCATCGCC CGTGATAGCG GTGATGGCAA AGTACAGACG CTGGGCGCGG TAGAGCATAG CCCGTTGTAT ACCTCGGTAG ATCCGCTGCA AAGCATGAGC CGTGAAGAGA AGCTGGATAT CCTGCGTCGC GTCGATAAGG TTGCCCGCGA AGCGGACAAG CGCGTACAGG AAGTGACTGC CAGCCTCAGT GGTGTCTATG AATTAATTCT GGTTGCGGCC ACCGACGGCA CGCTAGCGGC GGATGTCCGT CCGCTGGTGC GTCTTTCCGT GAGCGTTCTG ATCGAAGAAG ATGGCAAACG CGAACGCGGT GCCAGTGGCG GCGGCGGTCG TTTTGGTTAT GAGTTCTTCC TTGCCGATCT CGACGGCGAA GTCCGTGCGG ATGCATGGGC AAAAGAAGCA GTACGTATGG CGCTGGTCAA TCTTTCTGCC GTTGCTGCAC CAGCGGGCAC CATGCCGGTA GTACTTGGCG CAGGTTGGCC GGGCGTGCTG TTGCATGAAG CGGTAGGTCA CGGTCTGGAA GGCGACTTCA ACCGCCGTGG CACTTCAGTA TTTAGTGGAC AGGTCGGGGA GCTGGTGGCT TCAGAACTGT GTACCGTGGT TGATGACGGT ACGATGGTCG ATCGCCGTGG TTCGGTGGCG ATTGATGACG AAGGTACGCC AGGCCAGTAC AACGTGCTGA TTGAGAACGG CATTCTGAAA GGCTACATGC AGGATAAACT CAACGCGCGT TTGATGGGGA TGACGCCGAC TGGCAACGGT CGCCGTGAAT CCTACGCCCA TCTGCCCATG CCGCGTATGA CCAACACCTA TATGCTGCCG GGTAAATCGA CCCCGCAGGA AATTATTGAA TCCGTTGAGT ACGGTATCTA TGCACCGAAC TTTGGTGGCG GTCAGGTGGA TATCACCTCC GGCAAATTCG TTTTCTCCAC TTCAGAAGCA TATCTGATTG AAAACGGTAA AGTAACGAAG CCGGTGAAAG GCGCAACGTT GATTGGTTCC GGTATCGAAA CCATGCAGCA GATTTCGATG GTTGGCAACG ACCTGAAACT GGATAACGGC GTGGGTGTCT GCGGTAAAGA AGGGCAAAGT TTGCCGGTTG GCGTGGGCCA GCCAACGCTG AAAGTTGATA ACCTGACTGT TGGCGGTACT GCGTAA
Protein sequence | MSLNLVSEQL LAANGLKHQD LFVILGQLAE RRLDYGDLYF QSSYHESWVL EDRIIKDGSY NIDQGVGVRA ISGEKTGFAY ADQISLLALE QSAQAARTIA RDSGDGKVQT LGAVEHSPLY TSVDPLQSMS REEKLDILRR VDKVAREADK RVQEVTASLS GVYELILVAA TDGTLAADVR PLVRLSVSVL IEEDGKRERG ASGGGGRFGY EFFLADLDGE VRADAWAKEA VRMALVNLSA VAAPAGTMPV VLGAGWPGVL LHEAVGHGLE GDFNRRGTSV FSGQVGELVA SELCTVVDDG TMVDRRGSVA IDDEGTPGQY NVLIENGILK GYMQDKLNAR LMGMTPTGNG RRESYAHLPM PRMTNTYMLP GKSTPQEIIE SVEYGIYAPN FGGGQVDITS GKFVFSTSEA YLIENGKVTK PVKGATLIGS GIETMQQISM VGNDLKLDNG VGVCGKEGQS LPVGVGQPTL KVDNLTVGGT A
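
The relationship between the gene sequence and the protein sequence can be illustrated by translating a short stretch of the gene. The following is a minimal sketch, not the pipeline that produced this record: it builds the standard codon-to-amino-acid mapping (translation table 11 assigns the same amino acids as the standard code and differs in its permitted start codons) and translates the first 30 nucleotides and the last 6 nucleotides shown above. The `translate` helper and the constant names are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces; '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-nt blocks and last 6 nt of the gene sequence in this record.
gene_prefix = "ATGAGTCTTA ACCTGGTAAG TGAACAATTG"
gene_suffix = "GCGTAA"

print(translate(gene_prefix))  # MSLNLVSEQL
print(translate(gene_suffix))  # A*
```

The output matches the first ten residues of the protein sequence (MSLNLVSEQL) and its final residue (A) followed by the stop codon TAA.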