Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2870 |
Symbol | truD |
ID | 6145553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2943453 |
End bp | 2944502 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617739 |
Product | tRNA pseudouridine synthase D |
Protein accession | YP_001744894 |
Protein GI | 170681051 |
COG category | [S] Function unknown |
COG ID | [COG0585] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00094] tRNA pseudouridine synthase, TruD family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGAGT TCGATAATCT CACTTACCTC CACGGCAAAC CGCAAGGCAC CGGGCTGCTG AAAGCGAATC CGGAAGACTT TGTGGTGGTG GAAGATTTGG GCTTTGAGCC TGATGGTGAA GGTGAGCATA TTCTGGTTAG AATCCTCAAA AACGGCTGCA ATACCCGTTT TGTGGCGGAT GCACTGGCGA AATTCCTGAA AATTCATGCC CGTGAAGTCA GCTTCGCTGG GCAAAAAGAT AAACATGCTG TTACGGAACA GTGGTTATGC GCTCGCGTGC CGGGCAAAGA GATGCCCGAT CTGAGCGCCT TTCAACTGGA AGGCTGCCAG GTGCTGGAGT ATGCGCGGCA CAAGCGCAAG CTGCGTTTAG GGGCGCTGAA AGGTAACGCC TTTACCCTGG TACTACGCGA AGTGAGCAAT CGCGATGACG TTGAACAACG TCTGATCGAT ATTTGCGTAA AAGGTGTACC GAACTACTTC GGTGCCCAAC GTTTTGGGAT TGGCGGTAGC AACTTGCAGG GCGCGCTGCG CTGGGCGCAA ACCAATACTC CGGTGCGCGA TCGCAATAAA CGGAGTTTTT GGTTGTCGGC AGCCCGCAGT GCGTTGTTTA ATCAGATTGT TGCTGAGCGC CTCAAAAAAG CAGACGTTAA TCAAGTTGTT GACGGCGATG CGCTACAATT AGCCGGACGT GGTAGCTGGT TTGTTGCAAC CACCGAAGAA GTGGCGGAAT TACAGCGTCG CGTCAACGAT AAAGAGCTAA TGATAACCGC CGCATTGCCG GGCAGTGGCG AATGGGGAAC TCAGCGTGAA GCGCTGGCAT TCGAACAAGC AGCTGTCGCC GCAGAAACTG AATTACAAGC TTTACTGGTG CGCGAAAAAG TTGAAGCCGC GCGCAGAGCG ATGCTGCTGT ATCCGCAACA ATTAAGCTGG AACTGGTGGG ATGACGTCAC CGTAGAAATC CGTTTCTGGC TTCCGGCGGG TAGTTTTGCA ACCAGCGTTG TCAGGGAACT TATCAACACA ACAGGTGATT ATGCGCATAT TGCTGAGTAA
|
Protein sequence | MIEFDNLTYL HGKPQGTGLL KANPEDFVVV EDLGFEPDGE GEHILVRILK NGCNTRFVAD ALAKFLKIHA REVSFAGQKD KHAVTEQWLC ARVPGKEMPD LSAFQLEGCQ VLEYARHKRK LRLGALKGNA FTLVLREVSN RDDVEQRLID ICVKGVPNYF GAQRFGIGGS NLQGALRWAQ TNTPVRDRNK RSFWLSAARS ALFNQIVAER LKKADVNQVV DGDALQLAGR GSWFVATTEE VAELQRRVND KELMITAALP GSGEWGTQRE ALAFEQAAVA AETELQALLV REKVEAARRA MLLYPQQLSW NWWDDVTVEI RFWLPAGSFA TSVVRELINT TGDYAHIAE
|
| |