Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1694 |
Symbol | truB |
ID | 7084114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1901958 |
End bp | 1902944 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643698715 |
Product | tRNA pseudouridine synthase B |
Protein accession | YP_002355345 |
Protein GI | 217970111 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0130] Pseudouridine synthase |
TIGRFAM ID | [TIGR00431] tRNA pseudouridine 55 synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00112277 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCTCG CTCCCGCGGC TGAGGTCGCG GCTGAGCCGG CGCGGCCGAA GCGCAAGGGC CAGCCGCAAC GCAGGATCGT CCGCCGCGAA GTCGACGGCG TGCTGTTGCT CGACAAGCCG CAGGGCATGA GCTCGAACGG TGCCTTGCAG ACCGCACGGC GCTTGCTTGC GGCCGCCAAG GCCGGCCACA CCGGCACGCT GGATCCGATG GCGAGCGGAC TGCTGCCGCT CACCTTCGGC GAGGCCACCA AGTTCGCGCA GATGCTGCTC GACGCCGACA AGACCTACGA GGCGGTGGTG CGGCTCGGCG TGGACACCGA CAGCGGCGAT GCCGAGGGCA AGGTGCTCGC CACCCGTCCG GTGGCGGTGG ATCGCGCCGC GCTCGAAGCC GTGCTGGAGC GCTTCCGCGG CGAGATCGAG CAGGTGCCGC CGATGTATTC GGCGCTCAAG CGCGACGGCA AGGCGCTCTA CGAGTACGCG CGCGCGGGCA TCGAGCTCGA GCGCGCGGCG CGCCGGGTCG TGATCCATGC GCTCGAACTG CTCGACTTCG CAGGCGAGCG CTTCACGATC CGCGTGCATT GCAGCAAGGG CACCTACATC CGCAGCCTCG CGATGGACAT CGGCGCTGCG CTCGGTTGCG GTGCCCACCT GGCGGCGTTG CGGCGCACCG CGATCGGCGC CTTCGACCTG TCCGGTGCGC TCACGCTCGA AGCGCTGGAG GCGGCCGGCG AGGGCGGACG CGATGCGCTG CTCGCCCCGG TCGATGCGCT GGTGGCGGGC TTTCCGGTGC TGCAGCTCGA TGCCGAGGCG GCGCGCGGCC TGCTGCAGGG TCGCACGCTC GCGCTCGCGG GCGCGCAGCC GGGCGCGAAG GTGCGCGCCT ACGGGCCGGG GGGCTTCCTC GGCCTGGCGC AGTGGCAGGA CGACGGACGC CTGGCGGCGC GCCGGCTGAT CGCCACCGGT GGGCGGAGCG AAGATGCATC CGCATGA
|
Protein sequence | MTLAPAAEVA AEPARPKRKG QPQRRIVRRE VDGVLLLDKP QGMSSNGALQ TARRLLAAAK AGHTGTLDPM ASGLLPLTFG EATKFAQMLL DADKTYEAVV RLGVDTDSGD AEGKVLATRP VAVDRAALEA VLERFRGEIE QVPPMYSALK RDGKALYEYA RAGIELERAA RRVVIHALEL LDFAGERFTI RVHCSKGTYI RSLAMDIGAA LGCGAHLAAL RRTAIGAFDL SGALTLEALE AAGEGGRDAL LAPVDALVAG FPVLQLDAEA ARGLLQGRTL ALAGAQPGAK VRAYGPGGFL GLAQWQDDGR LAARRLIATG GRSEDASA
|
| |