Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1689 |
Symbol | |
ID | 7084109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1895285 |
End bp | 1896424 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643698710 |
Product | pseudouridine synthase |
Protein accession | YP_002355340 |
Protein GI | 217970106 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases |
TIGRFAM ID | [TIGR00093] pseudouridine synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0924391 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCGCA AACCGAACCG GCCGCTGCGC CCGCCCCTGA AGAAGAAGTC TGACGCGATC GAGATCCTCG ACCGCGACCC GCGCGAATCC CGTGGCGCGC GCGACGACGA CGCCCAGCCC GGCCGCAAGG ACGCGCACCT CAACGCCGAT CCGCGCGGCC CGCGCACGCC GCGTCCGCCG CGCGCCGGTG GCGCGCCTGC GCCGCGCCGG CCGCGCATCG AGGGCGATCC CGTCCGCCAT CCCGACGCTC ACCGTTCCGC GCTCGGCCAG CCCGAGGGCG GCGAGCGCCC CGTGCGTGGC CGTGGGCCGG GCGGCGGGCG CAACGCGCCC ACCACGCTCA GCGAGCCCGA GCGCCTGCAG AAGGTGCTCG CCCAGGCGGG CGTCGCCTCG CGGCGCGAGA TCGAGGAATG GGTGGTGGCC GGGCGCATCT CGGTCAATGG CCTGCCGGCC TCGCTCGGCC AGAAGATCGG CCCGGGCGAC CGCGTCAAGG TCAATGGCAA GCTCGTGCCG CTGCGCTTCA CCCAGCGCTC GCCGCGCGTG CTGATCTACC ACAAGCCCGA AGGCGAGATC GTCTCGCGCG ACGACCCCGA GGGCCGTCCC ACCGTATTCG AGCGCCTGCC CATCCTGCGC AAGGGGCGCT GGCTGGCGGT CGGGCGCCTG GACTTCAACA CCTCGGGGCT GCTGTTGTTC ACCAACGACG GCGACCTCGC CAACAAGCTG ATGCACCCGC GCTACGAACT CGAGCGCGAG TACGCGGTGC GCATCCTCGG CGAGCTCACC GAAGAGCAGG TGAAGTCGCT TACCGACGGC ATCCAGCTCG AGGACGGCCC GGCCAAGTTC AACCTGCTGC GCGACGAGGG CGGCGAGGGT GCCAACCACT GGTACCGGGT GACGATCTCG GAGGGCCGCA ACCGCGAAGT GCGGCGCATG TTCGAAGCCG TGGGCCTGAC CGTGAGCCGG CTGATGCGGG TGCGCTACGG CAGCGTCGAG CTGCCCGCGC GCCTCAAGCG CGGCATGTGG ATGGAGATGC CCGAGGCCGA GGCCTGCGCG CTCGCCGGCC TGCCGCCGCC GCAGCAGAGC CGTCAGCAGG ATCTGCGCGA CAAGCGGCCG GTGAAGCTGC ACCGCACCCA GGCGCGTTGA
|
Protein sequence | MSRKPNRPLR PPLKKKSDAI EILDRDPRES RGARDDDAQP GRKDAHLNAD PRGPRTPRPP RAGGAPAPRR PRIEGDPVRH PDAHRSALGQ PEGGERPVRG RGPGGGRNAP TTLSEPERLQ KVLAQAGVAS RREIEEWVVA GRISVNGLPA SLGQKIGPGD RVKVNGKLVP LRFTQRSPRV LIYHKPEGEI VSRDDPEGRP TVFERLPILR KGRWLAVGRL DFNTSGLLLF TNDGDLANKL MHPRYELERE YAVRILGELT EEQVKSLTDG IQLEDGPAKF NLLRDEGGEG ANHWYRVTIS EGRNREVRRM FEAVGLTVSR LMRVRYGSVE LPARLKRGMW MEMPEAEACA LAGLPPPQQS RQQDLRDKRP VKLHRTQAR
|
| |