Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1053 |
Symbol | |
ID | 3831859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1083752 |
End bp | 1084669 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637828981 |
Product | tRNA pseudouridine synthase B |
Protein accession | YP_429910 |
Protein GI | 83589901 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0130] Pseudouridine synthase |
TIGRFAM ID | [TIGR00431] tRNA pseudouridine 55 synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000000253025 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000016309 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTTATGG GTTTTGTTAA TGTCTTAAAA CCGCCGGGAC TTACCTCCCA TGACGTGGTG CAGAATCTGC GCCGGCTTCT CAAAGTCAAG AGGATCGGCC ATGGTGGCAC CCTGGACCCT CTGGCGGCTG GCGTCCTGCC GGTTGCCGTT GGTACGGCTA CCCGTTTGCT GGAATACCTG CAGGGCGGCG ATAAAGCCTA CCGGGCCGAG TTTATCCTGG GCCTGAAGAC CGACACCCAG GACTTGGGCG GCCGGGTCCT GGCCAGGAAA CCCTGCCCGC CTTTCACAGA AAAGGATTTA CAGGCCGCCA CCAGGCCCTT TACGGGGACT ATCAGGCAGG TACCACCCAT GGTATCGGCT GTGCACTACC AGGGCCGCCG GCTTTATGAA CTGGCAAGGG AGGGCCTGGA GGTTGAACGA CCGGCCCGCC AGGTGACCAT CCATGAATTT CGGCTGATTA GGGCCTGGCC TGATGGACCT TACTACCGGG CGTTAATAGA TATCACCTGC TCCCGGGGTA CCTATATCCG TACCCTGGGG GCTGACTGGG GTGATTACCT GGGGGTAGGT GCCACCCTGG CCTTTTTACT TCGTACCCGA GCCGGGAGTT TCCGATTGAC AGATGCCTGG ACCCTGGAGG AAATAGCCGG GGCTATAGAT AGGGGCGAGA GGACCTTCCT TCTCCCGCCC GCCGCCGGCC TGGCCCACCT GCCAGTGATA ATAGTTCCAG GCGAGTTTAT CCGCCATGTA AGTAACGGGG TAGCCATCAA GGGTGATGTA TGCCGGCCGC TACCGTCCCT CAGAGAAGGG GATATAGTGC GCCTGGAGAC CGGCGAAGGC CAACTCCTGG CCCTGGCCAG GGTGGAGCCA GATACCAGGG GGTCCTTCTT ACTAAAACCC CATAAGGTTT TGAAGTGA
|
Protein sequence | MVMGFVNVLK PPGLTSHDVV QNLRRLLKVK RIGHGGTLDP LAAGVLPVAV GTATRLLEYL QGGDKAYRAE FILGLKTDTQ DLGGRVLARK PCPPFTEKDL QAATRPFTGT IRQVPPMVSA VHYQGRRLYE LAREGLEVER PARQVTIHEF RLIRAWPDGP YYRALIDITC SRGTYIRTLG ADWGDYLGVG ATLAFLLRTR AGSFRLTDAW TLEEIAGAID RGERTFLLPP AAGLAHLPVI IVPGEFIRHV SNGVAIKGDV CRPLPSLREG DIVRLETGEG QLLALARVEP DTRGSFLLKP HKVLK
|
| |