Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1745 |
Symbol | |
ID | 3832890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1796991 |
End bp | 1798151 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637829669 |
Product | pseudouridylate synthase |
Protein accession | YP_430589 |
Protein GI | 83590580 |
COG category | [S] Function unknown |
COG ID | [COG0585] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00094] tRNA pseudouridine synthase, TruD family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000000122124 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.627089 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTAA AGGTGATCCC CGAGGATTTC GTGGTCCGGG AACTGGCCCG CCTTCCTATT CGGGAAAAGG GGCCTTACCG GCTGTATCTT TTTGAAAAGA AGGGATGGAA TACCATCGAT CTTTTGATAC GGCTGGCAAA GGCCCATCGC CTTCCCTACC GACTTTTTGC TTACGGGGGG TTAAAGGACC GGCACGCCCA TACCTTTCAG TACGTCACAG TGAAGCATCC CGCCGATTTA ACCACTGAGG CAGAAAACTT TTCCTTGCAG AGTATTGGCT ATATGGACAG GCCCATGGGT CCCGATCTCC TGGAGGGGAA CGAGTTTGCT ATCACCATCC GCGCCCTGGG AGCGGCGGAG GTATGCCGCA TCAGCCGCCG GGTTGACGAG GTGCGGGGTT TCGGCTACCC CAACTACTAT GACAACCAGC GTTTTGGCAG CATGGACCGC CAGATGGGCC TCATGGCCGA GAGACTGCTG AAGAAGCATT ATAACGGCAG CCTGCAGATC TACCTTACCG GCATTTACCC GGAAGAAAAA AAAGAGGCCA GGGAACGCAA GCTCTTTTTC CGCGAACACT GGGGTGATTG GTCGACCTGC CTGGCACGTG CTAAAACCAC TATGGAGAGC AGAATATTTT CCTTGCTTGT CGAAAAACCC AAAGCTTACA TCGAGGCTTT GCAGATGATA CCCCGCGAAG AGCTCTCCCT GCTTTTTTCA GCCTACCAGA GTTTTCTTTT TAACGAGCTT TTAAGGAGGA TTTTGCAGGA ATTCGGCCTC GATCTTACGG CCGTACCCGG CACCGCCGGG CCCTATCTTT TTTACCGGCG TCTAGAAAGG AAAGAGCTGG GCTATTTAAG AGCGCTTAGC TTACCGCTGG CTGCCAGCCG CATGGAATTT CCCGATGCCA TGAGCGAGCG GCTCTTTGCG GCCATCCTGG AAGAAAGGGG CATCAAGCGC AGCAGTTTCA ACCTGCGTAA GGTCCGGCAG GCCTTTTTTA AGTCGACGCC CAGGGAAGCC ATAGTTTTTC CCGGTAATTT TCGAATACAG CCGGCGGAGC CCGATGACCT TTACCCCGGC AGGCAAAAAA TCCGCCTCTT CTTCAAGCTG CCGCGGGGGA GCTACGGGAC AATGCTCATC AAAAGGCTGA CCATGCCTTG A
|
Protein sequence | MKLKVIPEDF VVRELARLPI REKGPYRLYL FEKKGWNTID LLIRLAKAHR LPYRLFAYGG LKDRHAHTFQ YVTVKHPADL TTEAENFSLQ SIGYMDRPMG PDLLEGNEFA ITIRALGAAE VCRISRRVDE VRGFGYPNYY DNQRFGSMDR QMGLMAERLL KKHYNGSLQI YLTGIYPEEK KEARERKLFF REHWGDWSTC LARAKTTMES RIFSLLVEKP KAYIEALQMI PREELSLLFS AYQSFLFNEL LRRILQEFGL DLTAVPGTAG PYLFYRRLER KELGYLRALS LPLAASRMEF PDAMSERLFA AILEERGIKR SSFNLRKVRQ AFFKSTPREA IVFPGNFRIQ PAEPDDLYPG RQKIRLFFKL PRGSYGTMLI KRLTMP
|
| |