Gene Information |
Locus tag | Cthe_1962 |
Symbol | |
ID | 4810745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2339325 |
End bp | 2340287 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107378 |
Product | RluA family pseudouridine synthase |
Protein accession | YP_001038373 |
Protein GI | 125974463 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0564] Pseudouridylate synthases, 23S RNA-specific |
TIGRFAM ID | [TIGR00005] pseudouridine synthase, RluA family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000134652 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTACGA TAACAATTAC GGAAGACAAG GCAAACAAAA GAATTGACAA AGTTTTAAGA GAAACTTTTC CAAGGCTCCC CAACGGAGCC TTGTTTAAAG CCTTTCGCAA AAAGGACATA AAGGTAAATG GCGTGCGGGT TAAAGAGGAC CACATTGTAA AGCTCAATGA CAGGGTTGAC ATATATATCA TTGATGAAAT TTTGGACGGC GTGCCCAAAG CGGGAGAACT TAATTATGAA ACGGCATTTT CGGTCGCCTA TGAAGACAGC AATCTTTTAA TCGTAAACAA AAAACAGGGC ATACCCGTGC ATCCCGACAG GACACAGACG GAGAATACGC TTATCGATTT TGTAAAAGAG TACCTTAAGC TAAAGGGTGA ATTTGAAGAA AATTCCGGAT TTACCCCTTC CCTTTGTCAC AGGCTTGACC GCAACACAGG CGGTCTTGTT ATGATAGCAA AAAACAGCTC CACTCTTCAT ATGGTGCTCA AAAAGATGAA AAGCGGAGAA ATCAGCAAGT ACTACCAGTG CCTTGTCAAA GGGAAAATGG AAAAAAAAGA GGATATTTTA AAAGCATACC TTGAAAAAGA CGAGAAGAAA AGCAGAGTTT TCATCAAGGA CACAAAGTCA AAAAACGCAG TTGAGATAAT TACAGGATAT AAAGTGCTTT CTTACAAGGA ACTTCCGGAT ATCGGTGAAG GTATCAGCAA CCTTGAGGTT ACGCTTTACA CCGGCCGCAC CCATCAAATC CGGGCCCACC TTGCCCATAT CGGACATCCT GTTGTGGGCG ACGGCAAATA CGGAATCAAC ACCTTCAACC GTCTTTTAGG TGCCAAATAT CAGGCTTTAT GGGCGTACAA GCTAAAGTTC GATTTTAAAA GCGATGCCGG GATTTTGAAT TACTTAAGGG GAAAGGTAAT ACAGGTACAG CCGGAGTACA AGCTCTCAAA GTCATGGAAA TAA
|
Protein sequence | MRTITITEDK ANKRIDKVLR ETFPRLPNGA LFKAFRKKDI KVNGVRVKED HIVKLNDRVD IYIIDEILDG VPKAGELNYE TAFSVAYEDS NLLIVNKKQG IPVHPDRTQT ENTLIDFVKE YLKLKGEFEE NSGFTPSLCH RLDRNTGGLV MIAKNSSTLH MVLKKMKSGE ISKYYQCLVK GKMEKKEDIL KAYLEKDEKK SRVFIKDTKS KNAVEIITGY KVLSYKELPD IGEGISNLEV TLYTGRTHQI RAHLAHIGHP VVGDGKYGIN TFNRLLGAKY QALWAYKLKF DFKSDAGILN YLRGKVIQVQ PEYKLSKSWK
|
| |
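The coordinate and length fields in the record above are related by simple arithmetic, which the following minimal sketch makes explicit. It assumes 1-based inclusive coordinates and that the gene sequence ends in a stop codon (the listed sequence ends in TAA). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced for illustration and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop=True):
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of 3; the final codon is
    treated as a stop codon (not translated) when has_stop is True.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


def gc_percent(seq):
    """Percentage of G and C bases; spaces (10-base grouping) are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record: Start bp 2339325, End bp 2340287.
length = gene_length(2339325, 2340287)
print(length)                  # gene length in bp
print(protein_length(length))  # protein length in aa
```

Passing the space-separated gene sequence string from the Sequence block to `gc_percent` gives a value that can be compared with the GC content field.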