Gene Information |
Locus tag | Cthe_0908 |
Symbol | |
ID | 4810529 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1084767 |
End bp | 1085678 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106327 |
Product | ribosomal large subunit pseudouridine synthase D |
Protein accession | YP_001037335 |
Protein GI | 125973425 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0564] Pseudouridylate synthases, 23S RNA-specific |
TIGRFAM ID | [TIGR00005] pseudouridine synthase, RluA family |
|
|
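The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and introduced only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 1084767
END_BP = 1085678
GENE_LENGTH_BP = 912
PROTEIN_LENGTH_AA = 303


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the tabulated values.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1085678 - 1084767 + 1 gives 912 bp, and 912 / 3 - 1 gives 303 aa, which agrees with the Gene Length and Protein Length rows.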
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00261324 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGAAA TTATACTTGT TTCCAATGAA AGCGGGATAA GGATTGATGC GTGGCTTTGC GGAAAGGTCA AAGATTATTC AAGGTCGTAT ATACAAAAGC TTATAAACGA TGGAATGATA CTTGTAAACG GAAAACCTGT AAAATCAAAC TACAAGATAA AACAAAACGA CGAGATACTG GCAAGAATAC CGGAGCCTCA AGTTCTTGAT GTGGTGGCGG AGGATATTGA TGTGCCCATA TTGTACGAGG ATGAGCACAT TATAGTGGTG GACAAGCCCA AGGGCATGGT AGTGCATCCG GCTGCGGGAA ACTATTCCGG CACTTTGGTA AATGCCCTGC TTAAACATTG CGGCACCAAT CTTTCCAATA TAAACGGGAT TATAAGACCC GGTATTGTCC ATAGAATTGA CAAGGATACT TCGGGAGTTT TGGTGGTTGC AAAGAGCAAT GCCGCGCATG AAGGTTTGTC TGAAAAACTT AAGGATCATG ACATTGAAAG AGTTTATATT GCTGTGGTTC ATGGCATAAT CCGGGAGGAT TATGGCAAAA TTGACGCTCC CATAGGAAGG CACCCTGTTG ACAGAAAAAA GATGGCTGTG AACACCAAAA ACGGCAGGCG GGCAGTGACC CGTTTTAAAG TGCTTGAAAG ATTCAAGGAT GCCACTTATA TTGAGGCGAC TTTGGAAACC GGAAGAACAC ATCAGATACG TGTTCATATG TCCTACATAG GCTTCCCCAT TATCGGGGAT ACGGTTTATG GAAGGAAAAA TGATATATAC AACATAAACG GACAGGCTCT TCATGCAAAG AAGCTTGGAT TTGTCCATCC GATAAAGGGA GAGTATATGG AGTTTGAGTC ACCGCTTCCC GAGTATTTTA AAGAGCTTCT TGAAAAGTTG AGAAAAAGTT AA
|
Protein sequence | MEEIILVSNE SGIRIDAWLC GKVKDYSRSY IQKLINDGMI LVNGKPVKSN YKIKQNDEIL ARIPEPQVLD VVAEDIDVPI LYEDEHIIVV DKPKGMVVHP AAGNYSGTLV NALLKHCGTN LSNINGIIRP GIVHRIDKDT SGVLVVAKSN AAHEGLSEKL KDHDIERVYI AVVHGIIRED YGKIDAPIGR HPVDRKKMAV NTKNGRRAVT RFKVLERFKD ATYIEATLET GRTHQIRVHM SYIGFPIIGD TVYGRKNDIY NINGQALHAK KLGFVHPIKG EYMEFESPLP EYFKELLEKL RKS
|
| |
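The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch under stated assumptions: the gene sequence is already given on the coding strand (it starts with ATG and ends with TAA), the space-separated 10-base groups are purely cosmetic, and the amino-acid assignments of translation table 11 match the standard code (table 11 differs in its permitted start codons, which this sketch does not handle). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Compact codon table: codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character groups and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence above.
    print(translate("ATGGAAGAAA TTATACTTGT TTCCAATGAA"))  # MEEIILVSNE
```

Passing the full Gene sequence row to `translate` and comparing the result with `clean` applied to the Protein sequence row is one way to check the two rows against each other. `gc_percent` on the same input can be compared with the GC content row.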