Gene Cthe_1962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1962 
Symbol 
ID4810745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2339325 
End bp2340287 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content42% 
IMG OID640107378 
ProductRluA family pseudouridine synthase 
Protein accessionYP_001038373 
Protein GI125974463 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000134652 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTACGA TAACAATTAC GGAAGACAAG GCAAACAAAA GAATTGACAA AGTTTTAAGA 
GAAACTTTTC CAAGGCTCCC CAACGGAGCC TTGTTTAAAG CCTTTCGCAA AAAGGACATA
AAGGTAAATG GCGTGCGGGT TAAAGAGGAC CACATTGTAA AGCTCAATGA CAGGGTTGAC
ATATATATCA TTGATGAAAT TTTGGACGGC GTGCCCAAAG CGGGAGAACT TAATTATGAA
ACGGCATTTT CGGTCGCCTA TGAAGACAGC AATCTTTTAA TCGTAAACAA AAAACAGGGC
ATACCCGTGC ATCCCGACAG GACACAGACG GAGAATACGC TTATCGATTT TGTAAAAGAG
TACCTTAAGC TAAAGGGTGA ATTTGAAGAA AATTCCGGAT TTACCCCTTC CCTTTGTCAC
AGGCTTGACC GCAACACAGG CGGTCTTGTT ATGATAGCAA AAAACAGCTC CACTCTTCAT
ATGGTGCTCA AAAAGATGAA AAGCGGAGAA ATCAGCAAGT ACTACCAGTG CCTTGTCAAA
GGGAAAATGG AAAAAAAAGA GGATATTTTA AAAGCATACC TTGAAAAAGA CGAGAAGAAA
AGCAGAGTTT TCATCAAGGA CACAAAGTCA AAAAACGCAG TTGAGATAAT TACAGGATAT
AAAGTGCTTT CTTACAAGGA ACTTCCGGAT ATCGGTGAAG GTATCAGCAA CCTTGAGGTT
ACGCTTTACA CCGGCCGCAC CCATCAAATC CGGGCCCACC TTGCCCATAT CGGACATCCT
GTTGTGGGCG ACGGCAAATA CGGAATCAAC ACCTTCAACC GTCTTTTAGG TGCCAAATAT
CAGGCTTTAT GGGCGTACAA GCTAAAGTTC GATTTTAAAA GCGATGCCGG GATTTTGAAT
TACTTAAGGG GAAAGGTAAT ACAGGTACAG CCGGAGTACA AGCTCTCAAA GTCATGGAAA
TAA
 
Protein sequence
MRTITITEDK ANKRIDKVLR ETFPRLPNGA LFKAFRKKDI KVNGVRVKED HIVKLNDRVD 
IYIIDEILDG VPKAGELNYE TAFSVAYEDS NLLIVNKKQG IPVHPDRTQT ENTLIDFVKE
YLKLKGEFEE NSGFTPSLCH RLDRNTGGLV MIAKNSSTLH MVLKKMKSGE ISKYYQCLVK
GKMEKKEDIL KAYLEKDEKK SRVFIKDTKS KNAVEIITGY KVLSYKELPD IGEGISNLEV
TLYTGRTHQI RAHLAHIGHP VVGDGKYGIN TFNRLLGAKY QALWAYKLKF DFKSDAGILN
YLRGKVIQVQ PEYKLSKSWK