Gene Moth_0869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0869 
Symbol 
ID3831507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp899478 
End bp900398 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content64% 
IMG OID637828799 
Productribosomal large subunit pseudouridine synthase D 
Protein accessionYP_429729 
Protein GI83589720 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAGTCG TGGTTCCACC GGAAGCCAGG GGCCGGCGTA TCGATGCCTG GCTGGCCGGC 
GAACTGCCGG AGGTATCCCG TTCCCGCATC CAGCAGCTCC TGGAGGCCGG GGAGATTACC
CTGGCCCTCC CGGGCCGTCT CAAAGCCAAC TACCGTCTCC GGGGTGGTGA AAGGGTCCGG
GTGCGGTTAC CGGAGCCAAC CCTGCTGGCG GCCAGACCGG AAGCCATTCC CCTGGACATC
CTCTATGAGG ACGAGGATAT AATCGTCGTC AACAAACCCC AGGGAATGGT AGTGCACCCG
GCGCCGGGGA GCGAGGGTGG CACCCTGGTA AACGCTCTAT TGTATCACTG CGGGGACCTG
TCGGGGATTA ACGGCGTCTT ACGGCCCGGC ATTGTCCACC GCCTGGACAA GGATACCTCG
GGCATCCTGG TGGCGGCCAA GAACGACGCC GCCCACCGCG GCCTGGCGGC CCAGATCAAG
GATCACAGTA TGAAAAGGAT TTACCTGGCC CTGGTCCACG GCGAGGTGGC CGAACCCCGG
GGCCGGGTGG AAGCCCCCAT CGGCCGCCAC CCGGTGGACC GCCAGCGCAT GGCCGTTACC
CTGAAGAACT CCCGGCCGGC CGTTACCCAT TACCGGGTGG TGGAGCATTT TCCCGGCTAT
ACCCTCCTGG AAGCGCGCCT GGAAACGGGC CGTACCCACC AGATCCGGGT CCATATGGCC
TTTATCGGCC ACCCGGTAGT AGGAGATCCC AAATATGGTC CCCGCCGTTG CCCCTTTGCG
GTTCCCGGAC AACTCCTCCA CGCCGGGTGC CTGGGGTTTG TTCACCCTGT ACGGGGCGAT
TACCTGGAGT TCACGACACC ACCCCCGTCG ATTTTTTTAC AGGTCCTGGA GCAGTTACGC
CGGGCAAAAG GAGAGAAGTA G
 
Protein sequence
MEVVVPPEAR GRRIDAWLAG ELPEVSRSRI QQLLEAGEIT LALPGRLKAN YRLRGGERVR 
VRLPEPTLLA ARPEAIPLDI LYEDEDIIVV NKPQGMVVHP APGSEGGTLV NALLYHCGDL
SGINGVLRPG IVHRLDKDTS GILVAAKNDA AHRGLAAQIK DHSMKRIYLA LVHGEVAEPR
GRVEAPIGRH PVDRQRMAVT LKNSRPAVTH YRVVEHFPGY TLLEARLETG RTHQIRVHMA
FIGHPVVGDP KYGPRRCPFA VPGQLLHAGC LGFVHPVRGD YLEFTTPPPS IFLQVLEQLR
RAKGEK