Gene Moth_1053 details

Gene Information

Locus tag: Moth_1053
Symbol:
ID: 3831859
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1083752
End bp: 1084669
Gene Length: 918 bp
Protein Length: 305 aa
Translation table: 11
GC content: 59%
IMG OID: 637828981
Product: tRNA pseudouridine synthase B
Protein accession: YP_429910
Protein GI: 83589901
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0130] Pseudouridine synthase
TIGRFAM ID: [TIGR00431] tRNA pseudouridine 55 synthase
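
The coordinates and lengths listed above are mutually consistent, which a few lines of arithmetic can confirm. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the record states neither convention, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 1083752
end_bp = 1084669
gene_length_bp = 918
protein_length_aa = 305

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one 3-bp stop codon.
coding_codons = (gene_length_bp - 3) // 3

print(span_bp == gene_length_bp)            # True
print(coding_codons == protein_length_aa)   # True
```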


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000253025
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000016309
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTTATGG GTTTTGTTAA TGTCTTAAAA CCGCCGGGAC TTACCTCCCA TGACGTGGTG 
CAGAATCTGC GCCGGCTTCT CAAAGTCAAG AGGATCGGCC ATGGTGGCAC CCTGGACCCT
CTGGCGGCTG GCGTCCTGCC GGTTGCCGTT GGTACGGCTA CCCGTTTGCT GGAATACCTG
CAGGGCGGCG ATAAAGCCTA CCGGGCCGAG TTTATCCTGG GCCTGAAGAC CGACACCCAG
GACTTGGGCG GCCGGGTCCT GGCCAGGAAA CCCTGCCCGC CTTTCACAGA AAAGGATTTA
CAGGCCGCCA CCAGGCCCTT TACGGGGACT ATCAGGCAGG TACCACCCAT GGTATCGGCT
GTGCACTACC AGGGCCGCCG GCTTTATGAA CTGGCAAGGG AGGGCCTGGA GGTTGAACGA
CCGGCCCGCC AGGTGACCAT CCATGAATTT CGGCTGATTA GGGCCTGGCC TGATGGACCT
TACTACCGGG CGTTAATAGA TATCACCTGC TCCCGGGGTA CCTATATCCG TACCCTGGGG
GCTGACTGGG GTGATTACCT GGGGGTAGGT GCCACCCTGG CCTTTTTACT TCGTACCCGA
GCCGGGAGTT TCCGATTGAC AGATGCCTGG ACCCTGGAGG AAATAGCCGG GGCTATAGAT
AGGGGCGAGA GGACCTTCCT TCTCCCGCCC GCCGCCGGCC TGGCCCACCT GCCAGTGATA
ATAGTTCCAG GCGAGTTTAT CCGCCATGTA AGTAACGGGG TAGCCATCAA GGGTGATGTA
TGCCGGCCGC TACCGTCCCT CAGAGAAGGG GATATAGTGC GCCTGGAGAC CGGCGAAGGC
CAACTCCTGG CCCTGGCCAG GGTGGAGCCA GATACCAGGG GGTCCTTCTT ACTAAAACCC
CATAAGGTTT TGAAGTGA
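
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be recomputed. The sketch below shows one way to do that; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample is only the first line of the sequence, so its GC value is not expected to equal the whole-gene figure listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as a short sample.
sample = clean_sequence(
    "ATGGTTATGG GTTTTGTTAA TGTCTTAAAA CCGCCGGGAC TTACCTCCCA TGACGTGGTG"
)
print(len(sample), round(gc_percent(sample), 1))
```

Passing all sixteen sequence lines through `clean_sequence` instead of the sample would allow the listed gene length and GC content to be checked against the sequence itself.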
 
Protein sequence
MVMGFVNVLK PPGLTSHDVV QNLRRLLKVK RIGHGGTLDP LAAGVLPVAV GTATRLLEYL 
QGGDKAYRAE FILGLKTDTQ DLGGRVLARK PCPPFTEKDL QAATRPFTGT IRQVPPMVSA
VHYQGRRLYE LAREGLEVER PARQVTIHEF RLIRAWPDGP YYRALIDITC SRGTYIRTLG
ADWGDYLGVG ATLAFLLRTR AGSFRLTDAW TLEEIAGAID RGERTFLLPP AAGLAHLPVI
IVPGEFIRHV SNGVAIKGDV CRPLPSLREG DIVRLETGEG QLLALARVEP DTRGSFLLKP
HKVLK
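
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a self-contained illustration rather than the database's own method: it builds the codon table by hand, relying on the fact that translation table 11 assigns codons to amino acids exactly as the standard code does and differs only in which codons may serve as starts. The function name `translate` and the choice to stop at the first stop codon are assumptions made for this example.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence above, with block spacing removed.
first_60 = "".join(
    "ATGGTTATGG GTTTTGTTAA TGTCTTAAAA CCGCCGGGAC TTACCTCCCA TGACGTGGTG".split()
)
print(translate(first_60))  # MVMGFVNVLKPPGLTSHDVV
```

The output matches the first 20 residues of the protein sequence listed above.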