Gene Moth_1745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1745 
Symbol 
ID3832890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1796991 
End bp1798151 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content54% 
IMG OID637829669 
Productpseudouridylate synthase 
Protein accessionYP_430589 
Protein GI83590580 
COG category[S] Function unknown 
COG ID[COG0585] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00094] tRNA pseudouridine synthase, TruD family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000122124 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.627089 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTAA AGGTGATCCC CGAGGATTTC GTGGTCCGGG AACTGGCCCG CCTTCCTATT 
CGGGAAAAGG GGCCTTACCG GCTGTATCTT TTTGAAAAGA AGGGATGGAA TACCATCGAT
CTTTTGATAC GGCTGGCAAA GGCCCATCGC CTTCCCTACC GACTTTTTGC TTACGGGGGG
TTAAAGGACC GGCACGCCCA TACCTTTCAG TACGTCACAG TGAAGCATCC CGCCGATTTA
ACCACTGAGG CAGAAAACTT TTCCTTGCAG AGTATTGGCT ATATGGACAG GCCCATGGGT
CCCGATCTCC TGGAGGGGAA CGAGTTTGCT ATCACCATCC GCGCCCTGGG AGCGGCGGAG
GTATGCCGCA TCAGCCGCCG GGTTGACGAG GTGCGGGGTT TCGGCTACCC CAACTACTAT
GACAACCAGC GTTTTGGCAG CATGGACCGC CAGATGGGCC TCATGGCCGA GAGACTGCTG
AAGAAGCATT ATAACGGCAG CCTGCAGATC TACCTTACCG GCATTTACCC GGAAGAAAAA
AAAGAGGCCA GGGAACGCAA GCTCTTTTTC CGCGAACACT GGGGTGATTG GTCGACCTGC
CTGGCACGTG CTAAAACCAC TATGGAGAGC AGAATATTTT CCTTGCTTGT CGAAAAACCC
AAAGCTTACA TCGAGGCTTT GCAGATGATA CCCCGCGAAG AGCTCTCCCT GCTTTTTTCA
GCCTACCAGA GTTTTCTTTT TAACGAGCTT TTAAGGAGGA TTTTGCAGGA ATTCGGCCTC
GATCTTACGG CCGTACCCGG CACCGCCGGG CCCTATCTTT TTTACCGGCG TCTAGAAAGG
AAAGAGCTGG GCTATTTAAG AGCGCTTAGC TTACCGCTGG CTGCCAGCCG CATGGAATTT
CCCGATGCCA TGAGCGAGCG GCTCTTTGCG GCCATCCTGG AAGAAAGGGG CATCAAGCGC
AGCAGTTTCA ACCTGCGTAA GGTCCGGCAG GCCTTTTTTA AGTCGACGCC CAGGGAAGCC
ATAGTTTTTC CCGGTAATTT TCGAATACAG CCGGCGGAGC CCGATGACCT TTACCCCGGC
AGGCAAAAAA TCCGCCTCTT CTTCAAGCTG CCGCGGGGGA GCTACGGGAC AATGCTCATC
AAAAGGCTGA CCATGCCTTG A
 
Protein sequence
MKLKVIPEDF VVRELARLPI REKGPYRLYL FEKKGWNTID LLIRLAKAHR LPYRLFAYGG 
LKDRHAHTFQ YVTVKHPADL TTEAENFSLQ SIGYMDRPMG PDLLEGNEFA ITIRALGAAE
VCRISRRVDE VRGFGYPNYY DNQRFGSMDR QMGLMAERLL KKHYNGSLQI YLTGIYPEEK
KEARERKLFF REHWGDWSTC LARAKTTMES RIFSLLVEKP KAYIEALQMI PREELSLLFS
AYQSFLFNEL LRRILQEFGL DLTAVPGTAG PYLFYRRLER KELGYLRALS LPLAASRMEF
PDAMSERLFA AILEERGIKR SSFNLRKVRQ AFFKSTPREA IVFPGNFRIQ PAEPDDLYPG
RQKIRLFFKL PRGSYGTMLI KRLTMP