Gene Moth_1786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1786 
Symbol 
ID3832452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1839500 
End bp1840885 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content58% 
IMG OID637829711 
Productradical SAM family protein 
Protein accessionYP_430630 
Protein GI83590621 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00175459 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAACCTGG CGGCAACCTA TGTCGGTGAA AAGGTACTGG AGCAGGGGGT AAAATTGATT 
CTCAATAATC CCGAAAAAAA TATCCCGCGC CTGATCACCC TGGCCGAAAA ACTGGCGCGG
GACCCCTACC ACCGGGAGAT GGTAGCCAAC GTTAAGAAAG TACTGGAAAA TAAAGAAGGC
AACTGGTACC AGTTTGCCCG ACGGTTGCTG ACCACAACCC ATCCCAATAT TCGCCAGCGC
CTGGCCATGG ATTTTTTTGT TAATTCTACC TTTATTGGCG TACCGCGGCA AAAGGAATGG
GCGGCGAAAC TGGGAGTGGC AGTGCCCTGG GCCATTCTCA TGGACCCGAC GGAGAAATGT
AACCTCCACT GCCGGGGCTG CTGGGCCGGC GACTACCAGC GGGCCCGGGA GCTGGATTTC
GCCACTATGG ACAGGGTAGT TACTGAGGGG GAGAAGCTAG GGATCAACTT TATTGTCCTC
TCCGGGGGGG AACCCATGAT GCGGCGGGGA GATATTGTTC GCCTGGCAGA GAAACATCCC
GACCAGGTCT TCCATCTCTT TACCAACGGT ACCCTGATTG ACCGGGCCTT TGTGGACGAC
ATGGTTCGCC TGGGTAATAT TACCGTAGCC CTGAGCCTGG AAGGTTTTGA GGAAAAGACC
GACGCCCGCC GGGGTAAAGG CGTTTTCGCC AGGGTGATGC AGGCCATGGA TCTCATGCGC
GAGGCCGGGG CCGTATACGG GGTCTCGGTC ACCTACAGCC GTAACAATAC CGAGGAACTG
GGCAGCGAGG AATTTGTAGA TATGCTGGTG GAAAAGGGCG TGGCCTTTGG CTGGTATTTC
ACCTATATCC CCATCGGCAA GGACGTGGAC TTGGAGATGA TGGCCACGCC GGAGCAGCGG
GCCTGGATGT TCGACCGCAT CCAGTATTTC CGCCAGACGA AACCCATCTT TCTAGTGGAC
TTCTGGAACG ACGGCGAGGC GAGCAACGGC TGTATCGCCG GCGGCCGGCG CTACTTCCAC
ATCAACGCCG CCGGGGAAGT AGAGCCCTGC GCCTTTGTCC ACTACAGTAC CTGTAATATT
AACCATATCA GCCTGGTGGA GGCCCTGCAG AACCCCCTTT TCCGGGCCTA TCAGAAACGC
CAGCCTTTTA ATACCAACCT GCGCCGGCCC TGCCCCCTTA TCGACAACCC GGAGATGCTG
CGGGAGATGG TGGCCGAGGC GGGCGCCCGC TCGACCCAGC TCCACGCTGA CGAGACAGCG
GAGGAGTTCG CGGCCAAACT GGCCCCCTAC GCCCGGGATT GGGGGGCCAT CGCCGACCGC
ATCTGGAATG AGGCCGGGAA GGCGGGTAAG ACAGCTGCGG GAGACAGGTG CTGCCAGGCC
CATTGA
 
Protein sequence
MNLAATYVGE KVLEQGVKLI LNNPEKNIPR LITLAEKLAR DPYHREMVAN VKKVLENKEG 
NWYQFARRLL TTTHPNIRQR LAMDFFVNST FIGVPRQKEW AAKLGVAVPW AILMDPTEKC
NLHCRGCWAG DYQRARELDF ATMDRVVTEG EKLGINFIVL SGGEPMMRRG DIVRLAEKHP
DQVFHLFTNG TLIDRAFVDD MVRLGNITVA LSLEGFEEKT DARRGKGVFA RVMQAMDLMR
EAGAVYGVSV TYSRNNTEEL GSEEFVDMLV EKGVAFGWYF TYIPIGKDVD LEMMATPEQR
AWMFDRIQYF RQTKPIFLVD FWNDGEASNG CIAGGRRYFH INAAGEVEPC AFVHYSTCNI
NHISLVEALQ NPLFRAYQKR QPFNTNLRRP CPLIDNPEML REMVAEAGAR STQLHADETA
EEFAAKLAPY ARDWGAIADR IWNEAGKAGK TAAGDRCCQA H