Gene Moth_0197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0197 
Symbol 
ID3832270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp191849 
End bp193819 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content64% 
IMG OID637828133 
Productradical SAM family protein 
Protein accessionYP_429075 
Protein GI83589066 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGCATCC TGTTAACCAC CCTCAACGCC AAATACATCC ACGTCAACCT GGCGCTGCGC 
TACCTGCGGG CCTGCTGCCG CGACCTGCCC CACACCTTTA TCCTGGATGA GTTCACCATC
AACGACCACC CGGAGCATAT CGCCGCCGCC ATCTACCGGC ACCGGCCCGA CCTGGTGGCC
TTCTCCTGCT ATATCTGGAA CATTGAGCCC ACCCTGGCCG TAGCGGCGAT TCTTAAGAAA
GTACGGCCGG AACTCACCAT CCTCTGCGGC GGCCCGGAGG TCTCTTTCGA TACGGCCGCT
TTTCTAGCGC AAAACCCCCA GATCGACCTG GTAATCACCG GTGAAGGGGA AATCCCCTTC
CGTGCCCTCC TGGAACAACT GGCCGGGCAG CAGCGCCCCC CGGCAGGGAA CCGGGAGGAA
CCCCTTACCG CCTCCATGTA CCGGCCGGCC GGGGGGAGGC CTGCCCCGGG GCCTGTGCCC
GACCCGGCGG CCATCCCCGG CCTGGCCTGG CGGCGGGGCG CAGAGACGAT CGTCAACCCG
CCGGCGCCGC CTCTCCGCGA CCTGGACACT ATACCCTTCC CCTACCAGGA AGACCTGGAC
GCCGCCGGCA ACGAACTCAG CCAGCGCACG GTATACTACG AAACCTCCCG CGGCTGCCCC
TTCGCCTGCG GTTTCTGCCT CTCATCCACC ACGGAGGGCC TGCGCTACTT CTCCCTGGAG
CGTGTCCGGG ACGACCTGGA ACGCCTGCTC AAGGCCGGCG TCCGGGAGAT CAAGTTCGTC
GACCGCACCT TTAACGCCCA TAAAAAAAGG GCCCTGGCCA TCTGGGAGTT CCTCCTTTCC
CGCAGGCCCC GCGCCCGTTG TTACTTTGAA ATCGCCGGCG ACCGCCTGGA CGAAGAGATG
CTGGCCCATC TCAACCGGGT GCCGCCGGAT CTGTTCCAGT TCGAAATCGG CGTCCAGACC
ATCGATGCCG GGGTGAACGC CCGCTGCAAT CGCCGCCAGG ACTGGGCCCG CCTGGCTGCC
AACACCCGCC GCCTGCGGGA GATGGGCAGC ATCCGCCTGC ACCTGGACCT CATCGCCGGC
CTGCCCGGGG AGACTTACGC AGGAGTCGGC GAAAGCTTCG ACGCCGTAGT GGCCCTGAAA
CCCCACGAGA TCCAGCTCGG TTTTCTGAAA CTCCTGAAAG GCACGACCCT GCGCGCCCGG
GCCGATGAGT TCGGCTACCT CTTCCTCGAC CGGCCGCCCT ACCAGGTCCT GGCCAGCAGC
GCCATCACCT ACGAGGAGAT GCTGCGCCTC CACGCCATTG AAGGCCTCCT TAAGTATTAC
GGCAACAGCC ACCTGGCCGA CCACGCTTTT GCCTACCTGG CCGCCACCGC CTTTGACGGC
AGCTACTTCG CCATCTACGA GGCCCTGGCC GCCTGGTGGG AGGCCCGGGG CCTCCTGCGC
CGGGGCCACA GCCAGCGGGA TCTCTTCAAC CACCTGGCCT CCTTCGCCAT CCACCTGGCC
CGGGGGATCG CCACAAGGGT TACCGTACCG CCCTTACCTG GCTATCATGA TAGCATTCTT
GAGATAAACA GCCACGGCGA TACCCCCGCC TCCGGAGCAG AACACTCGGG GTGCGCCCGC
AACCTGACGC CGGCCCAGCT GGACCGCTTC TACCAGCTCC TTAAATTCGA CTGCCTCTGC
CGCGACCGCA GCCGCAATTT CCCGGGCTGG ATGCCGCCCT CGCCCTTGAC GGAGGAGGAG
CGTTCAAACT GGATGGCGCG GATAACCACT CCCCGCTCCA TGGAAAAATA TATGCCGGGG
CTGGTGAAAG AATCCCCGGC AAATTTACGC CGCCACGGCT TTATCGAACT CTTCCCCTGC
CACCCGGAAA GGCCGGAAGA AGACAACCCT ACCATAGTCT TCTTTTACTA CGGTCCCCCT
GGAAGTGAGA CAAGGGTATA TTATTTACAG GCGCCCCCCT TAGTAGTATA A
 
Protein sequence
MRILLTTLNA KYIHVNLALR YLRACCRDLP HTFILDEFTI NDHPEHIAAA IYRHRPDLVA 
FSCYIWNIEP TLAVAAILKK VRPELTILCG GPEVSFDTAA FLAQNPQIDL VITGEGEIPF
RALLEQLAGQ QRPPAGNREE PLTASMYRPA GGRPAPGPVP DPAAIPGLAW RRGAETIVNP
PAPPLRDLDT IPFPYQEDLD AAGNELSQRT VYYETSRGCP FACGFCLSST TEGLRYFSLE
RVRDDLERLL KAGVREIKFV DRTFNAHKKR ALAIWEFLLS RRPRARCYFE IAGDRLDEEM
LAHLNRVPPD LFQFEIGVQT IDAGVNARCN RRQDWARLAA NTRRLREMGS IRLHLDLIAG
LPGETYAGVG ESFDAVVALK PHEIQLGFLK LLKGTTLRAR ADEFGYLFLD RPPYQVLASS
AITYEEMLRL HAIEGLLKYY GNSHLADHAF AYLAATAFDG SYFAIYEALA AWWEARGLLR
RGHSQRDLFN HLASFAIHLA RGIATRVTVP PLPGYHDSIL EINSHGDTPA SGAEHSGCAR
NLTPAQLDRF YQLLKFDCLC RDRSRNFPGW MPPSPLTEEE RSNWMARITT PRSMEKYMPG
LVKESPANLR RHGFIELFPC HPERPEEDNP TIVFFYYGPP GSETRVYYLQ APPLVV