Gene Moth_0643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0643 
Symbol 
ID3832039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp673123 
End bp676674 
Gene Length3552 bp 
Protein Length1183 aa 
Translation table11 
GC content50% 
IMG OID637828584 
Producthypothetical protein 
Protein accessionYP_429514 
Protein GI83589505 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00455941 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTGGATTA ATGCGCATGT TAATTTACCT GGAGAACTGG TCACAGCCCA GGGGGAGGGC 
AGGTTGGTTA TCTTTGCGGG AGCCGGGGTT TCCAAAGGCT CTCCTTCCAA TTTTCCGGAT
TTTGAGGGAC TGGCCGATGA AGTAATGAGC AGGTCAGCGC AAATATTGAC CAGAGGTAAA
GCGGAACCGG TTGACCATTT TTTTGGCAGG CTTAAAAAGA AAGGTGTCCA CGTGCATCGG
ATAGTAAGAG AAATATTAAC CCGTCCGGAT GCAAAACCAA CTAAGCTGCA TAAAGAGCTC
TTATCGTTAT TTCGTAACCC ACAAGAAGTC CGGGTTGTTA CTACCAACTT CGACCGGCAT
TTTTCCACGG CCGCCAGCAA ACTTTTTGGT GATAATAAAA TCCCGGTATA CTGGGCGCCG
GCGCTGCCCC TGGGCCACCG GTTTAATGGC ATTGTCTATC TGCACGGATG TGTAGATCAG
GAGGCCGAGG AGTTTATTCT AACCGATAGC GACTTCGGTC GTGCTTACCT TACTGAAGGC
TGGGCTACCA GCTTTCTCAA AAGTTTGTTT GGAAAGTATG CAGTTCTATT TGTCGGTTAT
AGTCACAATG ACCAAATCAT GGAGTATCTG GGCAGGAGTT TGCCTCCCGA ATCTTTGCGG
TTTGCTCTGG TACCGGAAGA TATAGGTGAA GAAGAACGGG AAAAGTGGAA ACTCAGAGGG
ATAGAGCCCA TTTTTTATCC TCACACCAAG GGAGATATAG ACCATCAGGC CTTGATCGAA
GCCGTGGAAG CCTGGGCAAG TAGGACCAAA ATGGGTCTGC TAGAACATGA ACAACGCATC
AAGGAGATTG TACGAAATGT ACCACCAATA GACCAGGAAG AAGTTGACTA TATTTTAGAA
ACTTTACGAG ACCCGGCCAG GGCACGAATT TTCACCACGT ACGCCGAAAC ACCGGAGTGG
CTGCGCTGGG TGGAAAAAAG AGATGTGCTT AAACCGTTGT TTAAGGGGGA AGCCCCTGCA
GACGATCTGG CTACAGTTTT TGCTAATTGG TTTGTGGATA AATTTGCCCT TGCTCACCCG
GAAGAGGCCC TGGCCGTAGT CCACAGACAA GGTCTGACAT TTGCACCCGT ACTCTGGTGG
CGAATAGCCC ATGCGCTAAC TTATAATAAG AAACCCCTTG ATCCGGCGGT TCTGGGCAAA
TGGGTGCCCT TGCTGCTTCA ATCTGCTCCT CAACTTGCCA GGCAGGTAAG AGACGTCTTG
AGCATCATGC TAGGATATTG TCGGTATCCG GACGATGCCA CTACAGCCTT GCTTTTGTTT
AGGAAGCTGA CGGAACCCAT GTTCATTCTT GAACCATATA TTCCCCTGGT AGCTCAAGAT
GACCATAACA GAAAGGTTAA CTTTGAAATA ACCATTCCTG GCGAGCTTTA CTGGCTTGAA
AAAGCGTGGA ATGAGCTGTT CAAACCTAAT TTAAATGTCC TCGCCGGGGA GCTGGAAGCA
ATACTTACTA CTCATTTGCG GGAGGCTCAC CGTCTCCTAT GTAGTGTGGG AAGCGCCGAC
AACAAATGGG ATCCCATGAG TTTTAGCCGA ATTGCTATTG AGCCCCATGA GCAGGACCAG
TATCCCAGGA ATGTGGACGT ATTAATCAAC GCCGCCAGGG ATGTTTTAGA ATGGCTGGTT
GCCAATAACC ACGAGCGTGC TAATGTAGTT ATAGAGGAAT GGGCTGCCTC AGATGTTCCC
TTACTCAAGA GGCTGGCGGT TCATGGTATA AGGGTTTGCA GTTACCTGCA GCCAGACGAA
AAGCTTGGGT GGCTTCTGGG AAAAGGTTTT CTTTTTGCCT ATGGTTTGAA GCACGAAGTT
TTTCAACTGC TTAAGACTGC CTACCCGCTG GCCAGTGGAA TTGTGCGCCA ACAAATCTTG
GACCGGGTAG AGCACTACTT GGAACATGAT AAGGACGATG AGCGAGAGAT ACGGCCATAT
GAAGTTTATA ATCTCGTTGT TTGGTTACAC CAGGTAGCGC CAGATTGCGC GTTGGCCGCC
GAAAAGCTAC GGGAAATCCG CGAAAGATAC CCGGATTTTG AGCCAAGGAA GCATCCTGAC
CTAGATAGAA TTATATCCGT GGGCTGGCAT GGCCCGCAGA GTCCGCTGAC AACCGATGAA
CTTCTTGCCA GGTCTCCTTA CGAGAGAGTC GAGTTTCTGC TTACGTACCA GGGAGAGGAG
TTTTTTGGCC CTAACCGGAG GGGGCTCCTG GAGTTACTTA AAGATGCGGT GGCCAAATCA
TTTCAGTGGA GCTGGGAACT TGTCAAGGAG CTTAAGGCGA GAGGTGAATG GACAACTGAC
CTGTGGGGCG TGATTATTTC CGGCTGGCAG AACGCGCCCA AGGATGCAGG CCAGTGGGAG
CAGGTACTGG GGCTACTGGA GAGCGAACCG GAACTGCTGC GGCGATCCGG CTACGAGGTG
GCCGGCTTAC TGGTCAGCGG AGTCGAGAAA GGTGAGGGCG GGTTACCTGT TTCTCTTTTG
GATCGTGCAG AAGCCTGTGC CGACCGGCTC TTTATGGAAA CGGAAAACGC CGCCGTTATG
GAAACGGGAG ATTGGTTATT GCGGGCTATC AATCACATTG GGGGCAAGCT TACCGAGTTC
TGGCTGCACG CCCTTGCCAG GCGACGAAAG GAAAAAGGAA GCGGATGGGT GTCCCTGCCG
GAAGAATACA AGAACAGGTT TGCTCGCATA CTCTCGGGGC AATCGGCCAG TGCGGAAATG
GGCAGGGTAC TGCTGGCCAG TCAGTTACTT TTCTTGTTTG CACTGGATCG CGATTGGACC
CGCGAAAACA TTATCCCCTT GCTCGATTGG AATATTGATT CCAAGAGGGC CGAGCAGGCC
TGGCATGGGT ATCTTTTCTG GGGTAAGTGG AACGAGGCGT TTTTGCCAGA ATTACTACCT
TTGTATGTAC AGGCCTCTCG GGAGCTACCT GCGGAACCAG ATAGAATACG TGAGCGGCTC
TACGAGCATC TGGCTAGCAT AGCCGTACAA AGTTCAATAA ATCCACTACA GGAAGGCTGG
CTTGGTAAAT GTATCGTTGC TGCCGATGAA AAAGAAAGAA TAAAATGGGC CAACAGTATC
TGGCATGAAC TTGCATCTTT GCCCAAGGAG GCTACTAGCC AGCTATGGGG GAAGTGGATG
GAGAAGTACT GGGAAAACCG TCTTTTCGGT ATTCCTATAC CTTTGAGTCC AGGTGAGGCT
GGTGCTATGG TGAACTGGGC CCTTGAACTG GAGCCGGTAT TTCCATCGGT AGTTGAAAAG
ATTATTTCTG GTCCTACTCC TAATTTAGAG CGTGGAACTT TCTACTACCG GTTAAGCCAG
GAAAAAATAG CAAAAAAATA CCCTGAGGAT GTAGCTAGAC TCCTGGTTTA CCTGCTTTCC
GGAACGGACA TGCCCTTTTA TTGGTGTACA GAAGTAAAAA ATATTTTTGA GCAGATAGCA
ACCGCGGGTC TGTCGGAGGA AAAGCTCAGT CTATTGAGAG AACAATTAAT TCGTCTGGGA
TGTTTGTTGT GA
 
Protein sequence
MWINAHVNLP GELVTAQGEG RLVIFAGAGV SKGSPSNFPD FEGLADEVMS RSAQILTRGK 
AEPVDHFFGR LKKKGVHVHR IVREILTRPD AKPTKLHKEL LSLFRNPQEV RVVTTNFDRH
FSTAASKLFG DNKIPVYWAP ALPLGHRFNG IVYLHGCVDQ EAEEFILTDS DFGRAYLTEG
WATSFLKSLF GKYAVLFVGY SHNDQIMEYL GRSLPPESLR FALVPEDIGE EEREKWKLRG
IEPIFYPHTK GDIDHQALIE AVEAWASRTK MGLLEHEQRI KEIVRNVPPI DQEEVDYILE
TLRDPARARI FTTYAETPEW LRWVEKRDVL KPLFKGEAPA DDLATVFANW FVDKFALAHP
EEALAVVHRQ GLTFAPVLWW RIAHALTYNK KPLDPAVLGK WVPLLLQSAP QLARQVRDVL
SIMLGYCRYP DDATTALLLF RKLTEPMFIL EPYIPLVAQD DHNRKVNFEI TIPGELYWLE
KAWNELFKPN LNVLAGELEA ILTTHLREAH RLLCSVGSAD NKWDPMSFSR IAIEPHEQDQ
YPRNVDVLIN AARDVLEWLV ANNHERANVV IEEWAASDVP LLKRLAVHGI RVCSYLQPDE
KLGWLLGKGF LFAYGLKHEV FQLLKTAYPL ASGIVRQQIL DRVEHYLEHD KDDEREIRPY
EVYNLVVWLH QVAPDCALAA EKLREIRERY PDFEPRKHPD LDRIISVGWH GPQSPLTTDE
LLARSPYERV EFLLTYQGEE FFGPNRRGLL ELLKDAVAKS FQWSWELVKE LKARGEWTTD
LWGVIISGWQ NAPKDAGQWE QVLGLLESEP ELLRRSGYEV AGLLVSGVEK GEGGLPVSLL
DRAEACADRL FMETENAAVM ETGDWLLRAI NHIGGKLTEF WLHALARRRK EKGSGWVSLP
EEYKNRFARI LSGQSASAEM GRVLLASQLL FLFALDRDWT RENIIPLLDW NIDSKRAEQA
WHGYLFWGKW NEAFLPELLP LYVQASRELP AEPDRIRERL YEHLASIAVQ SSINPLQEGW
LGKCIVAADE KERIKWANSI WHELASLPKE ATSQLWGKWM EKYWENRLFG IPIPLSPGEA
GAMVNWALEL EPVFPSVVEK IISGPTPNLE RGTFYYRLSQ EKIAKKYPED VARLLVYLLS
GTDMPFYWCT EVKNIFEQIA TAGLSEEKLS LLREQLIRLG CLL