Gene Moth_0638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0638 
Symbol 
ID3832034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp661653 
End bp665264 
Gene Length3612 bp 
Protein Length1203 aa 
Translation table11 
GC content50% 
IMG OID637828579 
Producthypothetical protein 
Protein accessionYP_429509 
Protein GI83589500 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000981876 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.418476 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAGA TATGTGAGCT CTTTAACTTT GAAAGGATCC GGGAAGTAGT TGACATAGAC 
GCCATAACCG ATAAGAAGGC AATGGTGGAG AATTATGTTA TTTCCCCATC CCTGGAAGAA
TATCTGGTTC ATCTGTTCCA GGACCTTAAT TCCAGCACAC ATAAAGCGGC TCAGATCATT
GGCGGTTACG GTTCCGGCAA GTCGCACCTT TTGGCCTTCA TTATCTCCCT GCTGACCGAA
CCCGGGCTGG TCAGGTCTGT GCAAAACGAA CAGGTGAGGC GGGCCGTGGA GGCCATGAAG
CGGGATTTTG TGGTAGTTCA CTGGGAGCTG CAGCCTAATG ATGTAAGTTT GAGCGAGTAT
TTCTACGATC AACTGGAAAT CCAATTGAAA GAGAAATACG GCATCTCGAT AAACCTGCCC
ACTTCCGGCG TAATCGACCA CAAAAAGACT ATCCTGCAGA TACTGGAAAA AGTAAAAGAA
GGCCATCCCA CCCGCGGCCT GGTGGTAGTA ATAGACGAGA TTTCCGACTT TCTGAAGCAA
AAGACCAAAG AAAAACTCAA TCAGGATGTC CAGTTCCTGC GCGTGCTTGG CCAGGCGGCG
CAGAGCTGCG ACTTCATGTT TATCGGCGCC ATGCAGGAGC ACATTTTTAC TAATCCCAAG
TACGTAGATG AAGCAGAGAG CTTTGGCCGG GTGGCCGAGC GGTTCCAGGT CATCACCATC
AAACGCGAAG ACATCAAGCG GGTGATTGCC AGACGGGTCC TCAAGAAGAC CGCCGAACAG
AGGCTGCGGC TGGAGGAGCT ATTCAGGGAG TATGTGCACT ACTTCCCGCA CCTGCAGGCC
GGCCTTGATG AGTATATTGA CCTTTTCCCC CTTCACCCCT ATGTAATCCA GGTTTTCAGC
GAGCTACCCT ATTTTGAAAA ACGGGGCGTG ATCCAGTTTA CCATCCAGGA AGTAGAACAA
ATACTGGACC GCGACTTTCC TTTTTTGATC ACCTATGACC GTATATATGA TGAAATTGCC
TCCAAGCATA CCGTAAAGAA TCTGGAGACC GTGTCCCCGG TGGTCAACGC CATCCAGACC
CTGGAAAGTA AAGTAGACCT TTTAGAACCA CGCCATCAGG ATACGGCCCG CAAGATCATT
AAGGCCCTAG CTGTGCTACG GCTATACGGA AAAAGTGTCA ACAACGGGGC CACTGCCGAG
GAACTGGCAA ATACTCTTCT ACTTCTACCC GGCAATAGGC TGATGGAAGC CGCCGACGAA
ATTTCCTTGG TGTTGAAGAA CTTCCGTAGA GTTACCGATG GCCAATTTAT TAACGTTACC
TCCGACGGTT ATTATTTTCT GGACCTTAAT CTGACCATTG ATTACGACCA GGTGATCCGG
CGGCGAGCCG AAAACCTGCC GGAAGACAGC CTGGACGAGG AGATCCTGTC CATCCTCAAA
GAGCAGCTAC TCCTGGAGGA GGGTGCGGCA CGTTATGCTT TCCGGGACAC TTGCGAATGG
TCTGGCCGCC GTTCTTTCCG CGAAGGCACT TTTGTCTATG AAACCGGCAA AGGGGAAGTG
GTCCGGGCCG ACGGCGACTA CCGGATTGTT TTCGTCTCTC CTTTTTGCGC CGGGAACCGC
TACCCGCCAG CCGAAAACTG CGCTGTGATC AGCGGCTCTC TGCCACCTGA AGCCATTGAG
CAGTTAAAGC TGGCCGTTGC AGCTCACCTT TTGTTCAGGG AGAACTACCA GCGTAGCATC
ATGGAGAAAA AATATGCTTC GTTGAAGAAA AAGTTTGTGG AAATGTTCGT GCAGGCATGG
CTGGAAACCG GCCAGGTAAA TATTGGGAAA GCGCAAAAGG GCATCAAGGC TTTGATAGTC
AGGGAGTTTA GCAATTTTGA CGAGCTCTTT ACCGAGATCA AACCCCAGCT ATTTGAGGAT
TATTTTAATC AGAAGTATCC TAAACACCCC AAATTTACTC AGCGAATCAC CCGCGACAAT
ATCATGGGTG AGTTTGGAGC GGCTATTAAG GAGCTACTCA GCAAAGGCAA CGTCCAGTCC
GTTTTTTCCA GTACCAAATC CATTCTCAAT GCCCTGGACC TGCTCGATGC TCAGGGTAAC
CTTTCGACTG CCAATTCAGA AATAGCAGCG GCTATTCTGG AAACCGCTCG AGCCGCTGGT
GGGAAAGTTG TAGCGGTGGA GGATTTTATT AACCGCTTCC GGCAGACCCC CTACGGTTAC
GATCGGTGGA TGACCGCTTT TGTCATCATC GTATTAACCT ACAACGGCGA GATCGTTCTC
AGGGCTGCTG GCGGCACGCT TATTAGCTCT TCTGAAGTGT ACGATACATT CGGATCAGGC
CTGGAGGCTT TCGAGAATAT AAAGTATCTT GCTATAGAAA GCGACTTCAA TCCCCAGCCG
GTTATCAACC TTATGCTGGC CCTGGGTATA GACCCAGGTA TTACTGCTAA AATGCGTGTA
ACCGCAAAAC GTAACGAGGC CTTACAAGCT TTTCGCACCC GCTATCTAGA GATTAAGGAA
CAGTTGGACT TTGTTCGAAA AAAACTGGAG ACCCTGGCTT TGAATGCCGG TAACATAGTT
GATGTTAATG GTCTTAGAAA TTATCACCAA AAATTAGCAG ATATACCCAT TGACAAATTT
GAACAGGTGA AAGCGCCCAA CGATTTCAAG AAAGTGGTGT ACGACGAGGC TGCCATCCAG
AAAATTGGAG AGGCTTATAA GATACTTCAG GATCTGGACC ATTTCTACAA AATGTATTCG
GAACGGCTGG AAAAGGAGGT GGAGTATGCC CGGGAAGTTC GGAAAGTCGT TGAGGAATAT
CCCGGTTTAT TTCTGACCGA TGGACTTAAG GAATTCCTCC ATGAAGCCTT TGCCGTCTTG
GCCGATTCCA GCCGTCTGGT GAACCGGAGC GAGCTGCTGT CGCTTTTGGG CAAGTTGCAG
CAGGCCCGCC AGAAATATGT GACTGCTTAT TATAGGGCGC ACGAAAAGTG TGTAGGCGAA
AGGGCTGGCT GGAACAAACT ACAAGAATTA ACCGCCAGCC AGACCTGGAA AAACCTGCTC
CTTTTGAAGA ACGTTACTGT TTTAAATAAG TACCCCCTAG CCAGAGTGGA AAACGAAATT
TCAGTGCTGG CTACCCAGCG CTGCGAGGGG TTTAAGGTTG AACTCATCGA GAAGAGCCCT
CTGTGCCCGC GGTGTAATTT TCCAAAGAGC TTTAGAGGCG AAAATATCGG CCAGTGGATA
GCTAACCTGG AGGATGAGTT GGAGGACATC TTCAGAGAGT GGGAGGAAAC AATCCTCGCT
GAGGTGAAGA ACTACCAGGA TAACCTGGAC TACCTTGAGG CCGGGGAAAG GGAATTAATT
CTGCAGGTAA TGAAGGCCGG GAAACTGCCG CCGGTGGTCA CCGAGGACCT GGTGGTGGCC
TTGAACAACC TTTTCAGAGA GCTGGTTTCG GTAGAGGTCA GCCCAGCAGA GCTCCTGGAA
GCGGTGTTTG CCGGCGCCCG GGTCATGGAC TACTTCACCT TCGAGCGGAG GCTGAATGCC
TGGAAGCAGA AGCTGGTAGC CGGCCACGAC CTAGACAAGG TGCGCATCAA GCTGGCTGGC
GGGGAAGCGT GA
 
Protein sequence
MAKICELFNF ERIREVVDID AITDKKAMVE NYVISPSLEE YLVHLFQDLN SSTHKAAQII 
GGYGSGKSHL LAFIISLLTE PGLVRSVQNE QVRRAVEAMK RDFVVVHWEL QPNDVSLSEY
FYDQLEIQLK EKYGISINLP TSGVIDHKKT ILQILEKVKE GHPTRGLVVV IDEISDFLKQ
KTKEKLNQDV QFLRVLGQAA QSCDFMFIGA MQEHIFTNPK YVDEAESFGR VAERFQVITI
KREDIKRVIA RRVLKKTAEQ RLRLEELFRE YVHYFPHLQA GLDEYIDLFP LHPYVIQVFS
ELPYFEKRGV IQFTIQEVEQ ILDRDFPFLI TYDRIYDEIA SKHTVKNLET VSPVVNAIQT
LESKVDLLEP RHQDTARKII KALAVLRLYG KSVNNGATAE ELANTLLLLP GNRLMEAADE
ISLVLKNFRR VTDGQFINVT SDGYYFLDLN LTIDYDQVIR RRAENLPEDS LDEEILSILK
EQLLLEEGAA RYAFRDTCEW SGRRSFREGT FVYETGKGEV VRADGDYRIV FVSPFCAGNR
YPPAENCAVI SGSLPPEAIE QLKLAVAAHL LFRENYQRSI MEKKYASLKK KFVEMFVQAW
LETGQVNIGK AQKGIKALIV REFSNFDELF TEIKPQLFED YFNQKYPKHP KFTQRITRDN
IMGEFGAAIK ELLSKGNVQS VFSSTKSILN ALDLLDAQGN LSTANSEIAA AILETARAAG
GKVVAVEDFI NRFRQTPYGY DRWMTAFVII VLTYNGEIVL RAAGGTLISS SEVYDTFGSG
LEAFENIKYL AIESDFNPQP VINLMLALGI DPGITAKMRV TAKRNEALQA FRTRYLEIKE
QLDFVRKKLE TLALNAGNIV DVNGLRNYHQ KLADIPIDKF EQVKAPNDFK KVVYDEAAIQ
KIGEAYKILQ DLDHFYKMYS ERLEKEVEYA REVRKVVEEY PGLFLTDGLK EFLHEAFAVL
ADSSRLVNRS ELLSLLGKLQ QARQKYVTAY YRAHEKCVGE RAGWNKLQEL TASQTWKNLL
LLKNVTVLNK YPLARVENEI SVLATQRCEG FKVELIEKSP LCPRCNFPKS FRGENIGQWI
ANLEDELEDI FREWEETILA EVKNYQDNLD YLEAGERELI LQVMKAGKLP PVVTEDLVVA
LNNLFRELVS VEVSPAELLE AVFAGARVMD YFTFERRLNA WKQKLVAGHD LDKVRIKLAG
GEA