Gene Information |
Locus tag | Moth_0638 |
Symbol | |
ID | 3832034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 661653 |
End bp | 665264 |
Gene Length | 3612 bp |
Protein Length | 1203 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637828579 |
Product | hypothetical protein |
Protein accession | YP_429509 |
Protein GI | 83589500 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
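The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates, that the gene is unspliced (as the record states), and that the reported gene length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and that the final
    codon is a stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Moth_0638.
start_bp, end_bp = 661653, 665264

length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)

print(f"Gene length: {length_bp} bp")
print(f"Protein length: {length_aa} aa")
```

Under these assumptions the computed values agree with the Gene Length (3612 bp) and Protein Length (1203 aa) fields in the record.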
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000981876 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.418476 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAAGA TATGTGAGCT CTTTAACTTT GAAAGGATCC GGGAAGTAGT TGACATAGAC GCCATAACCG ATAAGAAGGC AATGGTGGAG AATTATGTTA TTTCCCCATC CCTGGAAGAA TATCTGGTTC ATCTGTTCCA GGACCTTAAT TCCAGCACAC ATAAAGCGGC TCAGATCATT GGCGGTTACG GTTCCGGCAA GTCGCACCTT TTGGCCTTCA TTATCTCCCT GCTGACCGAA CCCGGGCTGG TCAGGTCTGT GCAAAACGAA CAGGTGAGGC GGGCCGTGGA GGCCATGAAG CGGGATTTTG TGGTAGTTCA CTGGGAGCTG CAGCCTAATG ATGTAAGTTT GAGCGAGTAT TTCTACGATC AACTGGAAAT CCAATTGAAA GAGAAATACG GCATCTCGAT AAACCTGCCC ACTTCCGGCG TAATCGACCA CAAAAAGACT ATCCTGCAGA TACTGGAAAA AGTAAAAGAA GGCCATCCCA CCCGCGGCCT GGTGGTAGTA ATAGACGAGA TTTCCGACTT TCTGAAGCAA AAGACCAAAG AAAAACTCAA TCAGGATGTC CAGTTCCTGC GCGTGCTTGG CCAGGCGGCG CAGAGCTGCG ACTTCATGTT TATCGGCGCC ATGCAGGAGC ACATTTTTAC TAATCCCAAG TACGTAGATG AAGCAGAGAG CTTTGGCCGG GTGGCCGAGC GGTTCCAGGT CATCACCATC AAACGCGAAG ACATCAAGCG GGTGATTGCC AGACGGGTCC TCAAGAAGAC CGCCGAACAG AGGCTGCGGC TGGAGGAGCT ATTCAGGGAG TATGTGCACT ACTTCCCGCA CCTGCAGGCC GGCCTTGATG AGTATATTGA CCTTTTCCCC CTTCACCCCT ATGTAATCCA GGTTTTCAGC GAGCTACCCT ATTTTGAAAA ACGGGGCGTG ATCCAGTTTA CCATCCAGGA AGTAGAACAA ATACTGGACC GCGACTTTCC TTTTTTGATC ACCTATGACC GTATATATGA TGAAATTGCC TCCAAGCATA CCGTAAAGAA TCTGGAGACC GTGTCCCCGG TGGTCAACGC CATCCAGACC CTGGAAAGTA AAGTAGACCT TTTAGAACCA CGCCATCAGG ATACGGCCCG CAAGATCATT AAGGCCCTAG CTGTGCTACG GCTATACGGA AAAAGTGTCA ACAACGGGGC CACTGCCGAG GAACTGGCAA ATACTCTTCT ACTTCTACCC GGCAATAGGC TGATGGAAGC CGCCGACGAA ATTTCCTTGG TGTTGAAGAA CTTCCGTAGA GTTACCGATG GCCAATTTAT TAACGTTACC TCCGACGGTT ATTATTTTCT GGACCTTAAT CTGACCATTG ATTACGACCA GGTGATCCGG CGGCGAGCCG AAAACCTGCC GGAAGACAGC CTGGACGAGG AGATCCTGTC CATCCTCAAA GAGCAGCTAC TCCTGGAGGA GGGTGCGGCA CGTTATGCTT TCCGGGACAC TTGCGAATGG TCTGGCCGCC GTTCTTTCCG CGAAGGCACT TTTGTCTATG AAACCGGCAA AGGGGAAGTG GTCCGGGCCG ACGGCGACTA CCGGATTGTT TTCGTCTCTC CTTTTTGCGC CGGGAACCGC TACCCGCCAG CCGAAAACTG CGCTGTGATC AGCGGCTCTC TGCCACCTGA AGCCATTGAG CAGTTAAAGC TGGCCGTTGC AGCTCACCTT TTGTTCAGGG AGAACTACCA GCGTAGCATC ATGGAGAAAA AATATGCTTC GTTGAAGAAA AAGTTTGTGG AAATGTTCGT GCAGGCATGG 
CTGGAAACCG GCCAGGTAAA TATTGGGAAA GCGCAAAAGG GCATCAAGGC TTTGATAGTC AGGGAGTTTA GCAATTTTGA CGAGCTCTTT ACCGAGATCA AACCCCAGCT ATTTGAGGAT TATTTTAATC AGAAGTATCC TAAACACCCC AAATTTACTC AGCGAATCAC CCGCGACAAT ATCATGGGTG AGTTTGGAGC GGCTATTAAG GAGCTACTCA GCAAAGGCAA CGTCCAGTCC GTTTTTTCCA GTACCAAATC CATTCTCAAT GCCCTGGACC TGCTCGATGC TCAGGGTAAC CTTTCGACTG CCAATTCAGA AATAGCAGCG GCTATTCTGG AAACCGCTCG AGCCGCTGGT GGGAAAGTTG TAGCGGTGGA GGATTTTATT AACCGCTTCC GGCAGACCCC CTACGGTTAC GATCGGTGGA TGACCGCTTT TGTCATCATC GTATTAACCT ACAACGGCGA GATCGTTCTC AGGGCTGCTG GCGGCACGCT TATTAGCTCT TCTGAAGTGT ACGATACATT CGGATCAGGC CTGGAGGCTT TCGAGAATAT AAAGTATCTT GCTATAGAAA GCGACTTCAA TCCCCAGCCG GTTATCAACC TTATGCTGGC CCTGGGTATA GACCCAGGTA TTACTGCTAA AATGCGTGTA ACCGCAAAAC GTAACGAGGC CTTACAAGCT TTTCGCACCC GCTATCTAGA GATTAAGGAA CAGTTGGACT TTGTTCGAAA AAAACTGGAG ACCCTGGCTT TGAATGCCGG TAACATAGTT GATGTTAATG GTCTTAGAAA TTATCACCAA AAATTAGCAG ATATACCCAT TGACAAATTT GAACAGGTGA AAGCGCCCAA CGATTTCAAG AAAGTGGTGT ACGACGAGGC TGCCATCCAG AAAATTGGAG AGGCTTATAA GATACTTCAG GATCTGGACC ATTTCTACAA AATGTATTCG GAACGGCTGG AAAAGGAGGT GGAGTATGCC CGGGAAGTTC GGAAAGTCGT TGAGGAATAT CCCGGTTTAT TTCTGACCGA TGGACTTAAG GAATTCCTCC ATGAAGCCTT TGCCGTCTTG GCCGATTCCA GCCGTCTGGT GAACCGGAGC GAGCTGCTGT CGCTTTTGGG CAAGTTGCAG CAGGCCCGCC AGAAATATGT GACTGCTTAT TATAGGGCGC ACGAAAAGTG TGTAGGCGAA AGGGCTGGCT GGAACAAACT ACAAGAATTA ACCGCCAGCC AGACCTGGAA AAACCTGCTC CTTTTGAAGA ACGTTACTGT TTTAAATAAG TACCCCCTAG CCAGAGTGGA AAACGAAATT TCAGTGCTGG CTACCCAGCG CTGCGAGGGG TTTAAGGTTG AACTCATCGA GAAGAGCCCT CTGTGCCCGC GGTGTAATTT TCCAAAGAGC TTTAGAGGCG AAAATATCGG CCAGTGGATA GCTAACCTGG AGGATGAGTT GGAGGACATC TTCAGAGAGT GGGAGGAAAC AATCCTCGCT GAGGTGAAGA ACTACCAGGA TAACCTGGAC TACCTTGAGG CCGGGGAAAG GGAATTAATT CTGCAGGTAA TGAAGGCCGG GAAACTGCCG CCGGTGGTCA CCGAGGACCT GGTGGTGGCC TTGAACAACC TTTTCAGAGA GCTGGTTTCG GTAGAGGTCA GCCCAGCAGA GCTCCTGGAA GCGGTGTTTG CCGGCGCCCG GGTCATGGAC TACTTCACCT TCGAGCGGAG GCTGAATGCC TGGAAGCAGA AGCTGGTAGC CGGCCACGAC CTAGACAAGG TGCGCATCAA GCTGGCTGGC GGGGAAGCGT GA
|
Protein sequence | MAKICELFNF ERIREVVDID AITDKKAMVE NYVISPSLEE YLVHLFQDLN SSTHKAAQII GGYGSGKSHL LAFIISLLTE PGLVRSVQNE QVRRAVEAMK RDFVVVHWEL QPNDVSLSEY FYDQLEIQLK EKYGISINLP TSGVIDHKKT ILQILEKVKE GHPTRGLVVV IDEISDFLKQ KTKEKLNQDV QFLRVLGQAA QSCDFMFIGA MQEHIFTNPK YVDEAESFGR VAERFQVITI KREDIKRVIA RRVLKKTAEQ RLRLEELFRE YVHYFPHLQA GLDEYIDLFP LHPYVIQVFS ELPYFEKRGV IQFTIQEVEQ ILDRDFPFLI TYDRIYDEIA SKHTVKNLET VSPVVNAIQT LESKVDLLEP RHQDTARKII KALAVLRLYG KSVNNGATAE ELANTLLLLP GNRLMEAADE ISLVLKNFRR VTDGQFINVT SDGYYFLDLN LTIDYDQVIR RRAENLPEDS LDEEILSILK EQLLLEEGAA RYAFRDTCEW SGRRSFREGT FVYETGKGEV VRADGDYRIV FVSPFCAGNR YPPAENCAVI SGSLPPEAIE QLKLAVAAHL LFRENYQRSI MEKKYASLKK KFVEMFVQAW LETGQVNIGK AQKGIKALIV REFSNFDELF TEIKPQLFED YFNQKYPKHP KFTQRITRDN IMGEFGAAIK ELLSKGNVQS VFSSTKSILN ALDLLDAQGN LSTANSEIAA AILETARAAG GKVVAVEDFI NRFRQTPYGY DRWMTAFVII VLTYNGEIVL RAAGGTLISS SEVYDTFGSG LEAFENIKYL AIESDFNPQP VINLMLALGI DPGITAKMRV TAKRNEALQA FRTRYLEIKE QLDFVRKKLE TLALNAGNIV DVNGLRNYHQ KLADIPIDKF EQVKAPNDFK KVVYDEAAIQ KIGEAYKILQ DLDHFYKMYS ERLEKEVEYA REVRKVVEEY PGLFLTDGLK EFLHEAFAVL ADSSRLVNRS ELLSLLGKLQ QARQKYVTAY YRAHEKCVGE RAGWNKLQEL TASQTWKNLL LLKNVTVLNK YPLARVENEI SVLATQRCEG FKVELIEKSP LCPRCNFPKS FRGENIGQWI ANLEDELEDI FREWEETILA EVKNYQDNLD YLEAGERELI LQVMKAGKLP PVVTEDLVVA LNNLFRELVS VEVSPAELLE AVFAGARVMD YFTFERRLNA WKQKLVAGHD LDKVRIKLAG GEA
|