Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0656 |
Symbol | |
ID | 3832143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 686504 |
End bp | 688486 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637828597 |
Product | hypothetical protein |
Protein accession | YP_429527 |
Protein GI | 83589518 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000140636 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000360107 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAAAA GAATAGTTTC CCTTTTGATG GCAGCAATTC TACTTTTGTT GTTGCCAGTC ACCCCTTCGG CCGCAACCCC CCAATTCACC GACATCCAGA ACCACTGGGC GAAAGACTAC ATCCTTAGTT TTGCCAATAA AGGTTTTGTC AAGGGCTACC CGGACCAGAC CTTCAAACCC GACAGGCCAA TCAGCCGGGC AGAGTTCACC TGCATACTGC TCAACTGCCT GGGTATCACC CCGGCGTCAG ACGTTAACAC ACCGACCTTT AGCGATACCA CCAACCACTG GACCAGGGCC CAGATCGCCG AAGCGGTGCG GAGGGGGATC CTGGTAGTGA GCGAATACCC CGGTGGGCTC AAACCTGATG ATCCCATATA TCGCAGTGAA GCTGCCGCTA TGATGATTAG AGCCCTGGGG AAGAGTCCCG ACATGACCCC AACCTCTTTT AAAGACAGCA ACCAGATTGC AAAGAGCATG TACCGGGGCT ACATCAAGGC AGCCTCCAGC GAAGGGCTGA TGCACGGCTA CCCGGATGGA ACGTTCCGTC CCTTCCAGGG CGTGAAAAGG GGTGAGGCCT GTGCCATGCT GGTCAATCTG CTAGGCAAGA TCGGTACCGC CTCTCCTCCT GCCGTCCAGG TCAACCCATC AAGCAACAGT GCCCTGTCCG CCCTGGTAAT CCAGGGCAAC CGCTACAAAC TGGGGGACAC TGTCGTTTAC CTCAAGCGGG ACTCGACTAA CATACCCATA TACTCCCTCA GCGTGGCGGG GGGCTTAGTC TTCATCAACA ACACCTTCAC TTACCCTCTA AACAGCACCG ACAATAACCC GGATCTGGTC GTCAACAATA CCCGCTATGT CCAATGCCGG CTCAGCGTAA GCGGCAGCGA TTTACAGGTC ACCCCGGGCG CCGTGAAGCT GGATTCCATC TCATATAACG GCTACAAGTA TAACGCCGAT TACGTGAAGC TCTATATCGG CAACAAAAAC GGCAGCTACT ACCTGTCCGA CGCCGAACTG GTAGATCGGC AAACCATCAG GGTCGGCGGT AACAGCTATG ACATCGGCAG CACCCCGGTA TCCATCGCCC TGGGGGATAA TTTCTATGCC ATCAACGGGA TCAATTACGA CAGCAGCAGT ATCAGCCTGG ACCTGGCAGC CACCACCCCG GTGGTGACGA ACGGCCTTGA TATCTCGGAT ATCTCTGCCA TCTTCGTGGA TACCAGGTCC CTGGACCTCA ATACCATCAG CAGCCTCTTT TTCATTATTG ACGGCAGCAG GTATGACCGC TCGGAAGTGG TCATCGACGC CTCCGGCAAT TTTACGGCTA ATAACAAGTA TTATACCCCC GATCAGGTTA CCATGGTTAT AAACAACAGC TTCTATAAGC TAACCGACGT CAAGTCCTTC GGCGGCAAGT TTATATTCTA CTGCACCGCC AGCAATGTGA CCACCTGGGC AATAGTTAAT GATAAATACC AGGATGCCAG CACGATCCAG ATCCTGATGG GGAACAACAT TTACACCCTG GACAAGATCC TGGTGGTGCA GCATAACGTG ATCCGCATCG GGGGGCGCCA ATATAAACTC GGCGACATAT TCGGCTGCCG GATTAACGGA ACATTGTACG ACATCGAAGA TATCGATTAC GACAACAGCC TGGATCTGGT AACGATGGAC GTTACCGAAT CCACCGGCAG TTGGACCGGC TACCTCCCCG GCCAGCCCCA AAAGTACTTA TTCTACGTCG ATAACTCAAT CTATCAAGAC GGCGCCACCG GCAATGTCAC CATTTACGCC GGCGGAGGCT GGAGGACATT TGACAGCATC ACCTTCTCCG ATCAGTCCCA TTTCGTATAC GACAACACGA CTTATAACCT GTTGGGAGCA GAGATAAAGA TTGGAGACAC CGTTTTTACG GTCGTCGATT CCGCCTGGCG GGTCAGCTCT CAGGTTATGG AGGTGTACCT GCAGAAGGCC TGA
|
Protein sequence | MKKRIVSLLM AAILLLLLPV TPSAATPQFT DIQNHWAKDY ILSFANKGFV KGYPDQTFKP DRPISRAEFT CILLNCLGIT PASDVNTPTF SDTTNHWTRA QIAEAVRRGI LVVSEYPGGL KPDDPIYRSE AAAMMIRALG KSPDMTPTSF KDSNQIAKSM YRGYIKAASS EGLMHGYPDG TFRPFQGVKR GEACAMLVNL LGKIGTASPP AVQVNPSSNS ALSALVIQGN RYKLGDTVVY LKRDSTNIPI YSLSVAGGLV FINNTFTYPL NSTDNNPDLV VNNTRYVQCR LSVSGSDLQV TPGAVKLDSI SYNGYKYNAD YVKLYIGNKN GSYYLSDAEL VDRQTIRVGG NSYDIGSTPV SIALGDNFYA INGINYDSSS ISLDLAATTP VVTNGLDISD ISAIFVDTRS LDLNTISSLF FIIDGSRYDR SEVVIDASGN FTANNKYYTP DQVTMVINNS FYKLTDVKSF GGKFIFYCTA SNVTTWAIVN DKYQDASTIQ ILMGNNIYTL DKILVVQHNV IRIGGRQYKL GDIFGCRING TLYDIEDIDY DNSLDLVTMD VTESTGSWTG YLPGQPQKYL FYVDNSIYQD GATGNVTIYA GGGWRTFDSI TFSDQSHFVY DNTTYNLLGA EIKIGDTVFT VVDSAWRVSS QVMEVYLQKA
|
| |