Gene Information |
Locus tag | Moth_0592 |
Symbol | |
ID | 3830977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 616150 |
End bp | 617439 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637828533 |
Product | putative stage IV sporulation YqfD |
Protein accession | YP_429465 |
Protein GI | 83589456 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02876] sporulation protein YqfD |
|
|
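The coordinate and length fields above are mutually consistent, and the check can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Moth_0592.
length_bp = gene_length(616150, 617439)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the Start bp and End bp values from this record, the sketch reproduces the listed Gene Length (1290 bp) and Protein Length (429 aa).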
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.117032 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGTAC ATAACTGGCA GGCCTACCTG ACCGGCTACC TGGTCCTGAC CATCAGCGGC GACGACCCCG AGGCTTTTCT CAATCTGGCC CTGGCCCGGG GCCTCACCCT CCTGGACGTT ACCCGCACCG GCAAGGGCCA GATTCTGGTC AAAATGCCGG TTAGCGAGTA CCGGTTATTA CGCCCCCTGG CCCGGGAGAG CCACTGCCGT GTCCGTATTG TGGACAAGCG GGGCCTGCCT TTTTTCAGCC GCCGCCTGCG CGGCCGGCAG CTATTACTTG CCGGCGTCCT TTTCTTTTTG GTTGCCATCT ACATCCTCTC GGCCTGCGTC TGGGTGGTAG ATGTCCGGCC CGAAAAGGGA ACCCTGCGCC AGGTTACACC GGACCAGGTT ATAGCCGCCG CCAGGGCTGA AGGCTTGAAG CCCGGTGCCT GGAAGGGCAG GATAGACATC CGCGCTCTGG AGTACGGCCT GGAACAGCGC CTGCCCCAGC TGGCCTGGGT GGGGGTCAGC TTCCACGGCA CCAGGGCAGA AATTAAGGTC GTCGAAAAGA TACCCCCCCC AGCAGGCGAT GACTATGAAT TACCGGCCAG CATAATTGCC GCCAAAGACG GGGTGATTAA ACAGATCCTG GTGATGAATG GTGAAGGGCG GGTGGCTGTC GGCGATACGG TTCGCCGGGG AGAGGTCTTG ATCTCCGGCC TGATCCTGCC TCCGGAGCCG GAGAAAAAAC CGGGTGAAAA CCAGCCGCAG CAGCAACCAC GGCCCGAACC CCGCCTGGTC CGGGCCCGGG GTATTGTCCG GGCCCGGGTC TGGTATGAAG AAGAAAAGGA GATAGAGCGC CTGCAGGTCC GGGAACAGGC CACCGGACGT CAGCAGAGAG CGATAATAAT TAAGACTCCC GGACGCCAGG TCGTTCTCAG GGGTCCGGCC CGGTCCCCCT ATGCCCATTA CCTGCAGGAA AATAAAGTAA TCACTTTACC GTCCTGGAGG AATTTCCCCC CTCCCGTCGA ACTTATCATC AGTACCTACC GCGAGATCCG GATCCAGAAG CAACAACTGG GCTACGAAGA GGCTGTCAGG GTCGCCGGTC AACAGGCCCT GGCCAGCCTG AAAGCCCGGT TGCCGGCAGG GGTAACCATT ACCGGTGAGA AAATAATCCC CCTCACTGGA ACGGGGGAAC AGAGAGTCCG GGTGCGGGCC TGGGTCGAAA CCGAAGAAGA CATCGGCCAG GTGGTCCCTC TGCATGGCCG GACGCCCCCC GGCGGTAACC CTCCGGCGAC CCCCGGGTAA
|
Protein sequence | MTVHNWQAYL TGYLVLTISG DDPEAFLNLA LARGLTLLDV TRTGKGQILV KMPVSEYRLL RPLARESHCR VRIVDKRGLP FFSRRLRGRQ LLLAGVLFFL VAIYILSACV WVVDVRPEKG TLRQVTPDQV IAAARAEGLK PGAWKGRIDI RALEYGLEQR LPQLAWVGVS FHGTRAEIKV VEKIPPPAGD DYELPASIIA AKDGVIKQIL VMNGEGRVAV GDTVRRGEVL ISGLILPPEP EKKPGENQPQ QQPRPEPRLV RARGIVRARV WYEEEKEIER LQVREQATGR QQRAIIIKTP GRQVVLRGPA RSPYAHYLQE NKVITLPSWR NFPPPVELII STYREIRIQK QQLGYEEAVR VAGQQALASL KARLPAGVTI TGEKIIPLTG TGEQRVRVRA WVETEEDIGQ VVPLHGRTPP GGNPPATPG
|
| |
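The gene sequence starts with GTG while the protein sequence starts with M. Under translation table 11, GTG is an accepted initiation codon and is read as methionine when it starts a CDS. The sketch below illustrates this on the first 30 nucleotides of the gene sequence. It is an assumption-laden illustration, not the database's own pipeline: `translate_cds` is a hypothetical helper, the start codon set is a simplified subset of those allowed by table 11, and the codon assignments are the standard ones, which table 11 shares.

```python
# Standard codon assignments (shared by translation table 11), built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)

# Simplified subset of table 11 initiation codons (assumption for this sketch).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an alternative start codon as methionine."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-nt blocks
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"  # initiator tRNA inserts Met regardless of the start codon
    return "".join(protein)


# First three 10-nt blocks of the gene sequence from this record.
prefix = "GTGACCGTAC ATAACTGGCA GGCCTACCTG"
print(translate_cds(prefix))  # MTVHNWQAYL
```

The result matches the first block of the protein sequence listed above. Stop codons are emitted as `*` rather than trimmed, so a caller translating the full gene would need to drop the trailing `*` before comparing against the 429 aa protein.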