Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0501 |
Symbol | |
ID | 3832824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 518159 |
End bp | 519379 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637828435 |
Product | hypothetical protein |
Protein accession | YP_429374 |
Protein GI | 83589365 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000000761016 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTGCC AGTATTGTGG TAAAAAAATA GAAGCGGGGT CCCGCTTTTG TAGTGGTTGT GGACATGAAC TTAGGTCAAT TAATAATGAA GATACAGTTG TTCTGGCCCG GCCGTCCCTG GATCAAGTCA AACAGGAGGA AGGATCGGAG CATACAACGA CCCAACAGCA GGTACCGGCC AAGGGTAAAA GCATCTGGCT ATTACCTTTA GCTACCGCGG TTTTAGTAGC GGTGGTATTG GGTGGATATT ATGCCTATGA ACAGTATATT AACCGGCTGG TGGAGCAAGA TCGGGTGCAA GCAGAAAATT TGGCCCTGCA GGGAGATCTG GATAAGGCGG AAAAACTGAT TAGCAATGCT TTAAACAAGC GGCCCCGGCA CAAAACACTG CAGGCTGACC TTGCATATGT TAGAGAAGGT CAGGAGGTGC AAAGTGAGTT AAATGAAGCC TGGGAACATA CCAAGCAGCA GCAATTTAAT CAAGCCCTGT CCTTAATTGA ACAGGCGGAG AAAAAAGTGT CCGGTAAGGA AGGAGATTTT TACAAATACC TTAGTCAATT AATTAATGAC AAAAAAGCCG CGGTCAATAT TATGCAGGTT AAACAGGAGA TGAATAATAA GAATTCTATC GAGGAACTGG CTGCTTTATT GACTAAAATA TCTTCCTTTA AAGTTAAGGA GGCCCAGGAA GTAGCGAAAG TTATAAAGAC CAAAATTTGC CAATTAATTT ATACCAAAGC CAATGAATTG TTAAAGAAAA AGGATTTCGC CGGTGCACTG GCTTTAGTGC AGCAGGGACT GGGCTACGAT AGCGAGAACC AGCAGTTGTT ATCATTCCAA AAAACCATTG AACAACAGAA GGCGGCCTTT GAGCAGAATG AGCAAATGAT TCTGGAACAA GCCCAACTGG CTGCCGCCCG GGAAAACGCC ATAAATCATA CCCAGGCTGT AGAAGTGTTA AAATGCGACG GCAGTGTTAC TGCCCAAGGT GATTTCCGGG TATGGGGAAC CGTACGCAAT GTAGCCACAC GTCCCATCTA CATGGTGGAG ATTTACTATA CCGTCTATGA TGCTGCCGGT AATGCCCTGA CCACTGATAG TACCTATGTT TATCCTAATT ATTTAAATCC CCGGGATCAG GGTAGCTTTG ACAACACTTC TTACGGGTTG TGGCAGGGTA ATCGAGTAAA AATTAACAGG ATCACCTGGT ATTTAAGGTA G
|
Protein sequence | MFCQYCGKKI EAGSRFCSGC GHELRSINNE DTVVLARPSL DQVKQEEGSE HTTTQQQVPA KGKSIWLLPL ATAVLVAVVL GGYYAYEQYI NRLVEQDRVQ AENLALQGDL DKAEKLISNA LNKRPRHKTL QADLAYVREG QEVQSELNEA WEHTKQQQFN QALSLIEQAE KKVSGKEGDF YKYLSQLIND KKAAVNIMQV KQEMNNKNSI EELAALLTKI SSFKVKEAQE VAKVIKTKIC QLIYTKANEL LKKKDFAGAL ALVQQGLGYD SENQQLLSFQ KTIEQQKAAF EQNEQMILEQ AQLAAARENA INHTQAVEVL KCDGSVTAQG DFRVWGTVRN VATRPIYMVE IYYTVYDAAG NALTTDSTYV YPNYLNPRDQ GSFDNTSYGL WQGNRVKINR ITWYLR
|
| |