Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1931 |
Symbol | |
ID | 3832423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2005722 |
End bp | 2006945 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637829863 |
Product | hypothetical protein |
Protein accession | YP_430773 |
Protein GI | 83590764 |
COG category | [S] Function unknown |
COG ID | [COG2461] Uncharacterized conserved protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000000706509 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00330228 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGAAC TGCTCAACAA CCAGGATTAC CGTAAAGAAG CCTTAAAAGA AATTATCCGG GAATTGCACC GGGGCAAGAG CGTGGAGGAA GTGAAGGCCA GGTTTAATGA ACTGATCAAG GATGTGGCCC CGGCGGAGAT CTCCCTCATG GAGCAGGCCC TGATCAACGA AGGCCTGCCG GTGGAGGAGG TCCAGCGCCT CTGCGACGTC CACGCGGCGG TCTTTAAAGA GTCATTAGAA AGGGCGCCGC AACCGGAAAC CATCCCCGGT CACCCGGTGC ATACTTTTAA AGAAGAAAAC CGGGCCCTGG AAGATTTAAT GATCAGAGAG ATTCAACCGC TCCTGGCTGA ATTGCGCCGG GCGAACCCGG ACGTTGAAAA AGACCTGGCC ATAAAGCTGG CGGAAAAACT GAATCTTCTC CAGGATGTCA ACAAGCATTA TAGCCGCAAG GAGAATCTCC TCTTCCCTTA CCTGGAGAAG TACCAGATAG TAGGGCCGCC CAAGGTCATG TGGGGCGTCG ACGACGAGAT CAGGGACCTC TTAAAGGAAG CCCGGGACCT GGCCGTCAAT TACGTACCGG ATAAAAAAGA AGAACTCATT ACCAGAACAG AAGCCGCCCT GGCAAAAATC AAAGAAATGA TCTTTAAAGA AGAAAGGATT CTCTTCCCCA TGGCCCTGGA GACCCTGACC GAGGACGAGT GGTACCGGAT CATGCTTGAC AGCGCCAGTA TCGGTTATTG CCTCATCGAG CCCCGGGAAG ACTGGCGGCC GGCGCAGGTC AAACTTGACC AGAAAGAAAC TGTCGCCAGC GAGGAAACCA GGGGATACAT TAAGTTTGCT ACCGGTATCC TGACGCCCCG GGAGATCAGC TTGATCTTCG ATCACCTGCC AGTAGACATA ACCTTTGTCG ACAAGGATAA TGTGGTCAAG TATTTTTCCA ATACCAGGGA GCGCATCTTT ACCCGCAGCC GGGCGGTTAT CGGCCGGCGG GTCGAAAACT GTCATCCCCC GGCCAGCGTC CAGGTGGTGG AGAAACTCAT TGCCGACTTC AAAAGCGGGC GTAAAGACCG GGAAGCCTTC TGGCTGCACC TGGGTGATAA GTATGTGTTT ATCCAGTATT TTGCCGTCCG GGACGAAAAA GGCGACTTTG CCGGCACCCT GGAGGTGACC ATGGACCTCA AGCCTCTCCA GGCCATTAGC GGTGAGAAGA GGATTATGGA TTAG
|
Protein sequence | MTELLNNQDY RKEALKEIIR ELHRGKSVEE VKARFNELIK DVAPAEISLM EQALINEGLP VEEVQRLCDV HAAVFKESLE RAPQPETIPG HPVHTFKEEN RALEDLMIRE IQPLLAELRR ANPDVEKDLA IKLAEKLNLL QDVNKHYSRK ENLLFPYLEK YQIVGPPKVM WGVDDEIRDL LKEARDLAVN YVPDKKEELI TRTEAALAKI KEMIFKEERI LFPMALETLT EDEWYRIMLD SASIGYCLIE PREDWRPAQV KLDQKETVAS EETRGYIKFA TGILTPREIS LIFDHLPVDI TFVDKDNVVK YFSNTRERIF TRSRAVIGRR VENCHPPASV QVVEKLIADF KSGRKDREAF WLHLGDKYVF IQYFAVRDEK GDFAGTLEVT MDLKPLQAIS GEKRIMD
|
| |