Gene Information |
Locus tag | Moth_2233 |
Symbol | |
ID | 3831279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2329417 |
End bp | 2330346 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637830153 |
Product | hypothetical protein |
Protein accession | YP_431063 |
Protein GI | 83591054 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000000107793 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000113425 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTGGTG GCGGTTGGTA TTATTGGTCA AAGCATACTG ATATCGAAAA GGTCCTACGT GAAACAGAGT CTAATGAGAA GAGAATCGAG TACGAGGGAG AAGTCAATAG GTATCTACAA TCGTTACTAA CTAACTACAA TAATCGAAAG ACAGATGAAA TTCGGCAACA CCTGGAAACG ATTAGAGGGG CCTTGGATAA AAACATTGAA GGTGCCATAG AGCTTGTTTT CGGTGGTTCT TTAAGCAAGC ATACCTATGT AAACGGCCTT AGCGACATTG ATATGCTTGT TAGAATAAAT GATACTTCGT TAGCTAATGC TTCACCTGCT GACATCAAGG CCTACTTTGC TGAGCGACTT CTAAAGCGGC TTCCCAATAC CGAAGTAGAG GTAGGAAAGC TAGCCGTAAC TGTAAGGTTT TCTAGCACAG GCCATGAAAT TCAACTTCTA CCAGCCTTGC AGACCAAGAC CGGGGTAAGA ATCGCGGATC CCAATGGTGA TGGATGGAGC AAAGTTATTA GACCTATAAA ATTCGCCGAA AAGCTCACTA ACGTCAACCA GTCATGCAAC CATAAGGTTA TTCCAGTCAT TAAGCTTTTC AAAGGGCTTA ACGAGTCACT GCCAGAGAAT GTACGGTTAT CAGGTTATCA TATCGAATCA CTAGCTATTA AAGCCTTTGA AGGCTATACT GGTCGAAAAA CATATAAGGA TATGCTTCAG CATTTTTGCC GACAAGCTAT AAAACTGGTA CTATCTCCAA TCGTTGATTC TACGGGTCAA TCTGTTCATG TAGATGAGTA CTTAGGCGCA TCAAACAGCT TAAGGCGTAG GCAGTGTAGC GCAGCTTTAG AGAGACTTTC AAATCGGCTC AGCCGGGCCG ATGCACACAT GAACATCGAG CGCTGGAAGG AAATGTTCGA AAATGAATAA
|
Protein sequence | MSGGGWYYWS KHTDIEKVLR ETESNEKRIE YEGEVNRYLQ SLLTNYNNRK TDEIRQHLET IRGALDKNIE GAIELVFGGS LSKHTYVNGL SDIDMLVRIN DTSLANASPA DIKAYFAERL LKRLPNTEVE VGKLAVTVRF SSTGHEIQLL PALQTKTGVR IADPNGDGWS KVIRPIKFAE KLTNVNQSCN HKVIPVIKLF KGLNESLPEN VRLSGYHIES LAIKAFEGYT GRKTYKDMLQ HFCRQAIKLV LSPIVDSTGQ SVHVDEYLGA SNSLRRRQCS AALERLSNRL SRADAHMNIE RWKEMFENE
|
| |
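The coordinate and length fields in this record are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon. Neither convention is stated in the record itself, and the function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp rows of this record.
length = gene_length(2329417, 2330346)
print(length, protein_length(length))  # prints "930 309"
```

Under those assumptions the result matches the Gene Length (930 bp) and Protein Length (309 aa) rows above.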