Gene Information |
Locus tag | Moth_1019 |
Symbol | |
ID | 3832639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1047621 |
End bp | 1049006 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637828947 |
Product | hypothetical protein |
Protein accession | YP_429876 |
Protein GI | 83589867 |
COG category | [S] Function unknown |
COG ID | [COG1434] Uncharacterized conserved protein |
TIGRFAM ID | |
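The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported 1386 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record for Moth_1019.
length = gene_length(1047621, 1049006)
print(length, protein_length(length))  # 1386 461
```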
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00000506367 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.00000000514024 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGAAAAAAA ACATATCCAT AGCAATAATT TTTGCCGCCT TAATTACCTT GATATTAGGT AGTACGTACT GGTATATCGA ATGGTTTGGG TGGAACACGG CTCCCAAGCG AGCTGATGTG ATTATAGTGT TAGGTGCTGC TGTTTGGGCG AATGGTCCAA GCCCGGCTTT AATGGAACGT ATTACACTGG CCGAAACTCT TTATCAACAG GTATATGCAG CGACGCTAAT TACGACAGGT GGCATTGGGA GATCTAATCC TACCCCGGAA GGAAGCGCCG CCCGGCAGGT TCTTATTTCC CATGGTATAC CTGCCAATGT AATTTATGAG GAAGTCACTT CGAGCAACAC CAGGGAAAAC CTGGTTGGGG CCCTCAACAT TATGCGCAAA CACGGTTGGA AAAGTGCCGT TATCGTCACC CATGATTTTC ACCTTTTACG GGCCATGACC GAAGCTCGCC GGCTAGGCAT AGAGGTTTCC GGCGCCGGTG TCCATGAAAC AGCCATGTTT AGGCCGCCGC TGGTACTACG AGAGGTAATC GCTAACCTGG TTAAAGCGAT CGGGTATAAC TTGCAAATGT ACCGCATGGA AGAAGGGGAT AATGTGCTCG CCGGCAACAA TCGGATCCTG TTGAAAGCAT TAATCGTTAT GACCTTACTA TCAATCCTCG CCTTTACAGC CTGCGCCCGA TCTTCTCAAT CGCCAAAAGA AATCGAGGAG GCAGTAGCCC TTGTAAATGG TCAACCAATA AATAAGGAAG CTCTGGAAAA AGAAATGCTT AGAATGCAAT TAATGGCTGA AATGAGGGTT CAATCAGGAA CTGTTTCTAT AGACGAGTTT CTTAAGCAAT CCGGACGGGA CTGGTCCAAG ATGTCGCCAG AAGAAAAGCG TTACTACCTG CGGGCAAAAC GTCAAAGCGA AATGACAGGG GAGAAGAATG AGGCTTTTAA CCGGCTGGTG CGGGAAGAGG TTCTGTACCA GGAAGCGGTT AAAGAGGGAT ATGAAGTTTC TATAGACGAG GCCCGGCGGC GTTACCAGGA AATAGAGACC CTTTCCCAGG AATCCCTAAA AGAGGCGCTT AAAGACGCAA AGGCCAAGGA AGAGATAGAA AGGCTGCAAG AGGTTGAAAA GAAGTTCATG GAATTGATGG GCTTTACCAG TCCGGAAGCG CTAACAGAAT ACCGGGTGCA AAGGCTCATG CGAACCATGC CCATTAGCCG TTTACGGGAA AAGTTTAAAG CGGATTGGGG CAATAAACAC CCGGAAATCC GCGGGGACGA GTTCCGGTAC ATGGTTGAAA ATCGCTGGGA AGATTATACT AACGAACTTT TGCGCCAGGC TAATATTCGT ATCAAAGACA AAGACCTCGA GGTTATTTAT GAGTAG
|
Protein sequence | MKKNISIAII FAALITLILG STYWYIEWFG WNTAPKRADV IIVLGAAVWA NGPSPALMER ITLAETLYQQ VYAATLITTG GIGRSNPTPE GSAARQVLIS HGIPANVIYE EVTSSNTREN LVGALNIMRK HGWKSAVIVT HDFHLLRAMT EARRLGIEVS GAGVHETAMF RPPLVLREVI ANLVKAIGYN LQMYRMEEGD NVLAGNNRIL LKALIVMTLL SILAFTACAR SSQSPKEIEE AVALVNGQPI NKEALEKEML RMQLMAEMRV QSGTVSIDEF LKQSGRDWSK MSPEEKRYYL RAKRQSEMTG EKNEAFNRLV REEVLYQEAV KEGYEVSIDE ARRRYQEIET LSQESLKEAL KDAKAKEEIE RLQEVEKKFM ELMGFTSPEA LTEYRVQRLM RTMPISRLRE KFKADWGNKH PEIRGDEFRY MVENRWEDYT NELLRQANIR IKDKDLEVIY E
|
| |
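The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned, its GC percentage computed, and its start translated is given below. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are permitted). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and only the first 30 nucleotides of the gene are used as sample input.

```python
from itertools import product

# Codon assignments shared by the standard code and table 11,
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence as displayed in the record.
SAMPLE = "ATGAAAAAAA ACATATCCAT AGCAATAATT"
print(translate(SAMPLE))  # MKKNISIAII
```

The translated sample matches the first block of the protein sequence in the record. Passing the full gene sequence field to `gc_percent` and `translate` would allow the GC content and protein sequence fields to be checked in the same way.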