Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1543 |
Symbol | |
ID | 3831929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1586839 |
End bp | 1588158 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829475 |
Product | hypothetical protein |
Protein accession | YP_430395 |
Protein GI | 83590386 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.124081 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATTG GTAACTATGA GCTCAGCCGC CGGGAAGTGA GGTTGCTGGG TATAATGGCG GCAGTTTTAC TTGCTTTTTT CTTTTATCAC GTGGTCTGGG GAAAACAGCT TCCCGCCTAC AGGGAGGTCA GGAGCCGCCT TCTGGCGGAC CAGGCACGCC TGGCTGCCGC CCAGCAGGCA GCGGCTACGG CACCATCCCT GACAAAAAAT GCCGAACAGG CCCGGGCCGC CTGGGAGGCT ACCAGACAGC GACTGGGCTT TACCCTCCAG GGTACCTCAG CCTTTCTGGA TGCTGCTCAA CCCCGTGACC CGGCCCTCCG GATCCTGGTC TTCAAGCCCC TCCCGGTTGA AAAAAGAGAT CCCTTTCAGG TATATCCCTA TGAAGTCACC GTCAGCGGCC CTTATCCGGC GTTACAGGAT TATATAAGCC AGCTAGAATC CCTGCCGGCC CTGACGGCCA TCCACAATTT GAAGATCCTT GCACGCCAGG GAAACGCTGC AACAGTCGAA GCCAGCTTTA TTATCGACCT CTACGACCTG GGCGAGACTG TACCGGTACC GGCTGCGGTG GCCCTTTTCC CCGGCGGCAG GGCCGACAGC TTTGCCCCAC CGCCAGGGGT TGCCTCGCCG GTAGGGGGAG GGACGGTAAC GGCCTCCCCC GGCCAGGGAG TCCAGGTAAA CAGCAGCCAG GGCAGCCAGC AGGCACCAAC AAAACCCGGT CCGGCAGGCA GCCCGCCACC GGCGTCTTCC CAACCTTCGG CCCGGACGTC ACCTTCCGCA GCTCCGTCCC AGCCGGCTGA GGAGATTGGC GGGACGCCGG CCTATACCCT GCCCCGCCAG CAGGGGGGAC GGCTGGTTCC GGGTCCGGCT TTTACGGACC CTCCGGCCGG CAACGGTGAT GTCTGGCTGG ACGAGCTGCG CGTGTTGCGG AACGTCGGTC CTTTCTTCGT CCTCTCCAGG CCGGCAGCCC TGGCGGGTAT GAACCTGGGC CGCAGTATAG GCGTAAATTT AAGCAAAGGT CAAACTAAAG CCGAATTAAA GGTCGATCTC CGCGGCCGGT ACACCCGTCT CCAGGGGTAT ACCGGCATCG ACGATAGCTT TGCCAACAGC AGCGGCAAAG TTAAAGTAAC CATTTTTGCC GATGGCCGCC AGATTTATCA GGGGGAAATC AAGCCGGGGG ATTACCCCCG GTACCTGGAG TTACCCCTCT TTCTAGTCCG GCAACTGACC TTCAGCCTGG AATGGCAGGC TGGTGATACT GGTAGCTACG ATCAATTACT GGCTACCCTG GCCAGCATTC ATTTTTCTCG CCAGCCCTAG
|
Protein sequence | MKIGNYELSR REVRLLGIMA AVLLAFFFYH VVWGKQLPAY REVRSRLLAD QARLAAAQQA AATAPSLTKN AEQARAAWEA TRQRLGFTLQ GTSAFLDAAQ PRDPALRILV FKPLPVEKRD PFQVYPYEVT VSGPYPALQD YISQLESLPA LTAIHNLKIL ARQGNAATVE ASFIIDLYDL GETVPVPAAV ALFPGGRADS FAPPPGVASP VGGGTVTASP GQGVQVNSSQ GSQQAPTKPG PAGSPPPASS QPSARTSPSA APSQPAEEIG GTPAYTLPRQ QGGRLVPGPA FTDPPAGNGD VWLDELRVLR NVGPFFVLSR PAALAGMNLG RSIGVNLSKG QTKAELKVDL RGRYTRLQGY TGIDDSFANS SGKVKVTIFA DGRQIYQGEI KPGDYPRYLE LPLFLVRQLT FSLEWQAGDT GSYDQLLATL ASIHFSRQP
|
| |