Gene Information |
Locus tag | Moth_0666 |
Symbol | |
ID | 3832153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 698589 |
End bp | 699734 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637828605 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_429535 |
Protein GI | 83589526 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
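The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(698589, 699734)
print(length, protein_length_aa(length))  # prints "1146 381"
```

Both results match the Gene Length and Protein Length fields reported in the record.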
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000035657 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAAACAC CACGCTTGCA GCGAGTACTA ATATGTCGTT CAAATCCGGT GGCCCCTGAT CCGCGGGTAG AAAAACTAGC TGAAGCTTTG TGCAAGGCGA ACTATGAAGT TCTAGTGTTA GCATGGGATC GTACAGGGCA GTTGCCTCGT GAAGAAGAAC GTTCGGGATA TCGGATAGTA CGGTTTTCTC ATATCTCTCC TTATGGGTCG GGGTTGAAAA ATATTTTTAA ACAACTATCC TGGCAATTTT TCCTCTTGCG AAGGCTCATC CAGGAGGGAC AGGGCTTGGC TCTCATCCAT ATTTGTGATT TCGATACCCT TCTCCCATGC TTGCTGAGCA AGGCTCTTGG TGGCAAACGT ATCATTTACG ATATTTTTGA CCTTTACGCG GATAGTAGGC GTAACATACC TACCCTTATA CGCAAGTTGC TTCACGTGCT TGAATTTAAA GCTCTTGAAT GGGTCGATGC CGTTATTCTT GCAGATGAAA GCCGCAGGGA GCAAATCGCC GGAACTAGAC CTCGCCGGCT TATCACAATC TACAACAGTC CCCCAGATGT ATTAGATACC TTGAGGCGGA ACGGACCACC CCCCCGTCTG ACGGAGCTTT ACCTAGTTTA CGTCGGTCTC CTTCAAGTAG AACGGGGACT TTTAGAGATG ATGGAAATAC TTGGCCGTCA TCCGGAATGG CATTTGGATC TGGCCGGTCT TGGCGGGGGT GATGAAGAAC GCATTCTGGG CTTAGCTCGC TCGCTTCCCA ACGTTACCTG GCATGGACCA ATTATCTATA AGCGAGCCCT GAAGTTAAGT TATGCTGCTG ATGTCCTGAT CGCTACCTAT GACCCAACCA TACCCAACCA TCGCTACTCC AGCCCGAATA AAGTTTTTGA GGCTATGATG CTGGCTAAAC CTGTTGTAGT AGCCCGAAAC ACGGGCATAG ACCGTCTTGT TGAGAAGATT AATTGCGGCC TAGTCGTACC GTACGGGGAT GTGGCGGCCC TCGAAGCAGC ACTTATTCGC CTAGCCCGGG ACCCGGCTTT ACGGCAGCAG CTAGGGGAAA ACGGGAGGCG GGCTTATGAG GAAAAATACA GTTGGAATTT GATGCGCGAG CGTCTGCTTG CGTTATATAG GGAGTTATCC ATTTAA
|
Protein sequence | MQTPRLQRVL ICRSNPVAPD PRVEKLAEAL CKANYEVLVL AWDRTGQLPR EEERSGYRIV RFSHISPYGS GLKNIFKQLS WQFFLLRRLI QEGQGLALIH ICDFDTLLPC LLSKALGGKR IIYDIFDLYA DSRRNIPTLI RKLLHVLEFK ALEWVDAVIL ADESRREQIA GTRPRRLITI YNSPPDVLDT LRRNGPPPRL TELYLVYVGL LQVERGLLEM MEILGRHPEW HLDLAGLGGG DEERILGLAR SLPNVTWHGP IIYKRALKLS YAADVLIATY DPTIPNHRYS SPNKVFEAMM LAKPVVVARN TGIDRLVEKI NCGLVVPYGD VAALEAALIR LARDPALRQQ LGENGRRAYE EKYSWNLMRE RLLALYRELS I
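The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be removed before any downstream use. The following is a minimal Python sketch of that cleanup, plus a simple GC percentage calculation. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    cleaned = normalize(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First two ten-base blocks of the gene sequence, copied from the record.
first_blocks = "ATGCAAACAC CACGCTTGCA"
print(normalize(first_blocks))       # prints "ATGCAAACACCACGCTTGCA"
print(len(normalize(first_blocks)))  # prints "20"
print(gc_percent(first_blocks))
```

Passing the full Gene sequence field to `gc_percent` would give a value that can be compared with the GC content field, which the record rounds to a whole percent.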