Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1915 |
Symbol | |
ID | 3830839 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1984455 |
End bp | 1986041 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637829848 |
Product | glycoside hydrolase family protein |
Protein accession | YP_430758 |
Protein GI | 83590749 |
COG category | [S] Function unknown |
COG ID | [COG1543] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0191786 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCGTG GCTATGTCGC CCTGGTACTT CATGCCCACT TACCCTATGT CCGCGATACG GAAGACGATT TTTCCTTAGC GGAAAAATGG TACCACGAAG CCGTCACCGA AACCTATATA CCCTTAATAA ATATCTGCCA GCGCCTGAAC CGGGACCGAG TGCCTTACAG AATCACCATC TCCTTAAGCC CGCCCCTGGT GACCATGATG GCCGACCCCC TGGTTCAAGA ACACTACCGC CGCTACTTGG AACGCCTGCG GGAACTGGCT GCCCGGGAGG TCTGGCGTAC GCGGAATGAC CCCCGCTTTC ACCTGGTGGC CCGTATGTAC CAGGACCTAT TTGAAAATAC CGCCCGTACT TACCAGACTT ATGGCGGCAA TTTGATCAAC GCTTTCCGGG AGCTCCAGGA TAGCGGCAAG GTAGAGCTAA TCACCTGTGC CGCCACCCAC GGCTACCTGC CCCTCATAGG CCTGCAGCGG GAGGTGGTCC GGGCCCAGGT GGAAGTGGCG GTCAACAACC ATCGCCGCCT CTTCGGGCGG CCGCCGGCGG GTTTGTGGCT GCCGGAGTGC GCCTATAACC CGGGCGACGA CGCCATTCTG CGTGATTATG GCCTTAAATA CTTCTTTGTC GACGCCCACG GTCTCCTTTA CGCCACACCG CGGCCACGCT ACAGCATCTT CGCCCCCGTT TACACCCCTG CCGGGGTGGC GGCCTTCGGC CGTGACCTGG AGTCTTCGGA ACAGGTCTGG AGCGCCCAGG AAGGTTACCC CGGGGATTTC GACTACCGGG AATTTTACCG GGACATTGGC TACGACCTGG ATTTTGAGTA CATTAAACCC TACATCCATC CCTCCGGCCT GCGTCTGGAT ACGGGTCTTA AATATTATCG CATCACCGGC AAGTCCGGCT ATAAAGAACC CTATGTTCCG GAATGGGCCA GTTTCAAAGC CCATACCCAT GCCGGCAACT TTTTGTTCAA CCGGGAGCAG CAGATCAATT ACCTGGCCAC TTATATGGAC CGGCCGCCTT TAATCATCTG TCCCTATGAC GCCGAGCTCT TCGGTCACTG GTGGTTCGAG GGACCCCAGT GGCTGGAATC CCTTTTCCGC CAGGTGGCGG GTCTCGCCCC CCAGCCCTTT AGTTTTATCA CCCCCAGCGA GTACCTGGAG CGTTTTCCCG TCAACCAGCC GGCCACGCCC TGCATGTCCA GCTGGGGTAA CAACGGTTAT AACGAGGTCT GGCTGGAAGA TTCCAACCAT TGGATCTACC GCCACCTGCA CCATGCCGCC GCCGAAATGA TCCGCCTGGC CAACCAGCAC CCTACCGCCG GGGGCATCCT GCTGCGGGCC TTGAACCAGG CCGCCAGGGA ACTCCTGGTG GCCCAGAGCA GCGACTGGGC CTTCATCATG AAAACCGGCA CCATGGTCGA GTACGCCGTG AGCCGGACAA AAAAACACCT GCTCAATTTC TGGGAGCTCA CCCGTGGGAT TAATAAAAAC GACCTGGACC CGGCAAAGGT CCAGGCCCTG GAGGAGGCCA ATAATATCTT TCCGGATATA AACTATAGGA TTTTCGCCAG CAGGTAA
|
Protein sequence | MPRGYVALVL HAHLPYVRDT EDDFSLAEKW YHEAVTETYI PLINICQRLN RDRVPYRITI SLSPPLVTMM ADPLVQEHYR RYLERLRELA AREVWRTRND PRFHLVARMY QDLFENTART YQTYGGNLIN AFRELQDSGK VELITCAATH GYLPLIGLQR EVVRAQVEVA VNNHRRLFGR PPAGLWLPEC AYNPGDDAIL RDYGLKYFFV DAHGLLYATP RPRYSIFAPV YTPAGVAAFG RDLESSEQVW SAQEGYPGDF DYREFYRDIG YDLDFEYIKP YIHPSGLRLD TGLKYYRITG KSGYKEPYVP EWASFKAHTH AGNFLFNREQ QINYLATYMD RPPLIICPYD AELFGHWWFE GPQWLESLFR QVAGLAPQPF SFITPSEYLE RFPVNQPATP CMSSWGNNGY NEVWLEDSNH WIYRHLHHAA AEMIRLANQH PTAGGILLRA LNQAARELLV AQSSDWAFIM KTGTMVEYAV SRTKKHLLNF WELTRGINKN DLDPAKVQAL EEANNIFPDI NYRIFASR
|
| |