Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1854 |
Symbol | |
ID | 3831485 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1914369 |
End bp | 1916117 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637829786 |
Product | glycoside hydrolase family protein |
Protein accession | YP_430697 |
Protein GI | 83590688 |
COG category | [S] Function unknown |
COG ID | [COG1543] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTACCC GGTCGGGTGA TAACCTAGGT TATCTGTCGC TGGTGCTCCA TGCTCATCTG CCCTTTGTCC ATAACCGGGA GCCCTATGTC AGCCTGGAAG AGAAATGGCT TTTCGAAGCC CTGACGGAGT CCTACCTGCC CTTGATCCTG AGCTGGGAGG AGCTGGCCGG CGAGGGGTTA GACTTTCACC TCACCCTCTC CCTGTCGCCG CCCCTCATCA GCATGTTGAT GGAACCTGCC CTGGGGGAGC GCTACGGTCG TTACCTGGAT AACCTCAGGG AACTGGCCGG CCGGGAAATT GAGCGGACCA GGGGTGACCC GACTTTCGCC CCCCTGGCGG AGTTTTATCA CCGGCGCCTG ACTCTCGTCG ACCGGGCCTT CAAAGAAACC TACCGGGGGA ATTTACTGGC GCCTATAAAA AGGCTGCGGG AACAGGGTCG CCTGGAACTG ATAACCACCG CGGCTACCCA TGGTTACCTT CCCCTGATGC TTACCGATGA GGCCAGGCGA GCCCAGGTGC GGGCAGCCCT GGATCTTTTC GGCATGACCA TGGGCTTTGT CCCGGACGGG CTCTGGCTGC CGGAGTGCGG CTATACCCCA GGGATTGAAA AAATCCTCCG GTCCGAGGGG ATTAAGTATT TCATCGTCGC CAGCCACGGC ATGTTAAATG CCACCCCCGT GGTAAAATCG GCTGTTTATG CTCCGGTCAG GGTGGGCGGC GTGGCTGTCT TCGGGCGGGA CTGGGAGACT TCCCACCAGG TTTGGAGCCG GACCGAGGGT TATCCCGGAG ATCCGGTGTA CCGCGAGTTC TACCGGGATA TTGGTTACGA CCTGGATTTC AATTACCTGG CACCCTACCT GGTGGGGGGG ATCCGGGGAG ATACCGGCTT TAAATACTAC CGGATTACCG GTAAAACCGG GGTCAAGGAG CCCTATGACT ACCGGGCCGC CCGGGAGCGG GCCCGGGAGC ATGCCCGGGA CTTTATCGCC AACCGGGAGA AACAGCTGGC TTACTGGGCC GGCAGGACCC AGGATAAGCC TGTCGTCGTC GCTCCCTATG ACGCCGAGCT CTTTGGCCAC TGGTGGTTTG AGGGGCCGGA CTGGCTGGCC GATGTCCTGC GCCTGGCAGG CGAAAGCCGG GTCTCCCTGA CCTCTCTTTC GGCTTACCTG GAGCAGTATC CGCCCCGCCA GGAAGTGACC ATGGGTCCTT CCAGCTGGGG AGAAGGGGGT TATAACCACG TCTGGTTGAA CCAGGCCAAC GACTGGCTTT ATCTCCACCT CCACCGGGCG GAGAGGGCCA TGATCAAGCT GGCTGCCGCC AATCCCCGCC CCGGCTCCCT GCAGGAGCGG GCACTGAACC AGGCTGCCAG GGAATTGCTC CTGGCCCAGA GTAGCGACTG GTCCTTTATC CTCACCACCG GGACGACGGT GGACTATGCC CGGCGTCGCC TCCGGGAACA CCTGGGGGCT TTTTTTAAAC TCTGCCAGGA TTATGAGCGA GATCGCCTGG ATGAAGATTT TCTGGCCCGC CTGGAGGCGG CGGACAATAT CTTCCCCGGC CTCGATTTTC GCCTCTACCG GCCAGCCGGA AGGGGAGTAG CCTGCCGGCC CGAGGTTCAT AATAAAACCA GGCCGGGAAT TCTTATGCTT AGCTGGGAAT TCCCACCCCG CCATGTTGGC GGCCTGGGTA TCCACGTCCG GGACTGGGCC AGGCCCTGGC CCGCCAGGGG GTGGATGTCC ACGTCCTGA
|
Protein sequence | MVTRSGDNLG YLSLVLHAHL PFVHNREPYV SLEEKWLFEA LTESYLPLIL SWEELAGEGL DFHLTLSLSP PLISMLMEPA LGERYGRYLD NLRELAGREI ERTRGDPTFA PLAEFYHRRL TLVDRAFKET YRGNLLAPIK RLREQGRLEL ITTAATHGYL PLMLTDEARR AQVRAALDLF GMTMGFVPDG LWLPECGYTP GIEKILRSEG IKYFIVASHG MLNATPVVKS AVYAPVRVGG VAVFGRDWET SHQVWSRTEG YPGDPVYREF YRDIGYDLDF NYLAPYLVGG IRGDTGFKYY RITGKTGVKE PYDYRAARER AREHARDFIA NREKQLAYWA GRTQDKPVVV APYDAELFGH WWFEGPDWLA DVLRLAGESR VSLTSLSAYL EQYPPRQEVT MGPSSWGEGG YNHVWLNQAN DWLYLHLHRA ERAMIKLAAA NPRPGSLQER ALNQAARELL LAQSSDWSFI LTTGTTVDYA RRRLREHLGA FFKLCQDYER DRLDEDFLAR LEAADNIFPG LDFRLYRPAG RGVACRPEVH NKTRPGILML SWEFPPRHVG GLGIHVRDWA RPWPARGWMS TS
|
| |