Gene Information |
Locus tag | Moth_1856 |
Symbol | |
ID | 3831487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1916998 |
End bp | 1918758 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637829788 |
Product | Alpha amylase, catalytic region |
Protein accession | YP_430699 |
Protein GI | 83590690 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
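The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene ends with a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 1916998
END_BP = 1918758
GENE_LENGTH_BP = 1761
PROTEIN_LENGTH_AA = 586


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values with the fields reported in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1918758 - 1916998 + 1 gives the reported 1761 bp, and 1761 / 3 - 1 gives the reported 586 aa.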
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTCCACC ATCGCCCGGG AATCCGTTTC TGCCAGCCCC TGGCACCCGA CCGGCTCCTC CTGCGCTTAA AAATTGGCCG CCGGGAACAG CAAAGCTGCC AGGTAATCTA TGAAGACCGG GGCCTTAAAA CGGCCCCCAT GCATGCCTAT GCCCGAACAC CCCGTTACCT TTATTACCAG GCTGAGATCA GCCTCTCCCG TCCCTGGCGC TGCCGTTATT TTTTCCGGCT GCGGGAAGGG GATGGGGATG ACCGCTATAT CTTTGCCGGT GGTACCGGCC AACAGGGCCG CCCCTTTACC TATCAGTGGA CCCCGGCAGA TATCTTTACC GTTCCCGACT GGATCTACGA CGCCGTTTCC TATCAAATAT TTCCCGATCG CTTTTACAAC GGCAACCCGG CCAACGACCC GCCGGGAACC AGGCCCTGGA GCGAAGCCCC TACCAGGGAA AACTTCTTCG GCGGCGACCT GGAGGGTATC CAGGCGAAGA TACCCTATCT AAAGTTTCTG GGGGTCAATG TCCTGTGGCT CAACCCCATT TTTGCCGCCT CCTCCAACCA TCGCTATAAC ACCCGCGATT ACCTGGCCGT GGACCCGGCC CTGGGAGATA CCGATACCCT GCGCCGCCTG GTGGCATCCC TCCATGGCGC CGGCATACGC ATCATCCTGG ATGGTGTCTT CAACCATACG GGGACCGATT TCTTTGCTTT TAAAGACGTG GTCGCCCGGG GAGCCGGTTC CCCGTATAAG GACTGGTACT ACTTCTATGA CTTCCCGGTA CGGAGTGAAC CCCGGGCCAA TTATGCCTGC TGGTGGGATA TCCCCAGCCT GCCCAAGCTC AACGTCAGAA ACCCCGAGGT TCGTAATTAC CTTCTTCACG TGGCGACCTA CTGGCTGTGG GATGCCGGTA CCGACGGCTG GCGCCTTGAC GTGCCCAATG AGATCGAGCC GCCCTTCTGG CGGGAGTTTT ACCAGCAGGT TAAAGGGACC AATCCGGAAG CCTACATTGT CGGCGAGATC TGGCGCGACG CCCGTTTCTG GCTGAACGGC CGTTACTTTG ACGGGGTGAT GAATTACCTC TTCCGGGACC TGGTCCTTGC ATACTTCGCC CGGAGGCTTT TTCCCATTTC CACCCTGGAT ATGCTCCTGG GCCTGGTGCG CCTGCGCTAC CCTGAGGCGG CCAATTTCGC CCTGTTAAAC CTCCTGGGCA GCCATGACAC GGCGCGAATT ATCACCGCCT TCCAGGAAGG GTTGGCCGGC GTTCCCGGAC ACTCCGGCAG CTACGCCGAG GCCGTAGCCC ACCTGCGACC GGCCCTCATC CTGCAGCTCA CCTATCCCGG TGCGCCCCTG ATCTACTACG GCGACGAGGT AGGCCTCACC GGCGGCCCGG ACCCCGACTG CCGCCGGACC ATGCCCTGGG AACCCCGGGA CTGGGACCGG GATCTCCTGA ACTTTTACCG GTGCCTGATA AGGTTGCGGC ACCAGTTGCG GCCTTTGCGA CGGGGATTTT TCCAGCCCCT TTTTACCGAC GACCAGGCCG AGGTCTACGC CTATGCCCGG CGCCTGGAGG GAGAGAAGGT AATTATAATT CTCAATGCCA GCGACCTCCC CCAGACGGTT ACCCTGGCGG CAGCCGTCGC GGGACTGGCT GAGGATAGTA CCTGGCGGGA CGGCCTGAGC AACCGCCTCC TGGAGGTGAA GGGCGGCCAG ATCAGCCTGC CCCTGGACGC CAATTCCGGG GCCGTTCTCT ACCAGGAGTA A
|
Protein sequence | MLHHRPGIRF CQPLAPDRLL LRLKIGRREQ QSCQVIYEDR GLKTAPMHAY ARTPRYLYYQ AEISLSRPWR CRYFFRLREG DGDDRYIFAG GTGQQGRPFT YQWTPADIFT VPDWIYDAVS YQIFPDRFYN GNPANDPPGT RPWSEAPTRE NFFGGDLEGI QAKIPYLKFL GVNVLWLNPI FAASSNHRYN TRDYLAVDPA LGDTDTLRRL VASLHGAGIR IILDGVFNHT GTDFFAFKDV VARGAGSPYK DWYYFYDFPV RSEPRANYAC WWDIPSLPKL NVRNPEVRNY LLHVATYWLW DAGTDGWRLD VPNEIEPPFW REFYQQVKGT NPEAYIVGEI WRDARFWLNG RYFDGVMNYL FRDLVLAYFA RRLFPISTLD MLLGLVRLRY PEAANFALLN LLGSHDTARI ITAFQEGLAG VPGHSGSYAE AVAHLRPALI LQLTYPGAPL IYYGDEVGLT GGPDPDCRRT MPWEPRDWDR DLLNFYRCLI RLRHQLRPLR RGFFQPLFTD DQAEVYAYAR RLEGEKVIII LNASDLPQTV TLAAAVAGLA EDSTWRDGLS NRLLEVKGGQ ISLPLDANSG AVLYQE
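The sequences above are printed in space-separated blocks of ten characters. The following sketch shows one possible way to turn such a blocked string into a contiguous sequence and compute its GC percentage, so that the GC content field can be checked against the gene sequence. The helper names are hypothetical, and the stop-codon set is the one used by translation table 11 (TAA, TAG, TGA). In that table TTG, the first codon of this gene, is also an accepted start codon and is translated as M at the initiation position.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # Only the first two blocks of the gene sequence are used here;
    # paste the full "Gene sequence" field to check the whole record.
    fragment = clean_sequence("TTGCTCCACC ATCGCCCGGG")
    print(len(fragment), gc_percent(fragment))
```

Applying `clean_sequence` to the full gene sequence field should yield a string whose length matches the Gene Length field, which gives a quick sanity check that nothing was lost in copying.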