Gene Moth_1856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1856 
Symbol 
ID3831487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1916998 
End bp1918758 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content61% 
IMG OID637829788 
ProductAlpha amylase, catalytic region 
Protein accessionYP_430699 
Protein GI83590690 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTCCACC ATCGCCCGGG AATCCGTTTC TGCCAGCCCC TGGCACCCGA CCGGCTCCTC 
CTGCGCTTAA AAATTGGCCG CCGGGAACAG CAAAGCTGCC AGGTAATCTA TGAAGACCGG
GGCCTTAAAA CGGCCCCCAT GCATGCCTAT GCCCGAACAC CCCGTTACCT TTATTACCAG
GCTGAGATCA GCCTCTCCCG TCCCTGGCGC TGCCGTTATT TTTTCCGGCT GCGGGAAGGG
GATGGGGATG ACCGCTATAT CTTTGCCGGT GGTACCGGCC AACAGGGCCG CCCCTTTACC
TATCAGTGGA CCCCGGCAGA TATCTTTACC GTTCCCGACT GGATCTACGA CGCCGTTTCC
TATCAAATAT TTCCCGATCG CTTTTACAAC GGCAACCCGG CCAACGACCC GCCGGGAACC
AGGCCCTGGA GCGAAGCCCC TACCAGGGAA AACTTCTTCG GCGGCGACCT GGAGGGTATC
CAGGCGAAGA TACCCTATCT AAAGTTTCTG GGGGTCAATG TCCTGTGGCT CAACCCCATT
TTTGCCGCCT CCTCCAACCA TCGCTATAAC ACCCGCGATT ACCTGGCCGT GGACCCGGCC
CTGGGAGATA CCGATACCCT GCGCCGCCTG GTGGCATCCC TCCATGGCGC CGGCATACGC
ATCATCCTGG ATGGTGTCTT CAACCATACG GGGACCGATT TCTTTGCTTT TAAAGACGTG
GTCGCCCGGG GAGCCGGTTC CCCGTATAAG GACTGGTACT ACTTCTATGA CTTCCCGGTA
CGGAGTGAAC CCCGGGCCAA TTATGCCTGC TGGTGGGATA TCCCCAGCCT GCCCAAGCTC
AACGTCAGAA ACCCCGAGGT TCGTAATTAC CTTCTTCACG TGGCGACCTA CTGGCTGTGG
GATGCCGGTA CCGACGGCTG GCGCCTTGAC GTGCCCAATG AGATCGAGCC GCCCTTCTGG
CGGGAGTTTT ACCAGCAGGT TAAAGGGACC AATCCGGAAG CCTACATTGT CGGCGAGATC
TGGCGCGACG CCCGTTTCTG GCTGAACGGC CGTTACTTTG ACGGGGTGAT GAATTACCTC
TTCCGGGACC TGGTCCTTGC ATACTTCGCC CGGAGGCTTT TTCCCATTTC CACCCTGGAT
ATGCTCCTGG GCCTGGTGCG CCTGCGCTAC CCTGAGGCGG CCAATTTCGC CCTGTTAAAC
CTCCTGGGCA GCCATGACAC GGCGCGAATT ATCACCGCCT TCCAGGAAGG GTTGGCCGGC
GTTCCCGGAC ACTCCGGCAG CTACGCCGAG GCCGTAGCCC ACCTGCGACC GGCCCTCATC
CTGCAGCTCA CCTATCCCGG TGCGCCCCTG ATCTACTACG GCGACGAGGT AGGCCTCACC
GGCGGCCCGG ACCCCGACTG CCGCCGGACC ATGCCCTGGG AACCCCGGGA CTGGGACCGG
GATCTCCTGA ACTTTTACCG GTGCCTGATA AGGTTGCGGC ACCAGTTGCG GCCTTTGCGA
CGGGGATTTT TCCAGCCCCT TTTTACCGAC GACCAGGCCG AGGTCTACGC CTATGCCCGG
CGCCTGGAGG GAGAGAAGGT AATTATAATT CTCAATGCCA GCGACCTCCC CCAGACGGTT
ACCCTGGCGG CAGCCGTCGC GGGACTGGCT GAGGATAGTA CCTGGCGGGA CGGCCTGAGC
AACCGCCTCC TGGAGGTGAA GGGCGGCCAG ATCAGCCTGC CCCTGGACGC CAATTCCGGG
GCCGTTCTCT ACCAGGAGTA A
 
Protein sequence
MLHHRPGIRF CQPLAPDRLL LRLKIGRREQ QSCQVIYEDR GLKTAPMHAY ARTPRYLYYQ 
AEISLSRPWR CRYFFRLREG DGDDRYIFAG GTGQQGRPFT YQWTPADIFT VPDWIYDAVS
YQIFPDRFYN GNPANDPPGT RPWSEAPTRE NFFGGDLEGI QAKIPYLKFL GVNVLWLNPI
FAASSNHRYN TRDYLAVDPA LGDTDTLRRL VASLHGAGIR IILDGVFNHT GTDFFAFKDV
VARGAGSPYK DWYYFYDFPV RSEPRANYAC WWDIPSLPKL NVRNPEVRNY LLHVATYWLW
DAGTDGWRLD VPNEIEPPFW REFYQQVKGT NPEAYIVGEI WRDARFWLNG RYFDGVMNYL
FRDLVLAYFA RRLFPISTLD MLLGLVRLRY PEAANFALLN LLGSHDTARI ITAFQEGLAG
VPGHSGSYAE AVAHLRPALI LQLTYPGAPL IYYGDEVGLT GGPDPDCRRT MPWEPRDWDR
DLLNFYRCLI RLRHQLRPLR RGFFQPLFTD DQAEVYAYAR RLEGEKVIII LNASDLPQTV
TLAAAVAGLA EDSTWRDGLS NRLLEVKGGQ ISLPLDANSG AVLYQE