Gene Moth_1852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1852 
Symbol 
ID3831713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1910654 
End bp1913218 
Gene Length2565 bp 
Protein Length854 aa 
Translation table11 
GC content63% 
IMG OID637829784 
ProductAlpha-glucan phosphorylase 
Protein accessionYP_430695 
Protein GI83590686 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0058] Glucan phosphorylase 
TIGRFAM ID[TIGR02094] alpha-glucan phosphorylases 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCCTT CACCCCATAT CTTCTTTGTC CAGCCGGTTT TGCCGGAAAA ACTTAAGCCC 
CTCCGGGATC TGGCCGCTAG TCTTTACTGG CCCAGGTACC CGGAAACAGC GGCCCTTTTC
CGGGACCTGG ACCCCGACCT CTGGGAGGAG ACCGGTCATA ACTCCGAACT CTTACTGAGG
TTACTGCCGG CGGCCAGATT AACGGCGGTC GCCAGGGACC CTCAATATAC CGGCAGGCTG
GAACAGGTGT GGCAGCAGTA CCGCACCTAC CTGGGCGCCC CGGTGAACCG GGGCAACCTT
CCCGGGCTGG AAAGCAATCA GGTTATTGCC TATTTTTCTG CCGAGTTCGG CCTGAGCGAG
GCCCTGCCCA TATATGCCGG GGGGCTGGGG TTCCTCGCCG GAGACCACCT GAAGTCGGCC
AGCGACCTGG GCCTGCCCTT GGTAGGGGTA GGCCTCCTGT ATCGCGAGGG TTACTTCCGC
CAGCGCCTCG ACCGCCGGGG GCAGCAACGG GAGGTTTATT CCCGCTATGA TTTCTACCAG
CTACCCCTGG AACTGGAGCG CCGGGCCGAC GGCTCGCCCC TGGAGGTGCA GGTGAACTTT
CCGGACCCGG ATCGCCAGGT CCGGGCCAGG GTCTGGCGGG CTCGGGTCGG CCGCCTCAAC
CTTTACCTCC TGGACAGCGA TTGCCCCGGC AACCTGGAGG CGGACCGGTT GATTACCGAC
CGGCTTTACG GCGGCGACCT GGAAAAACGG ATTCAGCAGG AGATCCTCCT GGGCATCGGC
GGCGTGCGGG CCCTGGCGGC CCTGGGTATC AACGCCACTA TCTTTCACCT GAATGAAGGC
CATTCGGCCT TCCTGGGCCT GGAAAGGATC CGGCAGTTGC AGGCGCGGTA TGGACTAGAT
CTGGCGTCGG CCCGGGAACT GGCTACCTGC AGTAACATCT TTACCACCCA TACCCCGGTG
CCGGCCGGTA TCGATGTCTT TCCCCCTTAT CTAATGGACA AATATTTCAC CGAATATTAC
CAGTCCCTGG GTCTTTCCCG CCACGAATTC CTGGCCCTGG GCCGGCAGGA CCCCAATAAC
CAGCAGGAAC CCTTCAATAT GGCCGTACTG GCCTTACGCC TGTCGGCCTG GGCCAACGGC
GTCAGCCAGC TCCACGGCCA GACGGCACGC CGGATGTGGC AGGTAATTTG GCCGGGAGTG
CCGGTAGCCG AGATTCCCAT TGGCGCCATT ACCAACGGTG TCCATACCAC CTCCTGGGTG
GGAGAAAAGA TGGCCGCCCT CTACGATCGC TACCTGGGAA CGGCCTGGCG GGAGGACCCG
GCCTCCCCGG CCAGCTGGGC GGGGGTGGCA GGCATACCGG CCCGGGAACT CTGGCAGGTT
CACGAAGAAC AGCGGCGGGA ACTCCTGGCC TTTGCCCGGC GGCGATTGGC GGCCCAGCTC
CGGGAGTGGG GGGCCGGGCC CAGGGAGATA GCCGCCCTGG AAGGGGTTCT CGATCCAGGG
GCATTGACCA TCGGTTTTGC CCGCCGTTTC GCTGCCTATA AACGGCCGGC CCTGCTCCTG
CGTAACCCGG AACGTTTGGC CAGAATCCTG GGGGACTCCC GGCGGCCGGT CCAGATCATC
TATGCCGGCA AGGCCCACCC CAGGGACGAA GAGGGGAAGG AGCTCATCCG GCAGATAACG
GCCTTTACCA GGGAGGAGGC CTTCCGTGGT CGTTTGATCT TCCTGGAGGA TTATGGCCTG
CAGGTGGCCC GTCATCTGGT CCAGGGTGTC GACTTGTGGC TGGGTAACCC GCGACGGCCC
CTGGAGGCCA GCAGTACCAG CGGCATGAAG GCCGTTCTGA ACGGCGCCCT CCATGCCAGC
ACCCTGGATG GCTGGTGGGC CGAAGCCTGG ACCCCGGATA CCGGCTGGGC CATTGGTTCC
GGCGCGGTTT ACGAGGATAC CGGCTACCAG GACGCGGTAG AAGGGGAGGC CCTTTATAAC
TTGCTGGAGA AGGAGATAGT CCCCCTCTTT TACGAGCGAG ACGCCGGGGG CCTGCCCGCG
GCCTGGGTGG AGATGATGAA AAGGTCCATC AGGGCCTACG GGCCGGTGTT CAATACCCAC
AGGATGGTGG CGGAATACAG CCGGCAGTTT TACGAACCTG CCGCCCGGCT TTACCAGAGG
TTGCAGGCCG GCAACCAGGA GCGGTTAAAG GAGTTGGCCG GCTGGCGGAG CCGGGTCCGG
GACTGGTGGC CGGCCGTGAG GATTGAGGGG GTGGAGGACG ACAGGGAAGG GGATCTGATC
GTGGGTGGTA GCCTGAAGGT CCGGGCCCGG GTTTTCTTGG GCTTCCTACA GCCGGAGGAT
GTGACGGTAG AACTCTATTA CGGCCCGGTT GACGCCGGCG GTGAAATTGT GACCGGAGAG
AAGGAGACCA TGATCCAGTA CCAGGATCAG GGCGAAGGAA GGTACCTGTA TAGCGGCGTC
ATCCCCTGCC GTCGAGGGGG ACGCCAGGGC TACAACCTGC GGGTGTTGCC GCGTCATGCA
GACCAGGTTC ATCCCTACCA GAGCGGCCTG ATTCTTTGGG GTTAG
 
Protein sequence
MMPSPHIFFV QPVLPEKLKP LRDLAASLYW PRYPETAALF RDLDPDLWEE TGHNSELLLR 
LLPAARLTAV ARDPQYTGRL EQVWQQYRTY LGAPVNRGNL PGLESNQVIA YFSAEFGLSE
ALPIYAGGLG FLAGDHLKSA SDLGLPLVGV GLLYREGYFR QRLDRRGQQR EVYSRYDFYQ
LPLELERRAD GSPLEVQVNF PDPDRQVRAR VWRARVGRLN LYLLDSDCPG NLEADRLITD
RLYGGDLEKR IQQEILLGIG GVRALAALGI NATIFHLNEG HSAFLGLERI RQLQARYGLD
LASARELATC SNIFTTHTPV PAGIDVFPPY LMDKYFTEYY QSLGLSRHEF LALGRQDPNN
QQEPFNMAVL ALRLSAWANG VSQLHGQTAR RMWQVIWPGV PVAEIPIGAI TNGVHTTSWV
GEKMAALYDR YLGTAWREDP ASPASWAGVA GIPARELWQV HEEQRRELLA FARRRLAAQL
REWGAGPREI AALEGVLDPG ALTIGFARRF AAYKRPALLL RNPERLARIL GDSRRPVQII
YAGKAHPRDE EGKELIRQIT AFTREEAFRG RLIFLEDYGL QVARHLVQGV DLWLGNPRRP
LEASSTSGMK AVLNGALHAS TLDGWWAEAW TPDTGWAIGS GAVYEDTGYQ DAVEGEALYN
LLEKEIVPLF YERDAGGLPA AWVEMMKRSI RAYGPVFNTH RMVAEYSRQF YEPAARLYQR
LQAGNQERLK ELAGWRSRVR DWWPAVRIEG VEDDREGDLI VGGSLKVRAR VFLGFLQPED
VTVELYYGPV DAGGEIVTGE KETMIQYQDQ GEGRYLYSGV IPCRRGGRQG YNLRVLPRHA
DQVHPYQSGL ILWG