Gene Moth_1936 details


Gene Information

Locus tag: Moth_1936
Symbol:
ID: 3832428
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2010888
End bp: 2012351
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 59%
IMG OID: 637829867
Product: glycogen/starch synthases, ADP-glucose type
Protein accession: YP_430777
Protein GI: 83590768
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0297] Glycogen synthase
TIGRFAM ID: [TIGR02095] glycogen/starch synthases, ADP-glucose type
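
The coordinate and length fields in this record are related by simple arithmetic. The sketch below checks that relationship. It assumes the start and end positions are 1-based and inclusive (which is consistent with the reported gene length) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and do not come from the database.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_length = feature_length(2010888, 2012351)
print(gene_length)                           # 1464
print(expected_protein_length(gene_length))  # 487
```

Both values agree with the Gene Length and Protein Length fields of the record.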


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0962192
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000030545
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACAAAC CCTTGAAGAT CTTGCTGGTT TCTCCCGAGG TTGCACCCCT GGCCAAAACC 
GGCGGCCTGG CTGATGTGGC CGGTAGCCTG CCCAAAGCCC TGGCGGCCAA GGGCCACGAG
GTCAGGGTAG CCATGCCCCG TTACCGCCAG GTCAAGGAGG TTAACTACCT CACCGATCTG
CCGGTAGAGA TGGACGGCAG CCTGGAGACA GCCGTCATTC GCCAGGGGAA ACTGCCCGGG
GAAGCCGGGA TCCCGGTATA CCTGATCGAC AACTACAAGT TTTTCTACCG TGATGGCATG
TATGGTTACG GCGATGACGC CGCACGGTTC AATTTCTTCT GCAAAGCCGT GCTGTCCATG
CTGCCCTGGC TGGAGTTTCA GCCGGATATC ATCCATTGTA ACGACTGGCA GACCGGTCCC
ATACCCCTGT TCCTCAAGGT AAAGCACGAG GACAACCCTT TTTACCGGGA GACGGCAACC
ATCTATACCA TCCATAACCT GCAGTACCAG GGTACCTTTC CCCGCAACAT CCTCAAGACC
ATGGCCCTCA GCGAGGAATT CTTTGTCCCG GAACGCCTGG AGTTTTACGG GCAGGTCAGC
TATATGAAGG CCGGGATCCT GTACGCCGAC CTGGTGAACA CCGTCAGCAA GAAATACGCC
CTGGAAATCC AGACGCCGGA GTACGGGGAG CGCCTGGACG GCCTGCTCCG TAAAAGGGCA
GCCGACCTGA GGGGCATCCT GAACGGCATC GACTATGAGG AGTTCGACCC GGCCACCGAC
CGGCGCCTGG CAGTCAATTA CGACGCCGAT CACCTGGAGA AGAAAGGGGA AAACAAGGCG
GCCCTGCAGC GGGAGATGGA ACTGCCCGTC AGGGACGTCC CCGTCCTGGG CCTGATCTCC
CGCCTGGTGA GCCAGAAGGG TCTCGACCTC CTGGCCGCTA TCCTGGACCC ATTGATGCAA
CAGGACCTGC AGTTCGTCCT CCTGGGCAGC GGCGAGGACT ACTACCAGCA GCTTTTCTCC
CGATATAAGG TAAAATATCG CGATAAAATG GCCGTGAAAA TCGGCTTTGA CCCGGTCCTG
GCCCAGCATA TCTACGCCGG GTGCGATATC TTCCTGATGC CATCCCGGTT CGAGCCCTGC
GGCCTGGGGC AGATGATCAG CCTGCGCTAT GGTGCCGTCC CGGTGGTCAG GGCAACTGGC
GGCCTGGAGG ATACCATCAA AGACTTGCAC CAGTATCCGG GAGTGGGTAA CGGCTTTACC
TTCCGTGATT ACCAGCCCCA GGCCCTCCTG GATACCATCA ACCGCGCCCT GCACGTCTAC
CGCCACGAAC CCGGAGAATG GCGTAAACTG ATGCGGCGGG GCATGGCCGC CGATTTCTCC
TGGAGCGCTT CGGCCGGTCA CTACGAGGAA ATGTACCGCG AGGCCCTGGA GAAGAGGCGG
GCCGCCATGT TTAAGGTAGG GTAA
 
Protein sequence
MNKPLKILLV SPEVAPLAKT GGLADVAGSL PKALAAKGHE VRVAMPRYRQ VKEVNYLTDL 
PVEMDGSLET AVIRQGKLPG EAGIPVYLID NYKFFYRDGM YGYGDDAARF NFFCKAVLSM
LPWLEFQPDI IHCNDWQTGP IPLFLKVKHE DNPFYRETAT IYTIHNLQYQ GTFPRNILKT
MALSEEFFVP ERLEFYGQVS YMKAGILYAD LVNTVSKKYA LEIQTPEYGE RLDGLLRKRA
ADLRGILNGI DYEEFDPATD RRLAVNYDAD HLEKKGENKA ALQREMELPV RDVPVLGLIS
RLVSQKGLDL LAAILDPLMQ QDLQFVLLGS GEDYYQQLFS RYKVKYRDKM AVKIGFDPVL
AQHIYAGCDI FLMPSRFEPC GLGQMISLRY GAVPVVRATG GLEDTIKDLH QYPGVGNGFT
FRDYQPQALL DTINRALHVY RHEPGEWRKL MRRGMAADFS WSASAGHYEE MYREALEKRR
AAMFKVG
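
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own pipeline: it strips the 10-base block spacing, computes a GC fraction, and translates with the NCBI table 11 codon assignments (the translation table this record lists). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Alternative start codons are not treated specially, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... (bases ordered T, C, A, G),
# paired with the amino-acid string of NCBI translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases and last 24 bases of the gene sequence shown above.
head = clean("ATGAACAAAC CCTTGAAGAT CTTGCTGGTT TCTCCCGAGG TTGCACCCCT GGCCAAAACC")
tail = clean("GCCGCCATGT TTAAGGTAGG GTAA")

print(translate(head))  # MNKPLKILLVSPEVAPLAKT
print(translate(tail))  # AAMFKVG
```

The two printed fragments match the first 20 and last 7 residues of the protein sequence. Passing the full cleaned gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the reported GC content.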