Gene Moth_1850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1850 
Symbol 
ID3831711 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1908171 
End bp1910243 
Gene Length2073 bp 
Protein Length690 aa 
Translation table11 
GC content65% 
IMG OID637829782 
Product4-alpha-glucanotransferase 
Protein accessionYP_430693 
Protein GI83590684 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1640] 4-alpha-glucanotransferase 
TIGRFAM ID[TIGR00217] 4-alpha-glucanotransferase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGGAA CCTTGTCAGA TAAAGAGTTG CGTCTTTTGC ACCGACTATG CCGGTGGTAC 
GGCGTGGAGC CTGCCTATCG CGATGGGGAG GGTAAACTCA GGAGGGCCGG GCCGGAGTCG
TTGCTGGCGG TATTGCGGGC CCTGGGGGCA CCGGTGGCAG GCCTGGCTGA CCTTCCCGGT
GCCCTCCGGG AGCGGCGGCA GCAATACTGG CGGCGCTGTT GTGAGCCGGT AGCCGTAGCC
TGGGCCGGCC GGTTGTCGCA TATGGAACTG CGTCTCCCGG CCGGTCGGGC AACCGGGCCC
CTGGAGTGCC GGCTGCGGCT GGAAGACGGC CGGGTATGGC GGATGGTAAT CGATCCGGGT
AGCCTGCCCC TTCTCCGGAC TACCGTTGTG GAAGGGGTAG CCTTTGAGGC CAGGCAGCTC
ACCCTTCCCG CCAGGCTGCC CTGGGGCTAC CATCACCTCC ATCTGCGCCT GCCGGGCCTT
ACCCGGGAGG TATTGCTTAT TGCCGCCCCC TCCCGGGCCG GTGCCCCGCT AACAGGCCAG
GGGGAACACC TCTGGGGGTG TTTTTTACCC CTCTATGCCC TCCATTCCCA CCGTAGCCTG
GGGGCCGGGG ATTTTGGCGA CCTGGAGGCT CTATCCCTCT GGGTCAACAG CATGGGGGGC
AGTTTTACCG GCACCCTGCC CTTCCTGGCG GCCTTCCTGG ACGAGCCCTT TGCCCCCAGT
CCCTATCAGC CGGTTAGCCG CCTCTTCTGG AATGAGTTTT ACCTGGATAT TTCCCGTCTG
GAAGAGGTGC AGCAGTGCCG GGAAGCCCGG GATTTTCTAA ACTCGGCGGC GGTGCAGAAG
GAGATAGCCG CTTTACGGGC CGCTCCCCTG GTAGACTACC GCCGGGGGAT GGCCCTGAAA
AGACGCCTGT TAGCTCTCTG CGCCCGTACC TTTTTTACCG GTGCTCCCGG CCGGAGGGAA
GAGATGGCGG CCTGGCTGGC CGGCAACCCG GCGGCCCGGG ACTACGCCCG CTTCCGGGCC
GCCGTCGAGA AACAGCACGC CACCTGGCCG GAGTGGCCGG CCCCGATGCG GGATGGTAAC
CTCGGCGAGG GAGATTACGA TCCGGAAGCC ATGCAGTACC ACCTGTACGT CCAGTGGCAG
GCCCACCGAC AGGTGCAGGC CCTGGCAGCC CGCGCCCGGC GCTCCGGTCC GGGTTTATAC
CTGGACCTGC CCCTGGGAGT CCACCGGGAG GGTTATGATG TCTGGCGTCA TCGCCGGGCC
TTCGCCCTGG CGGCCAGCAG CGGGGCCCCA CCGGATGCCC TCTTCCGCCG GGGTCAGGAC
TGGGGGTTTC CCCCCTTCCA CCCTGAGGGA ATCCGGGAAG ACGGTTATCA CTATTACATC
GCCTGCCTGC GCCATCACCT GCGCCACGCC GGCATCCTGC GCCTGGATCA CGTCATGGGG
TTGCACCACC TCTACTGGAT ACCCCGCGGC CTGGCGGCCA CGGAGGGGGT TTATGTGCGC
TATCACGCCG GGGAATTTTA CGCCATCCTG TCCCTGGAAT CCCGGCGCCA CGGGGCCCTC
CTGGTGGGGG AAGACCTGGG GACGGTGCCA GCTTACGTGC GCCGGGCCAT GACCAGGCAC
AATATCAGCC GCATGTATAT CTTGGCCGTA GAGTATACCG GGAAAACGGG CCGGGCCCTG
GGACCGGTGC CCCCAGAGAG CCTGGCCGGC CTGAATACCC ACGACATGCC GCCCTTTGCT
GCCTTCTGGC GGGAAAGGAA AAAGAACAGC CGCCAGCTGG CGGCCCTGCC TGTCTTCCTT
TATAACCGGG GTCGCCTGGA AGTGCCAACG ACGGCGACCA GAAGCCTTCT AAGGGGCTGC
CTGGCGTACC TGGCCGCCAG CCCGGCGCGT TTGTTGCTGG TAAACCTGGA GGATCTGTGG
CTGGAGACGG AACCCCAGAA TATCCCCGGC ACAAGCACCG AGTACCCCAA CTGGCGGCGT
AAGGCCCGCT ACAGCCTGGA GGAGTTCAGC CGGCAGCCAG GAGTAGTGGC TCTCCTGCGG
GAGGTTAACT ACTGGCGAGG TACAGCTAAA TAA
 
Protein sequence
MDGTLSDKEL RLLHRLCRWY GVEPAYRDGE GKLRRAGPES LLAVLRALGA PVAGLADLPG 
ALRERRQQYW RRCCEPVAVA WAGRLSHMEL RLPAGRATGP LECRLRLEDG RVWRMVIDPG
SLPLLRTTVV EGVAFEARQL TLPARLPWGY HHLHLRLPGL TREVLLIAAP SRAGAPLTGQ
GEHLWGCFLP LYALHSHRSL GAGDFGDLEA LSLWVNSMGG SFTGTLPFLA AFLDEPFAPS
PYQPVSRLFW NEFYLDISRL EEVQQCREAR DFLNSAAVQK EIAALRAAPL VDYRRGMALK
RRLLALCART FFTGAPGRRE EMAAWLAGNP AARDYARFRA AVEKQHATWP EWPAPMRDGN
LGEGDYDPEA MQYHLYVQWQ AHRQVQALAA RARRSGPGLY LDLPLGVHRE GYDVWRHRRA
FALAASSGAP PDALFRRGQD WGFPPFHPEG IREDGYHYYI ACLRHHLRHA GILRLDHVMG
LHHLYWIPRG LAATEGVYVR YHAGEFYAIL SLESRRHGAL LVGEDLGTVP AYVRRAMTRH
NISRMYILAV EYTGKTGRAL GPVPPESLAG LNTHDMPPFA AFWRERKKNS RQLAALPVFL
YNRGRLEVPT TATRSLLRGC LAYLAASPAR LLLVNLEDLW LETEPQNIPG TSTEYPNWRR
KARYSLEEFS RQPGVVALLR EVNYWRGTAK