Gene Moth_1854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1854 
Symbol 
ID3831485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1914369 
End bp1916117 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content61% 
IMG OID637829786 
Productglycoside hydrolase family protein 
Protein accessionYP_430697 
Protein GI83590688 
COG category[S] Function unknown 
COG ID[COG1543] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTACCC GGTCGGGTGA TAACCTAGGT TATCTGTCGC TGGTGCTCCA TGCTCATCTG 
CCCTTTGTCC ATAACCGGGA GCCCTATGTC AGCCTGGAAG AGAAATGGCT TTTCGAAGCC
CTGACGGAGT CCTACCTGCC CTTGATCCTG AGCTGGGAGG AGCTGGCCGG CGAGGGGTTA
GACTTTCACC TCACCCTCTC CCTGTCGCCG CCCCTCATCA GCATGTTGAT GGAACCTGCC
CTGGGGGAGC GCTACGGTCG TTACCTGGAT AACCTCAGGG AACTGGCCGG CCGGGAAATT
GAGCGGACCA GGGGTGACCC GACTTTCGCC CCCCTGGCGG AGTTTTATCA CCGGCGCCTG
ACTCTCGTCG ACCGGGCCTT CAAAGAAACC TACCGGGGGA ATTTACTGGC GCCTATAAAA
AGGCTGCGGG AACAGGGTCG CCTGGAACTG ATAACCACCG CGGCTACCCA TGGTTACCTT
CCCCTGATGC TTACCGATGA GGCCAGGCGA GCCCAGGTGC GGGCAGCCCT GGATCTTTTC
GGCATGACCA TGGGCTTTGT CCCGGACGGG CTCTGGCTGC CGGAGTGCGG CTATACCCCA
GGGATTGAAA AAATCCTCCG GTCCGAGGGG ATTAAGTATT TCATCGTCGC CAGCCACGGC
ATGTTAAATG CCACCCCCGT GGTAAAATCG GCTGTTTATG CTCCGGTCAG GGTGGGCGGC
GTGGCTGTCT TCGGGCGGGA CTGGGAGACT TCCCACCAGG TTTGGAGCCG GACCGAGGGT
TATCCCGGAG ATCCGGTGTA CCGCGAGTTC TACCGGGATA TTGGTTACGA CCTGGATTTC
AATTACCTGG CACCCTACCT GGTGGGGGGG ATCCGGGGAG ATACCGGCTT TAAATACTAC
CGGATTACCG GTAAAACCGG GGTCAAGGAG CCCTATGACT ACCGGGCCGC CCGGGAGCGG
GCCCGGGAGC ATGCCCGGGA CTTTATCGCC AACCGGGAGA AACAGCTGGC TTACTGGGCC
GGCAGGACCC AGGATAAGCC TGTCGTCGTC GCTCCCTATG ACGCCGAGCT CTTTGGCCAC
TGGTGGTTTG AGGGGCCGGA CTGGCTGGCC GATGTCCTGC GCCTGGCAGG CGAAAGCCGG
GTCTCCCTGA CCTCTCTTTC GGCTTACCTG GAGCAGTATC CGCCCCGCCA GGAAGTGACC
ATGGGTCCTT CCAGCTGGGG AGAAGGGGGT TATAACCACG TCTGGTTGAA CCAGGCCAAC
GACTGGCTTT ATCTCCACCT CCACCGGGCG GAGAGGGCCA TGATCAAGCT GGCTGCCGCC
AATCCCCGCC CCGGCTCCCT GCAGGAGCGG GCACTGAACC AGGCTGCCAG GGAATTGCTC
CTGGCCCAGA GTAGCGACTG GTCCTTTATC CTCACCACCG GGACGACGGT GGACTATGCC
CGGCGTCGCC TCCGGGAACA CCTGGGGGCT TTTTTTAAAC TCTGCCAGGA TTATGAGCGA
GATCGCCTGG ATGAAGATTT TCTGGCCCGC CTGGAGGCGG CGGACAATAT CTTCCCCGGC
CTCGATTTTC GCCTCTACCG GCCAGCCGGA AGGGGAGTAG CCTGCCGGCC CGAGGTTCAT
AATAAAACCA GGCCGGGAAT TCTTATGCTT AGCTGGGAAT TCCCACCCCG CCATGTTGGC
GGCCTGGGTA TCCACGTCCG GGACTGGGCC AGGCCCTGGC CCGCCAGGGG GTGGATGTCC
ACGTCCTGA
 
Protein sequence
MVTRSGDNLG YLSLVLHAHL PFVHNREPYV SLEEKWLFEA LTESYLPLIL SWEELAGEGL 
DFHLTLSLSP PLISMLMEPA LGERYGRYLD NLRELAGREI ERTRGDPTFA PLAEFYHRRL
TLVDRAFKET YRGNLLAPIK RLREQGRLEL ITTAATHGYL PLMLTDEARR AQVRAALDLF
GMTMGFVPDG LWLPECGYTP GIEKILRSEG IKYFIVASHG MLNATPVVKS AVYAPVRVGG
VAVFGRDWET SHQVWSRTEG YPGDPVYREF YRDIGYDLDF NYLAPYLVGG IRGDTGFKYY
RITGKTGVKE PYDYRAARER AREHARDFIA NREKQLAYWA GRTQDKPVVV APYDAELFGH
WWFEGPDWLA DVLRLAGESR VSLTSLSAYL EQYPPRQEVT MGPSSWGEGG YNHVWLNQAN
DWLYLHLHRA ERAMIKLAAA NPRPGSLQER ALNQAARELL LAQSSDWSFI LTTGTTVDYA
RRRLREHLGA FFKLCQDYER DRLDEDFLAR LEAADNIFPG LDFRLYRPAG RGVACRPEVH
NKTRPGILML SWEFPPRHVG GLGIHVRDWA RPWPARGWMS TS