Gene Moth_1915 details


Gene Information

Locus tag: Moth_1915
Symbol:
ID: 3830839
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1984455
End bp: 1986041
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 58%
IMG OID: 637829848
Product: glycoside hydrolase family protein
Protein accession: YP_430758
Protein GI: 83590749
COG category: [S] Function unknown
COG ID: [COG1543] Uncharacterized conserved protein
TIGRFAM ID:
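
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not the database's own method: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(1984455, 1986041)
print(length, protein_length_aa(length))  # 1587 528
```

Both results match the Gene Length and Protein Length fields of this record.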


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0191786
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCGTG GCTATGTCGC CCTGGTACTT CATGCCCACT TACCCTATGT CCGCGATACG 
GAAGACGATT TTTCCTTAGC GGAAAAATGG TACCACGAAG CCGTCACCGA AACCTATATA
CCCTTAATAA ATATCTGCCA GCGCCTGAAC CGGGACCGAG TGCCTTACAG AATCACCATC
TCCTTAAGCC CGCCCCTGGT GACCATGATG GCCGACCCCC TGGTTCAAGA ACACTACCGC
CGCTACTTGG AACGCCTGCG GGAACTGGCT GCCCGGGAGG TCTGGCGTAC GCGGAATGAC
CCCCGCTTTC ACCTGGTGGC CCGTATGTAC CAGGACCTAT TTGAAAATAC CGCCCGTACT
TACCAGACTT ATGGCGGCAA TTTGATCAAC GCTTTCCGGG AGCTCCAGGA TAGCGGCAAG
GTAGAGCTAA TCACCTGTGC CGCCACCCAC GGCTACCTGC CCCTCATAGG CCTGCAGCGG
GAGGTGGTCC GGGCCCAGGT GGAAGTGGCG GTCAACAACC ATCGCCGCCT CTTCGGGCGG
CCGCCGGCGG GTTTGTGGCT GCCGGAGTGC GCCTATAACC CGGGCGACGA CGCCATTCTG
CGTGATTATG GCCTTAAATA CTTCTTTGTC GACGCCCACG GTCTCCTTTA CGCCACACCG
CGGCCACGCT ACAGCATCTT CGCCCCCGTT TACACCCCTG CCGGGGTGGC GGCCTTCGGC
CGTGACCTGG AGTCTTCGGA ACAGGTCTGG AGCGCCCAGG AAGGTTACCC CGGGGATTTC
GACTACCGGG AATTTTACCG GGACATTGGC TACGACCTGG ATTTTGAGTA CATTAAACCC
TACATCCATC CCTCCGGCCT GCGTCTGGAT ACGGGTCTTA AATATTATCG CATCACCGGC
AAGTCCGGCT ATAAAGAACC CTATGTTCCG GAATGGGCCA GTTTCAAAGC CCATACCCAT
GCCGGCAACT TTTTGTTCAA CCGGGAGCAG CAGATCAATT ACCTGGCCAC TTATATGGAC
CGGCCGCCTT TAATCATCTG TCCCTATGAC GCCGAGCTCT TCGGTCACTG GTGGTTCGAG
GGACCCCAGT GGCTGGAATC CCTTTTCCGC CAGGTGGCGG GTCTCGCCCC CCAGCCCTTT
AGTTTTATCA CCCCCAGCGA GTACCTGGAG CGTTTTCCCG TCAACCAGCC GGCCACGCCC
TGCATGTCCA GCTGGGGTAA CAACGGTTAT AACGAGGTCT GGCTGGAAGA TTCCAACCAT
TGGATCTACC GCCACCTGCA CCATGCCGCC GCCGAAATGA TCCGCCTGGC CAACCAGCAC
CCTACCGCCG GGGGCATCCT GCTGCGGGCC TTGAACCAGG CCGCCAGGGA ACTCCTGGTG
GCCCAGAGCA GCGACTGGGC CTTCATCATG AAAACCGGCA CCATGGTCGA GTACGCCGTG
AGCCGGACAA AAAAACACCT GCTCAATTTC TGGGAGCTCA CCCGTGGGAT TAATAAAAAC
GACCTGGACC CGGCAAAGGT CCAGGCCCTG GAGGAGGCCA ATAATATCTT TCCGGATATA
AACTATAGGA TTTTCGCCAG CAGGTAA
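
The gene sequence is printed in blocks of 10 bases, 60 per row, so the whitespace has to be stripped before the sequence can be measured. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and how the database rounds its reported value is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, as printed in this record.
first_row = "ATGCCCCGTG GCTATGTCGC CCTGGTACTT CATGCCCACT TACCCTATGT CCGCGATACG"
seq = clean_sequence(first_row)
print(len(seq))          # 60
print(gc_percent(seq))   # GC percentage of this row only
```

Passing all rows of the gene sequence through `clean_sequence` before calling `gc_percent` gives the figure for the whole gene.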
 
Protein sequence
MPRGYVALVL HAHLPYVRDT EDDFSLAEKW YHEAVTETYI PLINICQRLN RDRVPYRITI 
SLSPPLVTMM ADPLVQEHYR RYLERLRELA AREVWRTRND PRFHLVARMY QDLFENTART
YQTYGGNLIN AFRELQDSGK VELITCAATH GYLPLIGLQR EVVRAQVEVA VNNHRRLFGR
PPAGLWLPEC AYNPGDDAIL RDYGLKYFFV DAHGLLYATP RPRYSIFAPV YTPAGVAAFG
RDLESSEQVW SAQEGYPGDF DYREFYRDIG YDLDFEYIKP YIHPSGLRLD TGLKYYRITG
KSGYKEPYVP EWASFKAHTH AGNFLFNREQ QINYLATYMD RPPLIICPYD AELFGHWWFE
GPQWLESLFR QVAGLAPQPF SFITPSEYLE RFPVNQPATP CMSSWGNNGY NEVWLEDSNH
WIYRHLHHAA AEMIRLANQH PTAGGILLRA LNQAARELLV AQSSDWAFIM KTGTMVEYAV
SRTKKHLLNF WELTRGINKN DLDPAKVQAL EEANNIFPDI NYRIFASR
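
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard NCBI ordering; translation table 11 assigns the same amino acids to codons as the standard table and differs only in the set of permitted start codons, which this sketch does not model. The `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in NCBI order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide sequence up to the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 27 bases of the gene sequence in this record.
print(translate("ATGCCCCGTGGCTATGTCGCCCTGGTACTT"))  # MPRGYVALVL
print(translate("AACTATAGGATTTTCGCCAGCAGGTAA"))     # NYRIFASR
```

Both fragments agree with the start and end of the protein sequence listed above.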