Gene Moth_0664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0664 
Symbol 
ID3832151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp694102 
End bp696930 
Gene Length2829 bp 
Protein Length942 aa 
Translation table11 
GC content41% 
IMG OID637828603 
Producthypothetical protein 
Protein accessionYP_429533 
Protein GI83589524 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000150651 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000068678 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGACAAGA TGAAAACTAG AATCCTCGTT ATCAGTGAGT ACTTCTCACG TGGCGGTTTA 
GAAACTCATA TTGTCGGTCA AGCGCGCGTT CTATCGAAAC TTGGTGTGGA TCTTCTGTTA
GCAACAGGTT CCTCAGCAGC CGATTGCCCC GATGGTGTGT TTGCTGCAGC CCTTACAGAT
CTTCGAATGG GCGCCCAGGT ATCATATGAC GAATTATGTG TGACGCTTAA AAGGCTCAAG
AAATTCATTT CAGACGAGCG CATAACGTTA ATCCATGCGC ATCCGTTTTA TAGCGCAATC
GTCGGTCTCT TGGCAGCGCA ACAATCACGA TTGCCTTTTG TCGTTACGAT ACATAGTCCG
TTATCATTAA GTAGTACGTT TGGGCAACTC TATGATTTTT TGCTCAAGTC TGTTGTGCTC
CCAGTAGCAG GTCGAGTTTT CTGTGTATCG AAAGAGACAG AACTTTTGTG TCGCTCTTTG
GCAGAATGTA GGACTGAACT TTTACTGAAC GCTGTTCAAA TACAGAACTC GAACCCACCA
AACGTAGCCA AAGATGGCCC ATGGTTATGG GCTGGACGGC TTGATAAAGA TAAGTCTAAC
GGCCTTCTCG ATTTGATAGA GAAAATTGAT CAGGCAACAG TCGGCGAACT CCATATATTT
GGAGATGGGC CTCAAGTTCA TTTAATTGAA TCGTTTCTTA ATAGTCGGCC AGACAAGGCT
GAGTTCGTGC GACTGATGGG ATGGCGTCAT AATATAACCA CGATAATGCC TGCTTATGCT
GGCATTGCTG GGATGGGGCG CGTAATTCTT GAGGGTTCTG CCTTAAACAG ACCATGTTTG
CTGGTTGGCT ATGATGGGGT TAAAGGCCTA CTCGATATTA ATAGGTTCGA ACGAGCCTCT
TTTTGGAATT TCTCGGGTCG TGGATTACCA ACAATTACAG CGGATGCCCT TCATCAGGAA
TTCTATAGAT TGTCCAAAGA TAAAGGCCCT TTTCTTCTTC GCCAATGGGT CGCCGATAAT
CGGGACGAAA GAGTGATCTG GCAGCGCTAT GCGGAAAAGA TAAAAGATTT AGCTCCGCTT
GATAATCCGT TGGCTCAAAA CATCTTGGAT GCGTTGCAAT ATCGCGGTTC CATTTCCGAA
CCTGTGTGGT GGGACAAAGA GTTGATGCGG GTTATTCTGG GCTTGTTCTC CAATGAGCCA
TACGGGAAGG AACAGGCGAA GAGTATGACC CATCAGATCT TGGTAGCGCA CCTCCACTCT
AAATTAGAAG CGATAAAGTA TGAAACAACA ATGTTACGGG AGAAAATAGA TTTCCTTAAG
AGTGCGCTGG CTGAGCGTGA CGAAAAGATC ACTTCTCTCA ATCAAGCTGT GGCTGAACGT
GACGAAAAGA TCATCTCCTT AAATCAAGCT GTGATTGAGC GAGATGAAAA GATTGCTTCG
CTACAAAAGC ACATTCAAGA TATTTGGGCC AGCACATCGT GGCGGATTAC ACGCCCTTTA
AGATTTCTTA AGAAGCTTGT TAGTGATCCC GAGCCGACAA CTTATTTTAT ACTAAAGCGC
ATTTATTGGG GACTTCCAGG AAATTTGCGG ATACGTCTAA ATGGACTTAG ATGTCTTATT
ATTCGTTTTT TCTTATCAAA GCGCAAAAAC AATGTAGGTC TAATGGCTAA TCAAGAAGGT
ATACATGGCC TTTCTTGGGA AGAGTTTCAA GATAAAGTAC TATCGAAACG GGAACAACAC
AAGGGAATTT TTATTTTAGA GGCTACCCAT ATAGATTGGA ATATGAATTT ATTTCAGAGA
CCTCAACATA TGGCAAACGC ACTTTCAAAG CTTGGTTACC TCGTAATTTT TAAGACTGCA
AATTTTTATG ATAATGTATC TGGGTTTAAA AAAATATCCG ATAACCTTTG GTTGACAAAT
AATGATAAAG TAGACAGTAT TTACGGAGCA GTAAGAAGCT TTTACAGTAC CTCTTCTGTT
TACACTAAAG AAATCTATGA TGACCGCAGA AAATACGGCC TTGTTGTTTA CGAATACATA
GATCATATTG ACCCCGCTAT ATCTGGAGAT GAGGAGAACA TCCGACGCCT GAATGCTCTC
AAGAATTATG CCTTCAATGG TGGAGTAGAT TTTATCGTGG CTTCAGCTAA AGCATTGTAT
AGAGAGGCTG TTCAAGCTGT AGGCGAAGAT AAAGTTATTT TAATTCCAAA CGGTGTGGAT
GTTGAGCATT ATCGCGACCA ACGCCACAAG TACTGTACTA TTCCTAAGTC GTTGATTAAG
TTCAAGAATG TACATAAAAT AATTGTGGGT TACTTTGGGG CACTTGCACC TTGGCTGTGG
TATGAAGAGA TAGAGAAACT TGCAGCCCTT AGGCCAGAAG TAGGTTTTGT TTTCATTGGG
CCTGATTATT ATGGCGGTTC ATCTCTGTTA CCGAAAGCGA AGAATATTTT TTGGATGGGG
CCGGTTGATT ACAAAATTTT ACCCGGTTAT GCACTTCATT TTGATATATG TTTTATTCCA
TTCCGACCAG GCGAGATAGC TCGGACCACA TCTCCATTAA AATTATTTGA ATATTTTGCT
CTAGAGAAAC CTGTAATAGT AACTTCGAGT ATGTTAGAAT GTATACAATT TTGCGAGGTT
TTGAGTGGAA GTTGTGCTAC AGAGCTATCA AAATGCATAG ATAAAGCTTT AGATTTATCT
CGTGATGAAC ATTTTAAAAA GCGTTTGGCT GAATTAGCTG ATCAAAATTC TTGGATTGAA
AGAGCGAAAA AATATGAGAT TATTTTTGAA CAGACTAAGA AATGGATTTG TCATAAGAAG
GATATTTAA
 
Protein sequence
MDKMKTRILV ISEYFSRGGL ETHIVGQARV LSKLGVDLLL ATGSSAADCP DGVFAAALTD 
LRMGAQVSYD ELCVTLKRLK KFISDERITL IHAHPFYSAI VGLLAAQQSR LPFVVTIHSP
LSLSSTFGQL YDFLLKSVVL PVAGRVFCVS KETELLCRSL AECRTELLLN AVQIQNSNPP
NVAKDGPWLW AGRLDKDKSN GLLDLIEKID QATVGELHIF GDGPQVHLIE SFLNSRPDKA
EFVRLMGWRH NITTIMPAYA GIAGMGRVIL EGSALNRPCL LVGYDGVKGL LDINRFERAS
FWNFSGRGLP TITADALHQE FYRLSKDKGP FLLRQWVADN RDERVIWQRY AEKIKDLAPL
DNPLAQNILD ALQYRGSISE PVWWDKELMR VILGLFSNEP YGKEQAKSMT HQILVAHLHS
KLEAIKYETT MLREKIDFLK SALAERDEKI TSLNQAVAER DEKIISLNQA VIERDEKIAS
LQKHIQDIWA STSWRITRPL RFLKKLVSDP EPTTYFILKR IYWGLPGNLR IRLNGLRCLI
IRFFLSKRKN NVGLMANQEG IHGLSWEEFQ DKVLSKREQH KGIFILEATH IDWNMNLFQR
PQHMANALSK LGYLVIFKTA NFYDNVSGFK KISDNLWLTN NDKVDSIYGA VRSFYSTSSV
YTKEIYDDRR KYGLVVYEYI DHIDPAISGD EENIRRLNAL KNYAFNGGVD FIVASAKALY
REAVQAVGED KVILIPNGVD VEHYRDQRHK YCTIPKSLIK FKNVHKIIVG YFGALAPWLW
YEEIEKLAAL RPEVGFVFIG PDYYGGSSLL PKAKNIFWMG PVDYKILPGY ALHFDICFIP
FRPGEIARTT SPLKLFEYFA LEKPVIVTSS MLECIQFCEV LSGSCATELS KCIDKALDLS
RDEHFKKRLA ELADQNSWIE RAKKYEIIFE QTKKWICHKK DI