Gene Moth_1536 details

Gene Information

Locus tag: Moth_1536
Symbol:
ID: 3831922
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1579958
End bp: 1581010
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 65%
IMG OID: 637829468
Product: hypothetical protein
Protein accession: YP_430388
Protein GI: 83590379
COG category: [S] Function unknown
COG ID: [COG3854] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR02858] stage III sporulation protein AA
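
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of this unspliced CDS is a stop codon; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1579958
end_bp = 1581010

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1053
print(protein_length)  # 350
```

Both values agree with the Gene Length and Protein Length fields of the record.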


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.127679
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTACCG TACACCTGGT ACAGGTTGAA ACCGCCAATC AGCCTCCTGG ACAGGCACCC 
CGCCGGGATG GAGACCCCCT GGCAGGGATC GAGGAGTTGC TACCGCCGAA TATCAAGGCC
GCTGTGGAAA GTCTTCCCGC CGGAATTAGG GATAATCTGG AAGAGATTCG CCTGCGCCGG
GAGCGTCCCC TTCAGGTTCG CTGGAGCGGC GGCGAAGGCT GGGTCGCAGC CAGCGGCGGC
CTGGCCGCCG GCCCGGATGG CGCTTATAAA GTAACTGCTG CCGATCTGGG GCGGACCATT
GAGGCCCTGA CCAGGAGTTC CCTTTACGCC CTGGAAGAGG AGCTGCGTTC CGGTTATATC
ACCATCAGCG GCGGCCACCG GGTAGGCCTG GTAGGGGAGG CGGTTGTACT CCAGGGGGAA
ATCCGCACCT TGAAAAACTT TGCCGGACTC AACCTGCGCC TGGCGAGGGA TATCCCCGGT
TGCGCCAGGA GCCTCATTCC TTACCTTCTG GAGGGAGGGC GCCCCCTGCA TACCCTGATC
CTCTCACCGC CGCGGTGCGG CAAGACAACC CTCCTGCGGG ATCTCATCCG CCTCCTAAGT
ACCGGCGTAC CAGAGCTTAA GTTTTCCGGT GTCAATGTGG GTGTGGTGGA CGAGCGGTCG
GAAATTGCCG GCTGCTGGCT GGGGGTACCG CAGCTCGAGG TAGGCCCGCG GACGGATGTC
CTGGACCGCT GCCCCAAAGC GGCAGGGATG CTCATGCTCC TGCGGTCTAT GGGACCGGAA
GTCATTGCCA CCGACGAGAT CGGCCGGCCG GAGGAACTGG CGGCCCTGCA GGATGTTCTC
CACGCCGGCG TCACCATGCT GGCCAGCGTC CACGCCGGTA GCCTGGAAGA GCTGCAACAC
CGCCCGGGCT GGGGCCCCCT GCTTAAGCAG GGCTTCTGGC AGCGCCTGGT GCTCCTGGGG
CGCACCCTGG GTCCGGGAAC TATTGAAGGC GTTTTTTCCG GGGATCACCG TACCCTGAAG
CGGGGTCCCT GGCGGGGGGA GGCCCGACCG TGA
 
Protein sequence
MATVHLVQVE TANQPPGQAP RRDGDPLAGI EELLPPNIKA AVESLPAGIR DNLEEIRLRR 
ERPLQVRWSG GEGWVAASGG LAAGPDGAYK VTAADLGRTI EALTRSSLYA LEEELRSGYI
TISGGHRVGL VGEAVVLQGE IRTLKNFAGL NLRLARDIPG CARSLIPYLL EGGRPLHTLI
LSPPRCGKTT LLRDLIRLLS TGVPELKFSG VNVGVVDERS EIAGCWLGVP QLEVGPRTDV
LDRCPKAAGM LMLLRSMGPE VIATDEIGRP EELAALQDVL HAGVTMLASV HAGSLEELQH
RPGWGPLLKQ GFWQRLVLLG RTLGPGTIEG VFSGDHRTLK RGPWRGEARP
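
The relationship between the gene sequence and the protein sequence can be illustrated with a short, dependency-free sketch. It assumes that the sense-codon assignments of translation table 11 are the same as those of the standard genetic code (table 11 differs in which codons may act as start codons), and it strips the spaces that group the sequence into blocks of ten. The helper names `clean`, `gc_content`, and `translate` are hypothetical and are not part of any database tooling.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for block formatting and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = "ATGGCTACCG TACACCTGGT ACAGGTTGAA ACCGCCAATC AGCCTCCTGG ACAGGCACCC"
print(translate(first_line))  # MATVHLVQVETANQPPGQAP
print(f"{gc_content(first_line):.1%}")
```

The translated residues match the first twenty residues of the protein sequence. Passing the full gene sequence to the same functions would give the complete protein and the overall GC fraction.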