Gene Moth_0430 details


Gene Information

Locus tag: Moth_0430
Symbol:
ID: 3830954
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 433790
End bp: 434869
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 43%
IMG OID: 637828365
Product: mannonate dehydratase
Protein accession: YP_429304
Protein GI: 83589295
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1312] D-mannonate dehydratase
TIGRFAM ID: [TIGR00695] mannonate dehydratase
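
The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes that the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 433790
end_bp = 434869

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the reported Gene Length (1080 bp) and Protein Length (359 aa).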


Plasmid Coverage information

Num covering plasmid clones: 61
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.174722
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATGA CATTTAGATG GTTCGGAGAG GGCTATGATA GTATCTCTTT GGATAAAATC 
AGGCAAATAC CGGGGAAGCC CGGGATTGTA AGCGCTATTT ATGATGTACC TGTGGGCGAA
GTATGGCCTG AAGAAAAAAT AAAAAAATTA AAGGAGACAG TAGAAAATGC GGGACTGGAA
TTAGAGGTCA TAGAAAGCGT TAATGTCCAT GAGGATATCA AACTTGGACT TCCCAGTAGG
GACCGTTATA TTGAGAACTA CCAGCAGACC TTGAGAAATC TGGCTAAATT CGGCATTAAG
GTCGTGTGCT ACAATTTTAT GCCCATATTT GATTGGACAC GGTCGGATTT AGCGAAAGTC
CTGCCAGATG GTTCCACTGC TCTTTCCTAT GAAGAGGAAA AGGTACAGAA GGTGGACCCC
AATAGGATGG TGGAAGAAGT AGAGGCCAAC TCTAACGGCT TTGAGCTGCC TGGCTGGGAG
CCTGAAAGAC TTAAAACACT AAAGGTGCTG TTTGAACAAT ACAAGAGTGT GGATGAGGAA
AAGCTATTAA AAAACCTGGG GTATTTTTTA AGGGCAATTA TTCCTGTGGC TGAAGAAGTT
GATATAAAAA TGGCCATTCA TCCCGACGAT CCGCCGTGGT CTATATTTGG TCTTCCCAGG
ATTGTAAAAT CCAAAGAAAG CCTGGAAAAG ATCATGGCCC TGGTAGACAG CCCCTACAAT
GGTATCACAC TATGTAGTGG TAGTCTTGGG GCAAATCCGG ACAACGATAT TCCTGCTCTT
ATACGCTATT TCGGCGCTAA AGGAAGAATA CACTTCGGTC ATGTAAGGAA TATTAAGATA
CATTCACTCC GCAATTTTGA TGAGTCTTCT CATTTGTCTT CGGATGGATC TTTGGATATG
TTTGAGATTA TGAAGGCATA CCATGATATT GATTTCAAGG GATATATCAG GCCGGACCAT
GGTCGAATGA TCTGGGGAGA AGTAGGCAGG CCTGGGTATG GCCTGTATGA CAGGGCTCTT
GGGATCGCCT ATCTGAACGG GTTATGGGAA GCAATTGGTA AAATGAAAAA GGTATGTTAA
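
The gene sequence is listed in blocks of ten bases, so it has to be joined before any statistic such as GC content is computed. The sketch below shows one way to do that. It assumes GC content means the percentage of G and C bases over the whole sequence, and the helper names `clean_sequence` and `gc_percent` are hypothetical, not taken from the source database. Only the first line of the listing is used here; the full sequence can be pasted in the same way.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing above.
first_line = "ATGAAAATGA CATTTAGATG GTTCGGAGAG GGCTATGATA GTATCTCTTT GGATAAAATC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```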
 
Protein sequence
MKMTFRWFGE GYDSISLDKI RQIPGKPGIV SAIYDVPVGE VWPEEKIKKL KETVENAGLE 
LEVIESVNVH EDIKLGLPSR DRYIENYQQT LRNLAKFGIK VVCYNFMPIF DWTRSDLAKV
LPDGSTALSY EEEKVQKVDP NRMVEEVEAN SNGFELPGWE PERLKTLKVL FEQYKSVDEE
KLLKNLGYFL RAIIPVAEEV DIKMAIHPDD PPWSIFGLPR IVKSKESLEK IMALVDSPYN
GITLCSGSLG ANPDNDIPAL IRYFGAKGRI HFGHVRNIKI HSLRNFDESS HLSSDGSLDM
FEIMKAYHDI DFKGYIRPDH GRMIWGEVGR PGYGLYDRAL GIAYLNGLWE AIGKMKKVC
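
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a minimal sketch, the snippet below translates the first 30 bases of the gene using the standard codon assignments, which NCBI translation table 11 shares (table 11 differs from the standard table in which codons may act as start codons). The function name `translate` is illustrative and not part of any source tool.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGAAAATGA CATTTAGATG GTTCGGAGAG"))  # MKMTFRWFGE
```

The output matches the first block of the protein sequence, and the final nine bases of the gene (GTATGTTAA) translate to VC followed by a stop codon, in agreement with the end of the protein listing.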