Gene Moth_1108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1108 
Symbol 
ID3833074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1134624 
End bp1136081 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content57% 
IMG OID637829036 
Productinosine-5'-monophosphate dehydrogenase 
Protein accessionYP_429965 
Protein GI83589956 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0516] IMP dehydrogenase/GMP reductase
[COG0517] FOG: CBS domain 
TIGRFAM ID[TIGR01302] inosine-5'-monophosphate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.927877 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACGACGG ATAAAATAAT TGGCGAGGGT TTAACTTTTG ATGACGTCCT CCTGGTCCCC 
GGTGAATCAG AGGTGTTACC GCGGGAAGTT GATATCAGCT CCAATTTTAC CCGTCATATT
CGCCTCAATA CTCCCCTGGT GAGCGCTGCC ATGGATACAG TGACTGAGGC CCGGACGGCG
ATCAGCATGG CCCGGGAGGG GGGCATCGGC GTTATCCATA AGAACATGAC CATCGAACGC
CAGGCCAGGG AGGTCGACCG GGTCAAGCGT TCAGAACATG GCGTCATTAC TGACCCCATT
TCCTTGAGCC CGGATCATAA GGTCCGGGAA GCCATCGCCC TGATGGAGCA CTACCATATC
TCAGGGGTTC CCATTACCGA TAATGGTAAG CTGGTAGGCA TCATTACCAA CCGGGATATA
CGTTTTGAAG ACAACCACGA GCGGCCTATT AAGGAGGTTA TGACCAAAGA CAACCTGGTA
ACGGCGCCGG TAGGTACTAC CCTGGCCGAG GCCATGGCCA TTTTAAGGGC CCACAAGATT
GAGAAACTCC CCCTGGTAGA CGCCGACTAT AACTTGAAGG GGCTAATTAC CATCAAGGAT
ATTGAGAAGA CACGCCGGTA TCCACAGGCC GCCAAGGATG AGAGGGGGCG CCTGCGGGTG
GCAGCGGCAG TGGGTACCTC AGCCGATACC ATGACCAGGG TAGAGGCCCT GGTAGCCGCC
GGGGTAGACG CCATTGTTGT GGATACAGCC CATGGCCAGT CCCGGAGTGT TATTGAAACA
GTGAAACGTA TCAAGGCTGC CTTCCCGGCG GTGGAGCTGG TGGCCGGTAA TGTAGCAACT
TACGACGGCG CCCGGGCCCT GGCTGAGGCC GGGTTTGACG CCGTGAAGGT TGGGGTTGGA
CCAGGTTCCA TTTGTACTAC CAGGGTTATC GCCGGCATTG GCGTCCCCCA GATTACGGCA
GTGATGGAGT GCGCCCGGGC AGCGGCGGAG TTTGGTATTC CGGTAATTGC CGATGGGGGT
ATTAAATACT CCGGTGATAT TACCAAGGCC ATTGCCGCCG GCGCCAACAC AGTAATGATC
GGCAGTCTCC TGGCCGGCAC AGAGGAAAGC CCTGGTGAGA TTGAAATCTT CCAGGGCCGC
AGTTTTAAGA GTTATCGCGG CATGGGTTCC CTCGCGGCCA TGAAGGAAGG CAGTAAAGAC
CGCTATTTCC AGGAAGAAGC CGAAAAACTG GTACCGGAAG GGATTGAAGG CCGCGTCCCT
TATAAAGGCC CCCTCTCGGA GACTATTTTC CAGCTGGTGG GCGGTTTACG AGCCGGCATG
GGTTACTGTG GTGCCCGTAA TATCGCTGAA CTCCAGGCCC GAGGGCGCTT TATCCGCATT
ACCCCGGCGG GCCTGCGGGA GAGCCATCCC CATGACGTGA TGATCACCAA AGAAGCCCCC
AACTACCGTA TTTCCTAG
 
Protein sequence
MTTDKIIGEG LTFDDVLLVP GESEVLPREV DISSNFTRHI RLNTPLVSAA MDTVTEARTA 
ISMAREGGIG VIHKNMTIER QAREVDRVKR SEHGVITDPI SLSPDHKVRE AIALMEHYHI
SGVPITDNGK LVGIITNRDI RFEDNHERPI KEVMTKDNLV TAPVGTTLAE AMAILRAHKI
EKLPLVDADY NLKGLITIKD IEKTRRYPQA AKDERGRLRV AAAVGTSADT MTRVEALVAA
GVDAIVVDTA HGQSRSVIET VKRIKAAFPA VELVAGNVAT YDGARALAEA GFDAVKVGVG
PGSICTTRVI AGIGVPQITA VMECARAAAE FGIPVIADGG IKYSGDITKA IAAGANTVMI
GSLLAGTEES PGEIEIFQGR SFKSYRGMGS LAAMKEGSKD RYFQEEAEKL VPEGIEGRVP
YKGPLSETIF QLVGGLRAGM GYCGARNIAE LQARGRFIRI TPAGLRESHP HDVMITKEAP
NYRIS