Gene Moth_1247 details


Gene Information

Locus tag: Moth_1247
Symbol:
ID: 3833042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1288069
End bp: 1289136
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 61%
IMG OID: 637829183
Product: radical SAM family protein
Protein accession: YP_430104
Protein GI: 83590095
COG category: [R] General function prediction only
COG ID: [COG0535] Predicted Fe-S oxidoreductases
TIGRFAM ID:
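
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop else codons


length = gene_length_bp(1288069, 1289136)
print(length)                     # 1068
print(protein_length_aa(length))  # 355
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields of the record.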


Plasmid Coverage information

Num covering plasmid clones: 46
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.421544
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTTGTCT GGAACTGCAC GCGGGACTGC AACCTCAAGT GCCGGCATTG TTATGCCGGT 
GCCGGGAGCG GGGTGGCCGG GGACGAAATG ACGACACCGG AGGCCAGGGA CTTCCTGGAC
CAGCTGGTAG CTTTCCGGGT GCCGGTCCTC CTCTTATCAG GGGGCGAGCC CCTGGTACGG
CCGGATATCT TTGACCTGAT GGCCACCGCC GTCAAGGGGG GACTGCGGGT CACCCTTTCC
ACCAACGGCA CCCTTATTGA TCGCAGCACC GCCCGGGAAC TGAAAAAAAT CGGCATCAGC
TATGTGGGTA TCAGCCTGGA TGGCATTGAG TCCAAACATG ACGCCTTCCG GGGCGTGAAG
GGGGCCTTCC AGGCAACCCT GGAAGGCATC CGCAACTGCC TGGCGGTAGA CCAGCGGGTG
GGCTTGCGCT TTACCATCAG CCGGGCCAAT GTCGACCAGC TGGAGGAAAT TTTTTACCTG
ATCCGGGAAG AGAACATTCC CCGGGCCTGT TTCTACCACC TGGTTTACAG CGGCCGGGGC
AGTGAACTGG CCGTTGAAGA CCTGAATCAT GAAGAAAGCC GGGCGGTTAT GGATTTTCTG
ATTACAGCCG CCAGGCGCCT GAAAAAGCAG GGCCGGGAAG TCGAGATTTT AACGGTGGAC
AATCATGCCG ACGGAATCTA CCTCTACCTG AAATTAATCC GGGAAGACCC GGAACGGGCG
GTGGCCGTCC GGGAGCTATT GCGCCTGAAT GGCGGTAACC GCAGCGGTAT CGCCATCGGC
GCCGTCGACT GGGCCGGTGC CGTCCATCCG GACCAGTTCA CCATGCACCA CATCCTGGGG
AACGTCCGGG AACGCCCCTT CGGCGAGATA TGGACGGATC TCAGCAACCC CCTGCTGAAG
GGCCTGCGGG ACCGCAAACC CCTGTTGAAG GGTCGCTGCC GTACCTGCGC CTGGCTGGAC
TTGTGCAATG GCAACTGCCG CGCCCGGGCG GAAAGCGTCA CCGGCGACTT CTGGGAATCC
GACCCAGCCT GTTATTTGAC GGACGGGGAA ATATCAGATA GGAGGTAG
 
Protein sequence
MVVWNCTRDC NLKCRHCYAG AGSGVAGDEM TTPEARDFLD QLVAFRVPVL LLSGGEPLVR 
PDIFDLMATA VKGGLRVTLS TNGTLIDRST ARELKKIGIS YVGISLDGIE SKHDAFRGVK
GAFQATLEGI RNCLAVDQRV GLRFTISRAN VDQLEEIFYL IREENIPRAC FYHLVYSGRG
SELAVEDLNH EESRAVMDFL ITAARRLKKQ GREVEILTVD NHADGIYLYL KLIREDPERA
VAVRELLRLN GGNRSGIAIG AVDWAGAVHP DQFTMHHILG NVRERPFGEI WTDLSNPLLK
GLRDRKPLLK GRCRTCAWLD LCNGNCRARA ESVTGDFWES DPACYLTDGE ISDRR
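
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted alternative start codon and is read as methionine in the initiator position. The following is a minimal sketch of that rule, not the tool used to produce this record. The `translate` function and its `is_start` parameter are hypothetical names, and the codon table is built by hand from the standard genetic code, whose amino acid assignments table 11 shares.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11 uses the same assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Start codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str, is_start: bool = True) -> str:
    """Translate a CDS (or a fragment of one), stopping at the first stop codon.

    Spaces and newlines are ignored, so the 10-base blocks used in this
    record can be pasted directly. If is_start is True, a first codon found
    in START_CODONS is translated as M.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if pos == 0 and is_start and codon in START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("GTGGTTGTCT GGAACTGCAC GCGGGACTGC"))  # MVVWNCTRDC
```

The output matches the first block of the protein sequence listed above. Passing `is_start=False` translates an internal fragment without the initiator substitution.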