Gene Moth_0554 details


Gene Information

Locus tag: Moth_0554
Symbol:
ID: 3831454
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 576789
End bp: 578669
Gene Length: 1881 bp
Protein Length: 626 aa
Translation table: 11
GC content: 61%
IMG OID: 637828495
Product: radical SAM family protein
Protein accession: YP_429427
Protein GI: 83589418
COG category: [C] Energy production and conversion
COG ID: [COG1032] Fe-S oxidoreductase
TIGRFAM ID:
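
The coordinate and length fields in this record agree with each other, which a short check can confirm. The sketch below assumes 1-based inclusive coordinates and that the final codon is a stop codon that does not count toward the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 576789
end_bp = 578669

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1881
print(protein_length)  # 626
```

Both results match the Gene Length (1881 bp) and Protein Length (626 aa) fields.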


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCAGATA CTGTTATTAT GAAGTTGTTA AGCCAGGTCA CCAAACCAGC CCGCTACCTG 
GGTACGGAAT GGAACGCCGT TCATAAGGAC TGGGAGGAGA ACCCTGCCCG CATGGTTTTT
ATCTACCCGG ACCTTTACGA AGTGGGCATG TCCCACCTGG GGTTGGCCAT CCTTTACGGT
GCGGTCAATG AACAACCGGG AATGCTTATG GAGCGGGCCT TTGCCCCCGG ACCGGATATG
GAAGCCCTGT TGCGGGAGAA CCACATCCCC CTTTTCAGCC TGGAATCCCA TAGGCCCCTG
GCTGATTTTG ATGTTATCGG CTTCACTCTA CAGTATGAAA TGAGTTATAC TACAATTCTA
AATATCCTCG ACCTGGCCGG GATCCCCGTG CTGGCAACAG AACGCGGGGC CGGCTATCCT
TTAATAATTG GCGGGGGGCC GGGGGTGGCC AATCCAGAGC CGGTTGCCCC CTTTTTTGAT
TGTTTCCTCC TGGGGGACGG CGAGGAGGCC CTGCCGGAGT TCCTGAACCT GGTGGGCCAG
TTAAAAAAGG AGGGACTGTG GGACCATCGC CGTGAGGTCC TGGACCGGAT CGCGTCTCTG
CCGGGTTTTT ACGTACCCTC CTTTTATGAT GTGACCTATA ATAGCGACGG TACCGTGGCC
GCCGTTGTAC CCAACCGGCC CGGCATACCG GAACGGGTGT CGAAAAGGGT CCTCCCGGAC
CTGGACGAGG CTTACTTTCC TACCCGGCCA ATTGTCCCCT TCCTGGAGGT CGTTCACGAC
CGCATTATGC TGGAGGTCAT GCGGGGCTGC ACCCACGGCT GCCGCTTCTG CCAGGCCGGG
GCCATCTACC GCCCGGTGCG GGAAAGGGAC CTGGCCGTCC TCCTGCGCCA GGCCGAAGAG
CTGGTGCGTC ATACCGGCCA TGAAGAAATA TCCCTCACTT CCCTGAGTAC CGCAGATTAC
TCCCGGGTGG AGGAACTGGC CCGCGCCCTG GTCGCAGCCT ATGGGGACCT GGGGGTCAGC
GTCTCCCTGC CTTCCCTGCG GGTGGATGCC TTTAGTGTCC GCCTGGCGGA CGCCGTCCAG
CAGGTGCGTA AAAGCTCCCT TACCTTTGCC CCTGAGGCCG GCAGCCAGCG CCTGCGGGAT
GTGATTAACA AGGGGGTTAC CGAGGACGAT ATCCTGACGG CCACGGCCGA GGCCTTCCGG
GCCGGCTGGC AGGCCATCAA ACTCTACTTT ATGCTGGGAC TGCCGACCGA GGGGGAGGAG
GACCTCCAGG GCATTGCCGG CCTGGCCCGG CGGATTCTCT ACCAGGGCCG TGAGCTCGCT
CCGGGAAAGA AACCGACGGT AACAGTCAGC GTTTCCTCCT TCGTACCCAA GGCCTGGACG
GCCTTTCAAT GGGAACCTCA GGACCAGGTA GCGGTACTCA AGGAAAAGCA GCAGCTCCTG
CGAAACCATA TTAAGGGGCC GGGTCTGCGT TTTAACTGGC ACGACGCCGA AATAAGTTAT
ATCGAGGCCG TTCTCGCCCG GGGGGACCGG CGCCTGGCCG CGGCCATTAT GGCCGCCTGG
CGTCGCGGGG CCAGGCTGGA GGGCTGGTCG GAATACTTCA GCTATGCCTG CTGGGAAAGG
GCTTTCCAGG AAACAGGCCT TGATCCTGCC TTTTACGCCA ACCGGGAACG CCGGGAGGAG
GAGGTCTTCC CCTGGGATCA CCTGGATTTC GGCGTCAGTA AAACCTTTTT GCTCCGGGAA
CGCCGGCGGG CGCGGGCCGG CCAGTTGACG GCTGATTGCC GCTCGGGCCG TTGTACCGGC
TGTGGCGTTT GCCCGGGCCT GGGGGTTGAC CTGCGCCTGA AGGGAGGCCC GGAGGATCGT
GCGCCTGCGG ATAAAGTATA G
 
Protein sequence
MPDTVIMKLL SQVTKPARYL GTEWNAVHKD WEENPARMVF IYPDLYEVGM SHLGLAILYG 
AVNEQPGMLM ERAFAPGPDM EALLRENHIP LFSLESHRPL ADFDVIGFTL QYEMSYTTIL
NILDLAGIPV LATERGAGYP LIIGGGPGVA NPEPVAPFFD CFLLGDGEEA LPEFLNLVGQ
LKKEGLWDHR REVLDRIASL PGFYVPSFYD VTYNSDGTVA AVVPNRPGIP ERVSKRVLPD
LDEAYFPTRP IVPFLEVVHD RIMLEVMRGC THGCRFCQAG AIYRPVRERD LAVLLRQAEE
LVRHTGHEEI SLTSLSTADY SRVEELARAL VAAYGDLGVS VSLPSLRVDA FSVRLADAVQ
QVRKSSLTFA PEAGSQRLRD VINKGVTEDD ILTATAEAFR AGWQAIKLYF MLGLPTEGEE
DLQGIAGLAR RILYQGRELA PGKKPTVTVS VSSFVPKAWT AFQWEPQDQV AVLKEKQQLL
RNHIKGPGLR FNWHDAEISY IEAVLARGDR RLAAAIMAAW RRGARLEGWS EYFSYACWER
AFQETGLDPA FYANRERREE EVFPWDHLDF GVSKTFLLRE RRRARAGQLT ADCRSGRCTG
CGVCPGLGVD LRLKGGPEDR APADKV
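
The gene sequence starts with TTG rather than ATG, yet the protein sequence starts with M. This is consistent with translation table 11 (listed under Gene Information), in which TTG is one of the permitted initiation codons and an initiating codon is read as methionine. The following is a minimal sketch of that rule, assuming the NCBI table 11 codon assignments and start-codon set; the function name `translate_table11` is hypothetical, and only the first and last few codons of the record are used as input.

```python
from itertools import product

# NCBI translation table 11 amino-acid string, in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons allowed by table 11 (assumed from the NCBI definition).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna):
    """Translate a coding sequence; an initiating codon is read as M."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in this record.
first_30 = "TTGCCAGATA CTGTTATTAT GAAGTTGTTA"
last_21 = "GCGCCTGCGG ATAAAGTATA G"

print(translate_table11(first_30))  # MPDTVIMKLL
print(translate_table11(last_21))   # APADKV
```

The two outputs match the first ten and last six residues of the protein sequence shown above.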