Gene Moth_0906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0906 
Symbol 
ID3831294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp941110 
End bp942174 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content61% 
IMG OID637828837 
Producthypothetical protein 
Protein accessionYP_429766 
Protein GI83589757 
COG category[R] General function prediction only 
COG ID[COG0820] Predicted Fe-S-cluster redox enzyme 
TIGRFAM ID[TIGR00048] radical SAM enzyme, Cfr family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000883258 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACCA GAATTGACTT GCGGGGGCTG TTGCCCCAAG AATTGGAGGA GCTGGCAGTT 
CGGCTGGGGG AGGCGCCCTA CCGTGGCCGG CAGATCTTTC GCTGGTTGCA CGCCCGTCGG
GCGAAAGGAA TAGAGGTTAT GTCCGATTTG CCCCGGGCTT TCCGGGAGCG TCTGGCGTTA
GTAGCCGAAC TACCTCCGGT AAGGGTTCTG AACCGCCTGG TGGCGGCTGA CGGCCTGACG
CGCAAGTTGC TCCTGGGCCT GGGTGACGGT AATAGCATCG AATGTGTTCT CATGATTTAC
AAAGACGGCC GCCGCAGGAA TACCGCCTGC CTGTCCAGCC AGGTGGGTTG CGCCATGGGA
TGCAGTTTTT GCGCCACCGG TCAGGGCGGC CTCCAGCGTA ACCTGACCGC CAGTGAGATT
ATCCTCCAGG CCCTGGCCCT GGGGGCGGAA CTGGCGGAGG GGGAAGGGGG GAACCGGATC
AGCAATATCG TCTTTATGGG TATGGGGGAA CCACTCAATA ACTATGAGGC CGTCATGAAA
GGGGTACGTA TTTTCGAAGA TCCTTCGGGA TGGGGCATCA GCCACAGGCG GATTACCCTG
TCCACCTGCG GCATTGTTCC CGGCATCGAG CGACTGGCCA GGGAAAAACC GCCCCTGGAG
CTGGCTGTTT CCCTGCATGC GGTCACTAAC GAACTGCGGG ATAAGCTGAT GCCCATCAAC
AGGCGTTACC CCCTGGAAGA GCTTATCCCG GCCTGCCGCC GTTATGCTGA AATAACCGGG
CGGCGGGTTA CCTTCGAGTA TGCCCTGATA GCCGGGGTCA ACGACCGTCG GGAGGATGCC
CGGGGTTTAA GCAGGCTTCT CCGGGATATG CTGGCCTTCG TAAACATAAT CCCCCTGAAC
CCGGTGGCCG GGAGCGGGTT CAAAGGGGTA CCCCCGGCGG CAGCCAGGGC TTTTGTTGCG
CTGTTGCAGG AGGCGGGGCT GGAGGCAGCC ATCCGTGATA GCCGGGGACA GGATATCGCC
GCTGCTTGTG GCCAGTTACG TTTCGCGTCC AGGGAGGTGT TATAA
 
Protein sequence
MTTRIDLRGL LPQELEELAV RLGEAPYRGR QIFRWLHARR AKGIEVMSDL PRAFRERLAL 
VAELPPVRVL NRLVAADGLT RKLLLGLGDG NSIECVLMIY KDGRRRNTAC LSSQVGCAMG
CSFCATGQGG LQRNLTASEI ILQALALGAE LAEGEGGNRI SNIVFMGMGE PLNNYEAVMK
GVRIFEDPSG WGISHRRITL STCGIVPGIE RLAREKPPLE LAVSLHAVTN ELRDKLMPIN
RRYPLEELIP ACRRYAEITG RRVTFEYALI AGVNDRREDA RGLSRLLRDM LAFVNIIPLN
PVAGSGFKGV PPAAARAFVA LLQEAGLEAA IRDSRGQDIA AACGQLRFAS REVL