Gene Moth_0368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0368 
Symbol 
ID3832724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp373185 
End bp374132 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content58% 
IMG OID637828303 
Productradical SAM family protein 
Protein accessionYP_429245 
Protein GI83589236 
COG category[R] General function prediction only 
COG ID[COG1313] Uncharacterized Fe-S protein PflX, homolog of pyruvate formate lyase activating proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000000743683 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.888767 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAATAGCC GCACGGAACA CGAAAAGGAG CGAAATAATA GCCCGGGGGT GGCGGTCGAA 
GGGTTTCTTG CTACCCTTAC CAGCTGTCGG CTCTGCCCCA GGGCCTGCGG GGTTAATCGT
CTGGCCGGGG AAAAGGGTTT TTGCCGGGCT GGTGTCCGGC CCCGGGTGGC CCTGGCAACC
CTGCATCACT GGGAAGAGCC CTGCCTCAGC GGCAGCCGGG GTTCGGGGGC AGTATTCTTT
TCATACTGCA ACCTGCGCTG CGCCTTTTGC CAGAACTACC GCATCAGCTG GCAGGGCCGG
GGTAGGGAGA TGGAAATCGA AGACCTTACG GCACTTTTTC TGGATCTCCA GGCGCAGGGA
GCCCATAATA TCAACCTGGT TTCGGCGACT CCCTATACCC CCATGCTCGT CCCGGCCCTA
AGGGAGGCCA AGAAGCGAGG CCTGAAGATA CCGGTGGTTT ATAATTGTAA TGCTTATGAG
AGCTTGGAGT CCCTGCGTTC CCTGGCGGGG CTGGTGGATA TCTACCTGCC GGACCTTAAA
TATATCGATG ATGGGCCGGC CCGGCGTTAC AGTCAGGCCC CAAATTACTT TTCTGTGGCC
ACAGCAGCCA TCCTGGAGAT GCAGCGCCAG GTAGGGATTT TAACCTTTGA TGCAGCCGGA
CTGGCCCGTC GGGGGCTCCT GATCCGGCAC CTGGTCTTGC CAGGCCAGGC AGAGGCCGCT
TGCCGGGTAT TGGAGTGGGT GAAGGCCAAC CTGCCCCGGG AGACCTACCT GAGTATCATG
GCCCAGTATG TACCCGTATG GCAAGCGCAA CGCTATCCCG AGATCAATCG CCGCCTGACT
GCTGCTGAAT ACGAAAGCGT TCTGGAATAT TTCGATGCCC TGGGTTTGGA AAACGGCTTT
TGCCAGGAAC TGGATGCAGC CACAACAGAC TATATTCCCA ATTTTTAA
 
Protein sequence
MNSRTEHEKE RNNSPGVAVE GFLATLTSCR LCPRACGVNR LAGEKGFCRA GVRPRVALAT 
LHHWEEPCLS GSRGSGAVFF SYCNLRCAFC QNYRISWQGR GREMEIEDLT ALFLDLQAQG
AHNINLVSAT PYTPMLVPAL REAKKRGLKI PVVYNCNAYE SLESLRSLAG LVDIYLPDLK
YIDDGPARRY SQAPNYFSVA TAAILEMQRQ VGILTFDAAG LARRGLLIRH LVLPGQAEAA
CRVLEWVKAN LPRETYLSIM AQYVPVWQAQ RYPEINRRLT AAEYESVLEY FDALGLENGF
CQELDAATTD YIPNF