Gene Moth_0831 details


Gene Information

Locus tag: Moth_0831
Symbol:
ID: 3831528
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 861378
End bp: 862712
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 58%
IMG OID: 637828761
Product: radical SAM family protein
Protein accession: YP_429691
Protein GI: 83589682
COG category: [C] Energy production and conversion
COG ID: [COG1032] Fe-S oxidoreductase
TIGRFAM ID:
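
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a minimal sketch like the following. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(861378, 862712)
print(length_bp)                  # 1335
print(protein_length(length_bp))  # 444
```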


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.00311604
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.515432
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATCG CCTTGATTGC TCCGGCCTGG CACGACCCCC TCTGGGAAAG TGAAAAGGAA 
AAGTCTATCT TCCCGCCCCT GAATCTCATC ACCCTGGCAG CCATGACGCC ACCCCAGCAT
GAGGTGACTA TCCTGGATGA GAGTTTAACC GATCTCGACT TCAATGAGAA GTACGACCTG
GTCGGCATCT CGGCCATGAC AGCCCTGGCA CCTCGCGCCT ACGAGATCGC CGATGCCTTC
CGGGAAAGGG GTACCATGGT GGTCCTGGGT GGTATGCACC CCTCCGCCCT GCCGGAGGAA
GCCATCGCCC ACGCCGATGC CGTCGTGGTC GGCGAAGCCG AGGGTTCATG GCAACGGCTC
CTGACCGACC TGGAAAACGG CCAGTTGCAA GCCTTTTACC GCCAGGAAAA GCGCCCTTCC
CTGGAACATA TGGTTATCCC CCGCCGGGAC CTCTTACAAA GGAGTCGCTA CCTGGTTCCC
GACACCGTCC AGACTACCCG TGGCTGTCCC TTCGCCTGCT CCTTCTGCTC CGTCAGCCAG
TTCTTCGGCC ACAGCTACCG TTTTCGTCCG GTAGAAGAAG TCATCAGCGA AGTCCGGGAC
CTGGAGGGCG AGGTAATTGC CTTTATTGAC GACAATATTG TCGGTAATCC CGCCTACGCC
CGCCGCCTCT TCACCGAGCT GGCCCGCTTA CCGCGCAAAG TAAAATGGTT TAGCCAGGGG
TCCTTAAATA TCGCCCGGGA CGAGGAATTA TTGCGGCTCG CCGCCGCGAG CGGCTGCATC
GGTCTTTTTA TCGGCTTTGA ATCCCTTTCC CCTGCCAACC TCAAGGCCGT CGGCAAAAGG
GTAAACCTGG TGGATGATTA CCGGCAGGCC ATTAAGAAGC TTCATGACCA CGGCATTGCC
ATTGAAGGCG CCTTTGTCTT CGGCCTTGAC GAGGATGACG AAAGCGTCTT TGAACGCACC
GTCAAATTCG CCCAGGAAAA TCGCCTGGAA GCCGCCCAAT TCGGCATCCT GACCCCCTTC
CCGGGAACCC CCTTAAGGGA GGCCCTGGAA CGTGAGGGGC GCATCACCAA TAATGACTGG
AGCGAGTATA CCATCAGCAA GGTAGTCTTT GAACCGAAAA ACATGAGCGC CCGGACCCTC
CAGGAAGGCT TTAACTGGGC CTGGCAGGAA TTCTACTCCC TGGGTTCCAT CTCCCGTCGC
CTGGGGTTGG CCAAGAAGCA CGCCGCCATC CTCTGGGCCC TGAATCTGAA CATCCGCAAG
CGGTTTAACC ATTTTATGGA AAGACTCCGG GCGGGGAACC TCGGCCTACC CCAGCCCTCC
CTGGCCAGGC AATGA
 
Protein sequence
MKIALIAPAW HDPLWESEKE KSIFPPLNLI TLAAMTPPQH EVTILDESLT DLDFNEKYDL 
VGISAMTALA PRAYEIADAF RERGTMVVLG GMHPSALPEE AIAHADAVVV GEAEGSWQRL
LTDLENGQLQ AFYRQEKRPS LEHMVIPRRD LLQRSRYLVP DTVQTTRGCP FACSFCSVSQ
FFGHSYRFRP VEEVISEVRD LEGEVIAFID DNIVGNPAYA RRLFTELARL PRKVKWFSQG
SLNIARDEEL LRLAAASGCI GLFIGFESLS PANLKAVGKR VNLVDDYRQA IKKLHDHGIA
IEGAFVFGLD EDDESVFERT VKFAQENRLE AAQFGILTPF PGTPLREALE REGRITNNDW
SEYTISKVVF EPKNMSARTL QEGFNWAWQE FYSLGSISRR LGLAKKHAAI LWALNLNIRK
RFNHFMERLR AGNLGLPQPS LARQ
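
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one possible way to do that, not the method used to generate this record. It builds the codon table from the standard NCBI base ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# NCBI ordering: codons enumerated over T, C, A, G with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGAAAATCG CCTTGATTGC TCCGGCCTGG"))  # MKIALIAPAW
print(translate("CTGGCCAGGC AATGA"))                  # LARQ
```

Passing the full 1335-base gene sequence to `translate` in the same way should yield the complete protein sequence listed above.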