Gene Moth_0811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0811 
Symbol 
ID3831602 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp840859 
End bp842841 
Gene Length1983 bp 
Protein Length659 aa 
Translation table11 
GC content60% 
IMG OID637828742 
Productheterodisulfide reductase subunit 
Protein accessionYP_429672 
Protein GI148283119 
COG category[C] Energy production and conversion 
COG ID[COG1148] Heterodisulfide reductase, subunit A and related polyferredoxins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000746765 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACGCA TTGGCGTCTT TATCTGCCAC TGCGGCACCA ATATCGCCTC TGTGGTCGAT 
GTTAAGAAAG TGGCCGCAAC GGCCGGGAAT TTTCCCGGGG TTGTCTACGC CACTGATTAC
CAGTATATGT GCTCCGACCC GGGGCAGGAA CTCATTCGTA AGGCCATCAA GGAACAGCGT
CTGGACCGGG TAGTTGTGGC CTCCTGCTCG CCCCGCCTCC ACGAGCCGAC CTTCCGGAAG
ACGGTGGAGA GTGCCGGTAT CAATCCCTAT CTTTTTGAAA TGGCCAATAT CCGCGAGCAG
TGCGCCTGGG TCCACGCCCG GGATAAAGAG CGGGCTACCG CCAAGGCCAT CGACCTGGTG
CGGGCCGCCG TGGCCAAGGT GAGCCGTGAC GGTCCCCTCC AGGCGGCTAC TATCCCCATT
ACCAAGCGGG CCCTGGTCAT CGGCGCCGGT ATTGCCGGTA TGCAGGCCGC CCTGGATATT
GCCGATGCCG GCTACGAGGT GGTACTGTTA GACCGGGAAC CGACCATCGG CGGCAACATG
GTCAAGCTGG ACAAGACCTT CCCGACCCTG GACTGTTCTG CTTGAATAAG CACGCCGAAA
ATGGTTGCTG CGGCGCAGCA CCCTCATATT AAACTCATGA CCTATGCGGA AGTAGAGAAT
ATCGCCGGCT ATGTAGGCAA TTTCGAAGTG ACCATTCGCC AGAAGGCCCG CTCGGTGGAC
GCCGGCAAGT GTACCGGCTG CGGTACCTGC TGGGAGAAAT GCCCGACCAG GGTCGACAGC
GAGTTCGACC TGGGCCTGGG TAAGCGTAAG GCCATTTACC TGCCCTTCCC CCAGGCGGTG
CCGGCAGTGC CGGTAATCGA CCGGGAGCAC TGCCGCCAGT TCACCAAAGG CAAGTGCGGC
GTCTGCCAGA AGGTCTGCCC GGCCAAAGCC ATTGACTATG AGCAACAGGA TGAGGTTATA
ACCGAGAACT TCGGCGCCAT TGTCGTGGCC ACCGGCTATG ACCTCTTTAA ATGGGAAGAG
GTTTACGGTG AATACGGTTA CGGCAAGTAC CCGGATGTCA TCACCGGTAT GCACTTCGAA
CGCTTGAACA ACGCCTCCGG TCCCACCGGC GGTAAGATCC TGAGGCCCTC CGACGGTAAG
GAACCCAAGA CGGTGGTCTT TATTAAGTGC GTAGGTTCCC GGGACGAGGC CAAGGGCAAG
AGCTACTGCT CCCGGGCCTG CTGTATGTAT ACGGCCAAGC ACGCCCACCA GGTGCTGGAG
AAGATTCCCG ACTCCCAGGC AATTGTCTTC TATATGGATA TCCGCACCCC GGGCAAAGCC
TACGAGGAAT TCCAGCAGCG CTCCGTCCAC GAGGGGGCCA TTTACGTCCG CGGCCGGGTG
AGCCGGGTCT TCCAGGAAGG CGACAAGCTC ATTGTCCGCG GCGAGGATAC CTTGCTTGGC
CGCCAGGTAG AGGTTGCCGC CGACATGGTG GTTCTGGCTA CGGCCATGGT CCCCAGCCAT
GGCTGGGAGA AAGTGGCCAA GATGATCGGC CTGCAGACCG ATAAAGACGG CTTCTTCCAG
GAGGCCCACC CGAAACTGCG GCCGGTGGAG ACCTTTACAG CCGGTGTCTT CCTGGCCGGG
GCCTGCCAGG GGCCCAAGGA TATCCCCGAC ACCGTCTCCC AGGCCAGCGC GGCGGCCGTC
AAGGTCTGCC AGCTCTTTGC GAAGGATGAG ATGGCCACCG ATCCCATGAT CGCCGCCGTA
GATGAAAGCA TTTGCTCCGG CTGTGCCATG TGTGAGAAGA TTTGTCCCTA CAAGGCCATT
TCCATCAAGA CCATTACGGA ACGGGTGGCC GGCAGGCAGG TCAGCCGCCG GGTGGCGTCG
GTGAACAACG GCCTGTGCCA GGGCTGCGGA ACCTGCTCGG TTGCCTGCCC GTCCAGCGCC
ATGAATCTAC GCGGCTTTAC CAACGAACAA ATACTGGCGG AGGTGGATGC GGTATGTCTG
TAG
 
Protein sequence
MKRIGVFICH CGTNIASVVD VKKVAATAGN FPGVVYATDY QYMCSDPGQE LIRKAIKEQR 
LDRVVVASCS PRLHEPTFRK TVESAGINPY LFEMANIREQ CAWVHARDKE RATAKAIDLV
RAAVAKVSRD GPLQAATIPI TKRALVIGAG IAGMQAALDI ADAGYEVVLL DREPTIGGNM
VKLDKTFPTL DCSAISTPKM VAAAQHPHIK LMTYAEVENI AGYVGNFEVT IRQKARSVDA
GKCTGCGTCW EKCPTRVDSE FDLGLGKRKA IYLPFPQAVP AVPVIDREHC RQFTKGKCGV
CQKVCPAKAI DYEQQDEVIT ENFGAIVVAT GYDLFKWEEV YGEYGYGKYP DVITGMHFER
LNNASGPTGG KILRPSDGKE PKTVVFIKCV GSRDEAKGKS YCSRACCMYT AKHAHQVLEK
IPDSQAIVFY MDIRTPGKAY EEFQQRSVHE GAIYVRGRVS RVFQEGDKLI VRGEDTLLGR
QVEVAADMVV LATAMVPSHG WEKVAKMIGL QTDKDGFFQE AHPKLRPVET FTAGVFLAGA
CQGPKDIPDT VSQASAAAVK VCQLFAKDEM ATDPMIAAVD ESICSGCAMC EKICPYKAIS
IKTITERVAG RQVSRRVASV NNGLCQGCGT CSVACPSSAM NLRGFTNEQI LAEVDAVCL