Gene Information |
Locus tag | Moth_0811 |
Symbol | |
ID | 3831602 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 840859 |
End bp | 842841 |
Gene Length | 1983 bp |
Protein Length | 659 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637828742 |
Product | heterodisulfide reductase subunit |
Protein accession | YP_429672 |
Protein GI | 148283119 |
COG category | [C] Energy production and conversion |
COG ID | [COG1148] Heterodisulfide reductase, subunit A and related polyferredoxins |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0000746765 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACGCA TTGGCGTCTT TATCTGCCAC TGCGGCACCA ATATCGCCTC TGTGGTCGAT GTTAAGAAAG TGGCCGCAAC GGCCGGGAAT TTTCCCGGGG TTGTCTACGC CACTGATTAC CAGTATATGT GCTCCGACCC GGGGCAGGAA CTCATTCGTA AGGCCATCAA GGAACAGCGT CTGGACCGGG TAGTTGTGGC CTCCTGCTCG CCCCGCCTCC ACGAGCCGAC CTTCCGGAAG ACGGTGGAGA GTGCCGGTAT CAATCCCTAT CTTTTTGAAA TGGCCAATAT CCGCGAGCAG TGCGCCTGGG TCCACGCCCG GGATAAAGAG CGGGCTACCG CCAAGGCCAT CGACCTGGTG CGGGCCGCCG TGGCCAAGGT GAGCCGTGAC GGTCCCCTCC AGGCGGCTAC TATCCCCATT ACCAAGCGGG CCCTGGTCAT CGGCGCCGGT ATTGCCGGTA TGCAGGCCGC CCTGGATATT GCCGATGCCG GCTACGAGGT GGTACTGTTA GACCGGGAAC CGACCATCGG CGGCAACATG GTCAAGCTGG ACAAGACCTT CCCGACCCTG GACTGTTCTG CTTGAATAAG CACGCCGAAA ATGGTTGCTG CGGCGCAGCA CCCTCATATT AAACTCATGA CCTATGCGGA AGTAGAGAAT ATCGCCGGCT ATGTAGGCAA TTTCGAAGTG ACCATTCGCC AGAAGGCCCG CTCGGTGGAC GCCGGCAAGT GTACCGGCTG CGGTACCTGC TGGGAGAAAT GCCCGACCAG GGTCGACAGC GAGTTCGACC TGGGCCTGGG TAAGCGTAAG GCCATTTACC TGCCCTTCCC CCAGGCGGTG CCGGCAGTGC CGGTAATCGA CCGGGAGCAC TGCCGCCAGT TCACCAAAGG CAAGTGCGGC GTCTGCCAGA AGGTCTGCCC GGCCAAAGCC ATTGACTATG AGCAACAGGA TGAGGTTATA ACCGAGAACT TCGGCGCCAT TGTCGTGGCC ACCGGCTATG ACCTCTTTAA ATGGGAAGAG GTTTACGGTG AATACGGTTA CGGCAAGTAC CCGGATGTCA TCACCGGTAT GCACTTCGAA CGCTTGAACA ACGCCTCCGG TCCCACCGGC GGTAAGATCC TGAGGCCCTC CGACGGTAAG GAACCCAAGA CGGTGGTCTT TATTAAGTGC GTAGGTTCCC GGGACGAGGC CAAGGGCAAG AGCTACTGCT CCCGGGCCTG CTGTATGTAT ACGGCCAAGC ACGCCCACCA GGTGCTGGAG AAGATTCCCG ACTCCCAGGC AATTGTCTTC TATATGGATA TCCGCACCCC GGGCAAAGCC TACGAGGAAT TCCAGCAGCG CTCCGTCCAC GAGGGGGCCA TTTACGTCCG CGGCCGGGTG AGCCGGGTCT TCCAGGAAGG CGACAAGCTC ATTGTCCGCG GCGAGGATAC CTTGCTTGGC CGCCAGGTAG AGGTTGCCGC CGACATGGTG GTTCTGGCTA CGGCCATGGT CCCCAGCCAT GGCTGGGAGA AAGTGGCCAA GATGATCGGC CTGCAGACCG ATAAAGACGG CTTCTTCCAG GAGGCCCACC CGAAACTGCG GCCGGTGGAG ACCTTTACAG CCGGTGTCTT CCTGGCCGGG GCCTGCCAGG GGCCCAAGGA TATCCCCGAC ACCGTCTCCC AGGCCAGCGC GGCGGCCGTC AAGGTCTGCC AGCTCTTTGC GAAGGATGAG ATGGCCACCG ATCCCATGAT CGCCGCCGTA GATGAAAGCA TTTGCTCCGG CTGTGCCATG TGTGAGAAGA TTTGTCCCTA CAAGGCCATT 
TCCATCAAGA CCATTACGGA ACGGGTGGCC GGCAGGCAGG TCAGCCGCCG GGTGGCGTCG GTGAACAACG GCCTGTGCCA GGGCTGCGGA ACCTGCTCGG TTGCCTGCCC GTCCAGCGCC ATGAATCTAC GCGGCTTTAC CAACGAACAA ATACTGGCGG AGGTGGATGC GGTATGTCTG TAG
|
Protein sequence | MKRIGVFICH CGTNIASVVD VKKVAATAGN FPGVVYATDY QYMCSDPGQE LIRKAIKEQR LDRVVVASCS PRLHEPTFRK TVESAGINPY LFEMANIREQ CAWVHARDKE RATAKAIDLV RAAVAKVSRD GPLQAATIPI TKRALVIGAG IAGMQAALDI ADAGYEVVLL DREPTIGGNM VKLDKTFPTL DCSAISTPKM VAAAQHPHIK LMTYAEVENI AGYVGNFEVT IRQKARSVDA GKCTGCGTCW EKCPTRVDSE FDLGLGKRKA IYLPFPQAVP AVPVIDREHC RQFTKGKCGV CQKVCPAKAI DYEQQDEVIT ENFGAIVVAT GYDLFKWEEV YGEYGYGKYP DVITGMHFER LNNASGPTGG KILRPSDGKE PKTVVFIKCV GSRDEAKGKS YCSRACCMYT AKHAHQVLEK IPDSQAIVFY MDIRTPGKAY EEFQQRSVHE GAIYVRGRVS RVFQEGDKLI VRGEDTLLGR QVEVAADMVV LATAMVPSHG WEKVAKMIGL QTDKDGFFQE AHPKLRPVET FTAGVFLAGA CQGPKDIPDT VSQASAAAVK VCQLFAKDEM ATDPMIAAVD ESICSGCAMC EKICPYKAIS IKTITERVAG RQVSRRVASV NNGLCQGCGT CSVACPSSAM NLRGFTNEQI LAEVDAVCL