Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1007 |
Symbol | |
ID | 3833310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1035657 |
End bp | 1036706 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637828936 |
Product | 2-hydroxyglutaryl-CoA dehydratase, D-component |
Protein accession | YP_429865 |
Protein GI | 83589856 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1775] Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000000530359 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000167288 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCCATT TAGTGGGTTG GCTGTGCCAC TATACGCCGG TAGAAATTTT TACGGCTCTC GGGTACACCC CTTACCGGCT GCTGGGCCGG GAGGGCAATA ACCCCCGCGC CGCTACATAT CTGGTGGGCA ACCTCTGCCC TTATGTCCAG AGCTGCCTCG AGGCAGCCAT CAAACAGGAG TTACCCCTCC TGGCCGGCGT CGTAATCGCC CGATCTTGCA ATGCCATGAT CCACCTGGCT AATGTCTGGC CCCATTATGG TGCAGGGGGT AAGACCGTAG TCCTCGATGT ACCCCGCCGG TTCGACGAGG ACGCAGTTTT CTATTTCAGC CAGAATCTAC GCCGTCTGGC GGCAGAACTA GCTCCCCCTG GGGAGCGACA ATTAAATGAA GAACGTTTGT GGGCGGCCAT TACCTGGTGG GAAGAACAAC GGGCAGCCTG GCGCGAGCTT TTAGCCTGCC GGGCTGCGTG GGAAGACCCT CCCGGCGGGA AAGAAATCAT GGACGGGCTG GCCAGGTGGC AAACTCCCCT TAATCCCCAG GAGGCAGATA AATGGGAGAC TGTAATTACT CGGTTAACCA GGCAACCCCG GGTGAAAACC GCCCGGCGGC CGCGCCTGCT GCTAGCGGGA AGCATCCTCC CGCGGGAATT GATTACTATG GTGGAAGAGT GCGGCGGTTT ATCTGTTTTC GAGGACAGCT GCAATGGCAT GCGGCTATTA CTGGCCCCCG CCGCCAGGGC CGAGGGTGAC CCTTACCTCT ACCTGGCCAG GCTCTACCTG GGAGGACCCC CATGTCCCCG GATGGTAGGC GACCGGGAGA AACGACGCCA GTACTGGGCC GCAGCTGTAG ACCGGTTCCG TATCCAGGGA ATCATTTACC ATGCCATGAA GTTCTGCGAC GCCGCTATGT ACGACTACAT AGCCCTGAAG GTTTTTAGTG AAAAAAAGGG GTTACCCCTT CTTCGTCTTG ATGGGGATTT CAGCGGCGGT AATCGGGGTC AATGGCAGAC GCGGCTGGAA GCCTTTTTAG AAATGCTAGG GGTGGAGTAA
|
Protein sequence | MSHLVGWLCH YTPVEIFTAL GYTPYRLLGR EGNNPRAATY LVGNLCPYVQ SCLEAAIKQE LPLLAGVVIA RSCNAMIHLA NVWPHYGAGG KTVVLDVPRR FDEDAVFYFS QNLRRLAAEL APPGERQLNE ERLWAAITWW EEQRAAWREL LACRAAWEDP PGGKEIMDGL ARWQTPLNPQ EADKWETVIT RLTRQPRVKT ARRPRLLLAG SILPRELITM VEECGGLSVF EDSCNGMRLL LAPAARAEGD PYLYLARLYL GGPPCPRMVG DREKRRQYWA AAVDRFRIQG IIYHAMKFCD AAMYDYIALK VFSEKKGLPL LRLDGDFSGG NRGQWQTRLE AFLEMLGVE
|
| |