Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1006 |
Symbol | |
ID | 3833309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1034392 |
End bp | 1035657 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637828935 |
Product | 2-hydroxyglutaryl-CoA dehydratase, D-component |
Protein accession | YP_429864 |
Protein GI | 83589855 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1775] Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0695891 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000177823 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATATCT ACCAGCTCTT GCGCTCTCAC CTGGGAAACG CCTTGCGCCA GCGCGCCTTT AAATCCCCCT GGACATATAG ATTATTGCGG CAGGCCGTAG CAGCCCAAAA GAAGCATTAT ACCTGGCAGG CGGCCCACCG GCTGCTTCTA AATACCATCG ACCAGGCAGC AGCAGCCTTT CTCCAAAAGG GTCCGGTGGT GTGGCACAAT GTTTTCTTCC CCACTGAGAT CCTTTATGGC CTGGGAGTAA TTCCCTTTGC CCCGGAAGCA GCCGCCGTGG TGGCAACCGG TCTGGGGATA GCCAGGGGGG CCTTGCGGCG GGCTGAAGGT GACTGGGTCA GCGGCGAGGC CTGCTCCTTC CACCGCCTGG CCCACGGTTG CGACCGGGAA GGTTACTTCC CGCCGCCAGG AGCTGTAGTC TGTAGCTCGC ATCTGTGCGA CACGGCCCCC CAGTCCCTGG AGACGACGGC GGCCTACCAC GGGGTGCCCT TTTACCTCTT GGACGTACCC CATCGGCAGG ACGCGGAAGC CCTGAACTAT GTCGCCCGCC AGCTTAAGGG CATAACCTTT TCCCTGGTAG AATCCTTACG CCTGGTATGG GACGAGGACC GTTTCCAGGA GGCCATAGTT AACTCCAACG CGGCGCGGGA ACGCCTGCTG GCCGTCAACC GCTTGCGCCA GCAACGCCCG GCCTGCATCC GGGGGGAAGA AGCCCACGGT TTCATTTACC CCATGCTGGC TGGTTTTGGT GCGGCAACCT CAGTGGAGGT CTACGGCCAA CTGGCGGATG AATTGGACCG GCGTAACCGG GAACAGCGCC GGGCCGTACC GGAAGAAAAG GCCCGCTTGC TATGGTTGCA CCTGCGGCCC TATTATCCTA ACGCCATCTT CCAGTTGCTA GAACGGGAGG CCGGGGCGGT GGTTGTTTTT GAAGAAATGA GTCATGTTTA TTGGGAGCCG CTGGACCCGG AAAAGCCTTT TTATAGCCTG GCGCGTAAGG TTTTAAGCCA CCACGGCCTG GCTCCCATGG CCAGGAGGGT AGAGGCCATC CTGGCCATGG TCGACGCCTA CCAGGCCGAC GGGATTATCC ATTTCGCCCA CTGGGGTTGC CGCCAGAGCA CCGCCGGTTT ACGCTTGTTG CAGGACGCTC TGCGCGAAAG GGGAATACCA TTCTTAAACC TGGAAGGCGA TTGTGTCGAT CAAAGTAAGT ACGCCCCCGG CGCTACCAGA ACGCGCCTGG AAGGTTTCCT GGAAATGCTA TTATAA
|
Protein sequence | MDIYQLLRSH LGNALRQRAF KSPWTYRLLR QAVAAQKKHY TWQAAHRLLL NTIDQAAAAF LQKGPVVWHN VFFPTEILYG LGVIPFAPEA AAVVATGLGI ARGALRRAEG DWVSGEACSF HRLAHGCDRE GYFPPPGAVV CSSHLCDTAP QSLETTAAYH GVPFYLLDVP HRQDAEALNY VARQLKGITF SLVESLRLVW DEDRFQEAIV NSNAARERLL AVNRLRQQRP ACIRGEEAHG FIYPMLAGFG AATSVEVYGQ LADELDRRNR EQRRAVPEEK ARLLWLHLRP YYPNAIFQLL EREAGAVVVF EEMSHVYWEP LDPEKPFYSL ARKVLSHHGL APMARRVEAI LAMVDAYQAD GIIHFAHWGC RQSTAGLRLL QDALRERGIP FLNLEGDCVD QSKYAPGATR TRLEGFLEML L
|
| |