Gene Moth_1006 details


Gene Information

Locus tag: Moth_1006
Symbol:
ID: 3833309
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1034392
End bp: 1035657
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 60%
IMG OID: 637828935
Product: 2-hydroxyglutaryl-CoA dehydratase, D-component
Protein accession: YP_429864
Protein GI: 83589855
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1775] Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB
TIGRFAM ID:
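
The coordinate and length fields in the Gene Information table above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The helper names span_length and coding_residues are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 1034392
END_BP = 1035657
GENE_LENGTH_BP = 1266
PROTEIN_LENGTH_AA = 421


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def coding_residues(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks should agree with the table if the assumptions hold.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_residues(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the fields agree: the coordinate span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.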


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0695891
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000177823
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGATATCT ACCAGCTCTT GCGCTCTCAC CTGGGAAACG CCTTGCGCCA GCGCGCCTTT 
AAATCCCCCT GGACATATAG ATTATTGCGG CAGGCCGTAG CAGCCCAAAA GAAGCATTAT
ACCTGGCAGG CGGCCCACCG GCTGCTTCTA AATACCATCG ACCAGGCAGC AGCAGCCTTT
CTCCAAAAGG GTCCGGTGGT GTGGCACAAT GTTTTCTTCC CCACTGAGAT CCTTTATGGC
CTGGGAGTAA TTCCCTTTGC CCCGGAAGCA GCCGCCGTGG TGGCAACCGG TCTGGGGATA
GCCAGGGGGG CCTTGCGGCG GGCTGAAGGT GACTGGGTCA GCGGCGAGGC CTGCTCCTTC
CACCGCCTGG CCCACGGTTG CGACCGGGAA GGTTACTTCC CGCCGCCAGG AGCTGTAGTC
TGTAGCTCGC ATCTGTGCGA CACGGCCCCC CAGTCCCTGG AGACGACGGC GGCCTACCAC
GGGGTGCCCT TTTACCTCTT GGACGTACCC CATCGGCAGG ACGCGGAAGC CCTGAACTAT
GTCGCCCGCC AGCTTAAGGG CATAACCTTT TCCCTGGTAG AATCCTTACG CCTGGTATGG
GACGAGGACC GTTTCCAGGA GGCCATAGTT AACTCCAACG CGGCGCGGGA ACGCCTGCTG
GCCGTCAACC GCTTGCGCCA GCAACGCCCG GCCTGCATCC GGGGGGAAGA AGCCCACGGT
TTCATTTACC CCATGCTGGC TGGTTTTGGT GCGGCAACCT CAGTGGAGGT CTACGGCCAA
CTGGCGGATG AATTGGACCG GCGTAACCGG GAACAGCGCC GGGCCGTACC GGAAGAAAAG
GCCCGCTTGC TATGGTTGCA CCTGCGGCCC TATTATCCTA ACGCCATCTT CCAGTTGCTA
GAACGGGAGG CCGGGGCGGT GGTTGTTTTT GAAGAAATGA GTCATGTTTA TTGGGAGCCG
CTGGACCCGG AAAAGCCTTT TTATAGCCTG GCGCGTAAGG TTTTAAGCCA CCACGGCCTG
GCTCCCATGG CCAGGAGGGT AGAGGCCATC CTGGCCATGG TCGACGCCTA CCAGGCCGAC
GGGATTATCC ATTTCGCCCA CTGGGGTTGC CGCCAGAGCA CCGCCGGTTT ACGCTTGTTG
CAGGACGCTC TGCGCGAAAG GGGAATACCA TTCTTAAACC TGGAAGGCGA TTGTGTCGAT
CAAAGTAAGT ACGCCCCCGG CGCTACCAGA ACGCGCCTGG AAGGTTTCCT GGAAATGCTA
TTATAA
 
Protein sequence
MDIYQLLRSH LGNALRQRAF KSPWTYRLLR QAVAAQKKHY TWQAAHRLLL NTIDQAAAAF 
LQKGPVVWHN VFFPTEILYG LGVIPFAPEA AAVVATGLGI ARGALRRAEG DWVSGEACSF
HRLAHGCDRE GYFPPPGAVV CSSHLCDTAP QSLETTAAYH GVPFYLLDVP HRQDAEALNY
VARQLKGITF SLVESLRLVW DEDRFQEAIV NSNAARERLL AVNRLRQQRP ACIRGEEAHG
FIYPMLAGFG AATSVEVYGQ LADELDRRNR EQRRAVPEEK ARLLWLHLRP YYPNAIFQLL
EREAGAVVVF EEMSHVYWEP LDPEKPFYSL ARKVLSHHGL APMARRVEAI LAMVDAYQAD
GIIHFAHWGC RQSTAGLRLL QDALRERGIP FLNLEGDCVD QSKYAPGATR TRLEGFLEML
L