Gene Moth_1370 details


Gene Information

Locus tag: Moth_1370
Symbol:
ID: 3832293
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1415708
End bp: 1416667
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 62%
IMG OID: 637829306
Product: CoA enzyme activase
Protein accession: YP_430226
Protein GI: 83590217
COG category: [I] Lipid transport and metabolism
COG ID: [COG1924] Activator of 2-hydroxyglutaryl-CoA dehydratase (HSP70-class ATPase domain)
TIGRFAM ID: [TIGR00241] CoA-substrate-specific enzyme activase, putative
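
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1415708
end_bp = 1416667
gene_length_bp = 960
protein_length_aa = 319

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which does not encode an amino acid.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("gene length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks hold for the values in this record.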


Plasmid Coverage information

Num covering plasmid clones: 46
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCTCTA TCTACCTGGG CATCGACGTT GGATCAGTAA GCACCAACGT CATTGCCCTG 
GATATAGACG GTAACTTACT CGCCTCCGTC TACCTTCGTA CCCGCGGCCA GCCTATCCCG
GCCATCCAGG AGGGTTTACG AGAGATTAGG GCTACCCTGG GCCGAGAGGT GACTGTCGCT
GGCGTCGGCA CAACAGGCAG CGGCCGGGGC CTGGCGGCCG TCATGACAGG GGCTGATGTC
GTCAAAAACG AAATCACCGC CCATGCCGTA GCTGCCAGCC AGGTCGTACC CGGTGTCCAA
ACGGTGCTGG AGATCGGCGG CCAGGACTCT AAAATAATCA TCCTGCGCCA GGGCGTAGTC
ACCGACTTTG CCATGAATAC CGTCTGTGCC GCCGGCACCG GTTCCTTCCT GGACCAGCAG
GCGGCCCGTT TGGGGATCCC CATCGAGAAT TTCGGTCGCC TGGCCCTGGG TGCCAAGAAC
CCGGTGCGCA TCGCTGGGCG CTGCGCCGTC TTTGCCGAAT CCGATATGAT CCATAAACAG
CAACAGGGCC ACCCCCTGGA TGATATCGTC GCCGGCCTGT GCGAGGCCCT GGTACGCAAC
TACCTGAATA ACGTCGGTAA GGGTAAGGAG ATCCTGCCGC CGGTAGTCTT CCAGGGCGGG
GTGGCGGCCA ACGCCGGTAT GCGCCAGGCC TTCAGTCGCG CCCTGGGGAC GGAGGTCATC
GTCCCGGAGC ATTATGGTGT TATGGGCGCC TACGGGGCCG CCCTCCTGGC CCGGGAAGCC
CGCCCGAAAA CGAGCGCTTT CCGGGGCTTT GAGCTTACCG AAAGGGACTT CCGGACCGGC
GGTTTTGAAT GCCGGGGGTG TGCCAATCAC TGCGAAGTGG TGGAATTAAG GGAAGGAAAA
GAGGTCCTGG CCCGCTGGGG CGACCGCTGT GGCAAGTGGA GCAATGCTGT AGCCGTCTAG
 
Protein sequence
MTSIYLGIDV GSVSTNVIAL DIDGNLLASV YLRTRGQPIP AIQEGLREIR ATLGREVTVA 
GVGTTGSGRG LAAVMTGADV VKNEITAHAV AASQVVPGVQ TVLEIGGQDS KIIILRQGVV
TDFAMNTVCA AGTGSFLDQQ AARLGIPIEN FGRLALGAKN PVRIAGRCAV FAESDMIHKQ
QQGHPLDDIV AGLCEALVRN YLNNVGKGKE ILPPVVFQGG VAANAGMRQA FSRALGTEVI
VPEHYGVMGA YGAALLAREA RPKTSAFRGF ELTERDFRTG GFECRGCANH CEVVELREGK
EVLARWGDRC GKWSNAVAV