Gene Moth_1938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1938 
SymbolaksA 
ID3832430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2012878 
End bp2014029 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content58% 
IMG OID637829869 
Producttrans-homoaconitate synthase 
Protein accessionYP_430779 
Protein GI83590770 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR02660] homocitrate synthase NifV 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.00360416 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000147263 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGGCGGAC AGATTAAAAT CGTCGATACC ACCCTCCGGG ACGGCGAACA AACAGCCGGG 
GTGGTCTTTG CCAACAGTGA AAAACTGCGC ATTGCCAAGA TGCTGGACGC TATCGGTGTC
GATCAAATTG AAGCCGGCGT ACCGGTCATG GGCGGTGATG AGAAAAAGGT CATCAAGGAG
ATTGTCGATG CCGGTTTGCG GGCGAGTATC ATGGGCTGGA ACCGGGCTGT CATTGACGAT
GTCCGGCATT CCATCGATTG TGGCTGCGAT GCCGTAGCCA TCTCCATTTC TACTTCTGAT
ATCCATATCG AACACAAACT GCGCAGCACC AGGGAAAAGG TCATTGAGTC CATGTCCCGG
GCCTGCGAGT TCGCCAAGAA GCACAACCTC TACGTCTCCG TCAATGCCGA GGACGCCAGC
CGCACCGATC CCGATTACCT ATTACAGTTT GCTCGGGCTG CCCGGGAGGC CGGCGCCGAC
CGCCTGCGCT TCTGCGATAC GGTGGGCATA ATGGATCCCT TCGGCACTTA TGAAAAAATA
AAATGGTTGA TCGAAGAAGT GGGCCTGGAT GTAGAGATGC ACATGCACAA CGACTTTGGC
ATGGCAACGG CCAATACCCT GGCCGGCATC CGCGCCGGGG CCAAGTACGC CGGGGTGACG
GTGGTGGGCC TGGGCGAACG GGCCGGCAAC GCCGCCCTGG AGGAAGTGGT CATGGCCTTA
AAATATCTGG AAAATATTGA CTTAAAGTTT AAAACCGAGC AGTTCCGCGA GCTGGCCGAG
TACGTTTCCC TGGCGGCACG CCGGCAACTA CCGGCCTGGA AAGCCGTTGT GGGCAGCAAT
ATGTTTGCCC ACGAGTCCGG TATTCATGCC GACGGCGCCC TGAAGGACCC GCGCACCTAC
GAAGTAATGA CGCCGGAAGA GGTCGGCCTG GAACGGCAGA TTGTCATCGG CAAGCACTCG
GGCACGGCGG CCATTAAGGC CAAGTTTGCC GAGTACGGCG TGCACCTGGA AGAAGCAGAC
GCCGCAGCTA TCCTCGCCAG GGTGCGTGCC CTGGCGGTGG AACTCAAGCG CCCCCTCTTT
GATAAGGAAC TGGCCCACAT CTATGAGGAT TACCAGGAAG CCCGGAAAAA GGCGGCCGTG
ACCGCCGTAT AG
 
Protein sequence
MGGQIKIVDT TLRDGEQTAG VVFANSEKLR IAKMLDAIGV DQIEAGVPVM GGDEKKVIKE 
IVDAGLRASI MGWNRAVIDD VRHSIDCGCD AVAISISTSD IHIEHKLRST REKVIESMSR
ACEFAKKHNL YVSVNAEDAS RTDPDYLLQF ARAAREAGAD RLRFCDTVGI MDPFGTYEKI
KWLIEEVGLD VEMHMHNDFG MATANTLAGI RAGAKYAGVT VVGLGERAGN AALEEVVMAL
KYLENIDLKF KTEQFRELAE YVSLAARRQL PAWKAVVGSN MFAHESGIHA DGALKDPRTY
EVMTPEEVGL ERQIVIGKHS GTAAIKAKFA EYGVHLEEAD AAAILARVRA LAVELKRPLF
DKELAHIYED YQEARKKAAV TAV