Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1938 |
Symbol | aksA |
ID | 3832430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2012878 |
End bp | 2014029 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637829869 |
Product | trans-homoaconitate synthase |
Protein accession | YP_430779 |
Protein GI | 83590770 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR02660] homocitrate synthase NifV |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00360416 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000000147263 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGGGCGGAC AGATTAAAAT CGTCGATACC ACCCTCCGGG ACGGCGAACA AACAGCCGGG GTGGTCTTTG CCAACAGTGA AAAACTGCGC ATTGCCAAGA TGCTGGACGC TATCGGTGTC GATCAAATTG AAGCCGGCGT ACCGGTCATG GGCGGTGATG AGAAAAAGGT CATCAAGGAG ATTGTCGATG CCGGTTTGCG GGCGAGTATC ATGGGCTGGA ACCGGGCTGT CATTGACGAT GTCCGGCATT CCATCGATTG TGGCTGCGAT GCCGTAGCCA TCTCCATTTC TACTTCTGAT ATCCATATCG AACACAAACT GCGCAGCACC AGGGAAAAGG TCATTGAGTC CATGTCCCGG GCCTGCGAGT TCGCCAAGAA GCACAACCTC TACGTCTCCG TCAATGCCGA GGACGCCAGC CGCACCGATC CCGATTACCT ATTACAGTTT GCTCGGGCTG CCCGGGAGGC CGGCGCCGAC CGCCTGCGCT TCTGCGATAC GGTGGGCATA ATGGATCCCT TCGGCACTTA TGAAAAAATA AAATGGTTGA TCGAAGAAGT GGGCCTGGAT GTAGAGATGC ACATGCACAA CGACTTTGGC ATGGCAACGG CCAATACCCT GGCCGGCATC CGCGCCGGGG CCAAGTACGC CGGGGTGACG GTGGTGGGCC TGGGCGAACG GGCCGGCAAC GCCGCCCTGG AGGAAGTGGT CATGGCCTTA AAATATCTGG AAAATATTGA CTTAAAGTTT AAAACCGAGC AGTTCCGCGA GCTGGCCGAG TACGTTTCCC TGGCGGCACG CCGGCAACTA CCGGCCTGGA AAGCCGTTGT GGGCAGCAAT ATGTTTGCCC ACGAGTCCGG TATTCATGCC GACGGCGCCC TGAAGGACCC GCGCACCTAC GAAGTAATGA CGCCGGAAGA GGTCGGCCTG GAACGGCAGA TTGTCATCGG CAAGCACTCG GGCACGGCGG CCATTAAGGC CAAGTTTGCC GAGTACGGCG TGCACCTGGA AGAAGCAGAC GCCGCAGCTA TCCTCGCCAG GGTGCGTGCC CTGGCGGTGG AACTCAAGCG CCCCCTCTTT GATAAGGAAC TGGCCCACAT CTATGAGGAT TACCAGGAAG CCCGGAAAAA GGCGGCCGTG ACCGCCGTAT AG
|
Protein sequence | MGGQIKIVDT TLRDGEQTAG VVFANSEKLR IAKMLDAIGV DQIEAGVPVM GGDEKKVIKE IVDAGLRASI MGWNRAVIDD VRHSIDCGCD AVAISISTSD IHIEHKLRST REKVIESMSR ACEFAKKHNL YVSVNAEDAS RTDPDYLLQF ARAAREAGAD RLRFCDTVGI MDPFGTYEKI KWLIEEVGLD VEMHMHNDFG MATANTLAGI RAGAKYAGVT VVGLGERAGN AALEEVVMAL KYLENIDLKF KTEQFRELAE YVSLAARRQL PAWKAVVGSN MFAHESGIHA DGALKDPRTY EVMTPEEVGL ERQIVIGKHS GTAAIKAKFA EYGVHLEEAD AAAILARVRA LAVELKRPLF DKELAHIYED YQEARKKAAV TAV
|
| |