Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2052 |
Symbol | |
ID | 3831198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2143773 |
End bp | 2145068 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829981 |
Product | adenylosuccinate lyase |
Protein accession | YP_430891 |
Protein GI | 83590882 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0015] Adenylosuccinate lyase |
TIGRFAM ID | [TIGR00928] adenylosuccinate lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0154529 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.110523 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGAGC GCTATACCCT GCCGGAAATG GGTAAAATTT GGGAACCGGA GCACAAGTTC CGCACCTGGC TGGCCATTGA GATCTATGCC TGCGAGGCCT GGGCGGAGCT CGGCCGGATT CCGCCGGCCG CCCTGGAAGA GATCAAGGCC AGGGCCGATT TCGACATTGA CCGGATCAAT GAAATTGAAG CCACCACCCG CCATGACGTC CTGGCTTTCC TCACTGCCGT TGCGGAGAAG GTCGGTGATG CCTCCAAGTA TATTCATCTT GGTATGACAT CCTCAGACGT CCTGGATACG GCCCTGGCCG TACAGATGCG GGACGCCGCC GACCTGCTCC TTAAGCGCCT CCGGGATCTG CGGGCCGAAC TGGTGAAAAA GGCCATGGAG CATAAATATA CTTTGATGAT CGGCCGCACC CACGGCGTCC ACGCCGAGCC CACCACTTTT GGCCTCAAGA TGGCCCTGTG GGTTATGGAG GTTGATCGGC ACCTCATCCG GCTGGAACAG GCTAAAGAGA TGATCAGCGT CGGCAAAATC TCCGGCGCCG TGGGTACTTT CGCCAATATC AACCCCCGGG TGGAGGAGCA CGTCTGCCGT CGCCTGGGGC TCAAGCCCGC CAGCGTCTCC ACCCAGATAA TCCAACGGGA CCGTCACGCC CAGTTCCTGG CCACCCTGGC CATCATCGGC AGCTCCCTGG AGAAAATGGC CACAGAGATC CGCAACCTCC AGCGCACCGA CATCCTGGAG GTGGAGGAGC CCTTCGCTAA AGGTCAAAAA GGGTCTTCGG CTATGCCCCA CAAACGAAAC CCCATTATAT CGGAACGGGT GGCCGGCCTT TCGCGAGTGT TGCGGGGCAA CGCCCTGGCC GCCATGGAGG ACATCGCCCT CTGGCACGAG CGGGATCTCA CCCATTCCTC CGTTGAGCGG GTGATTATCC CCGACAGCAC CATCCTGCTG GACTATATGC TGGTAAAGTT TACCGGGATT ATCGCCGGCC TTAACGTCTA CCCGGAGAAC ATGCGCCGGA ATCTCGAGGC CACCCACGGC CTGGTCTTTT CCCAGCGGGT CCTCCTGGCC CTGGTGAATA AGGGCGTCCT GCGGGAAACG GCCTACGCCT GGGTGCAGCG CAACGCCCTC AAGGCCTGGC AGACCCGCCA GCCATTTAAA GAACTGGTCC TTAAGGACCA GGATATCATG TCTCGCCTGG ATCCGAAGGA AGTAGAGGCC CTCTTTGATT ACGACTACCA CCTGCGCCAC GTAGATTATA TCTTCCGCCG CGCCGGTTTG GAGTAG
|
Protein sequence | MIERYTLPEM GKIWEPEHKF RTWLAIEIYA CEAWAELGRI PPAALEEIKA RADFDIDRIN EIEATTRHDV LAFLTAVAEK VGDASKYIHL GMTSSDVLDT ALAVQMRDAA DLLLKRLRDL RAELVKKAME HKYTLMIGRT HGVHAEPTTF GLKMALWVME VDRHLIRLEQ AKEMISVGKI SGAVGTFANI NPRVEEHVCR RLGLKPASVS TQIIQRDRHA QFLATLAIIG SSLEKMATEI RNLQRTDILE VEEPFAKGQK GSSAMPHKRN PIISERVAGL SRVLRGNALA AMEDIALWHE RDLTHSSVER VIIPDSTILL DYMLVKFTGI IAGLNVYPEN MRRNLEATHG LVFSQRVLLA LVNKGVLRET AYAWVQRNAL KAWQTRQPFK ELVLKDQDIM SRLDPKEVEA LFDYDYHLRH VDYIFRRAGL E
|
| |