Gene Information |
Locus tag | Moth_1067 |
Symbol | |
ID | 3833332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1096999 |
End bp | 1098240 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637828995 |
Product | aspartate kinase I |
Protein accession | YP_429924 |
Protein GI | 83589915 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0527] Aspartokinases |
TIGRFAM ID | [TIGR00656] aspartate kinase, monofunctional class; [TIGR00657] aspartate kinase |
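The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. This assumes 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are illustrative, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(1096999, 1098240)
    print(length, protein_length_aa(length))  # prints: 1242 413
```

Both values match the Gene Length (1242 bp) and Protein Length (413 aa) fields in the record.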
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0350826 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0457717 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGGTCC TGGTCCAAAA GTTCGGTGGT ACGTCGGTAG CCAGTCCCGA GCAACGACTG GTGGTAACCG GGCATATCGA AAGGGCCTGC CGGGTGGGTT ACCAGGTGGT AGTAGTCGTT TCGGCTATGG GGCGCCGGGG CGCGCCCTAT GCTACCGATA CCCTCCTTGA ACTGCTGGGG GATAACGAGG TTGAGCCCCG GGAGCGGGAT CTCCTCCTGG CTTGTGGCGA GGTTATTTCC GGGGTGGTCC TGACCGGGCT CCTTAAAAGT AAGGATCTGC CGGCAGTTTT CCTGACGGGA GGCCAGGCCG GCATCATTAC CGACGCCCAG TTTGGGGATG CCCGTATTCT TAGGGTTGAA CCGCGCCGGA TTCAATCCTA CCTTGACCAG GGCCGGGTAG TGGTGGTAGC CGGTTTCCAG GGAGTCACTG AATCCGGGGA AGTAACCACT CTAGGCCGTG GTGGCAGCGA CACCACGGCG GTGGCCCTGG GAGTGGCCTT GGGTGCGGAA GCAGTTGAAA TTTTTACCGA TGTGGATGGG GTTAAAACTG CCGACCCACA TATTGTCAGC GATGCCAGGA CCCTGAGCAC CATCACCTAC AATGAGGTTT GTCAGATGGC CTATGAAGGG GCGAAGGTCA TCCACCCCCG AGCCGTAGAA ATAGCCCGGC AGAAGAATAT TCCCTTACGG ATCAAGTCAA CCTTTAATGA CGGCCCTGGT ACCCTGGTGG TAGCCTGGCA ACCGGGGGTC ACCGGCGTCC ATATCAGCCG GGACCGGGTC ATTACCGGCA TTACCCACAT GGACGGTCTG ACCCAGTTGC GGGTTTCCCT CCCCTCAGGG GAGGGAGCCG GGGAGGTTTT CCCGCTGCTG GCTCAAAATA ATATCAGCGT GGACTTTATC AATATCTTTC CCGGGGAACT GGTCTTTACC GTTAAAAGCG AGGTTGCCCG GCAGGCCCGG GAACTGATTG AAGGGCTGGG CCTGAAGGTG ACTGCCCGCC CCGGTTGCGC CAAGGTGGCC ACGGTAGGGG CCGGTATGCG CGGCGTACCC GGGGTCATGG CTACCATTGT TACGGCCCTG GAGCGGGAGG GCATTAAAAT TCTCCAGTCA GCAGATTCCT ATACTTCTAT CTGGTGCCTG GTGGACAGGA AGGATATGGA ACGGGCCGTA CAAACCCTTC ATCGGGAGTT TAAACTTAAC GACGGTAAAA CAGGCGAGGT GAAAGTTTAT GCAGTGGGGT AG
|
Protein sequence | MKVLVQKFGG TSVASPEQRL VVTGHIERAC RVGYQVVVVV SAMGRRGAPY ATDTLLELLG DNEVEPRERD LLLACGEVIS GVVLTGLLKS KDLPAVFLTG GQAGIITDAQ FGDARILRVE PRRIQSYLDQ GRVVVVAGFQ GVTESGEVTT LGRGGSDTTA VALGVALGAE AVEIFTDVDG VKTADPHIVS DARTLSTITY NEVCQMAYEG AKVIHPRAVE IARQKNIPLR IKSTFNDGPG TLVVAWQPGV TGVHISRDRV ITGITHMDGL TQLRVSLPSG EGAGEVFPLL AQNNISVDFI NIFPGELVFT VKSEVARQAR ELIEGLGLKV TARPGCAKVA TVGAGMRGVP GVMATIVTAL EREGIKILQS ADSYTSIWCL VDRKDMERAV QTLHREFKLN DGKTGEVKVY AVG
|
| |
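The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of how one might strip the spacing, compute GC content, and check for a terminal stop codon. The helper names are hypothetical, and the stop codon set assumed here is the standard TAA/TAG/TGA.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumed stop codons


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons and ends in a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a small demonstration.
    fragment = clean_sequence("GTGAAGGTCC TGGTCCAAAA")
    print(fragment, gc_fraction(fragment))
```

Applying `clean_sequence` to the full Gene sequence field and comparing `len(...)` with the Gene Length field is a quick way to confirm that nothing was lost when copying the record.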