Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0633 |
Symbol | |
ID | 3832531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 657533 |
End bp | 659110 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637828575 |
Product | phosphoenolpyruvate carboxykinase |
Protein accession | YP_429505 |
Protein GI | 83589496 |
COG category | [C] Energy production and conversion |
COG ID | [COG1866] Phosphoenolpyruvate carboxykinase (ATP) |
TIGRFAM ID | [TIGR00224] phosphoenolpyruvate carboxykinase (ATP) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000000533993 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAATA CTTATGGTTT AGAGCGATTG GGCATCATTA ATCCTGGCAC TATTTATCGT AACCTGCCGA TGGCCCGGCT GGTTGAAATA GCCCTGGCCC GGGGAGAAGG CCTCCTAGCT TCCAATGGGG CTTTAAGCGT TAATACCGGC AAGTACACCG GCCGTTCCCC CCACGACAGG TATATCGTGG ACACTCCCGC CGTTCACGAC AGCATCAGTT GGGGTGCCGT CAACCAGCCC GTGAGCGAGG CGACCTTTGA ACGCCTCTAC AGTCGCCTGA CCGCCTACCT CCAGGGAAAG GATCTCTTCG TTTTTGACGG CTTTGTCGGG GCCGATCCTG CCTACCGCAT GCCTATCAGG ATTGTCAATG AGTATGCCTG GCAGAACCTG TTTGTCCACC AGCTTTTTAT CAGGCCGACG GCGGAAGAAC TGGCCGGGCA TGAGCCCCGG TTTACCGTTA TCTGCGCTCC GGGCTTCAAG GCCATTCCGG AGGAAGACGG TACCCGTTCC GAAGCCTTTA TTATTTTAAA CTTTGACCGG CGGCTGGTGA TCATCGGCGG TACCTCCTAT GCCGGCGAGA TGAAAAAATC CATTTTTACC GTCATGAATT ATTTGTTGCC AGAGCAAGGT GTTTGCCCCA TGCACTGCTC GGCTAACATG GGCCCGGCAG GCGATACGGC CCTGTTCTTC GGTCTTTCCG GTACCGGCAA GACTACCCTG TCGGCCGATC CGGAACGCTA CCTTATTGGC GACGATGAGC ATGGATGGTC GGACAAGGGC ATTTTTAACT TTGAAGGCGG TTGTTATGCT AAGTGCATCA AGCTCTCCGC CGAGCATGAA CCCCAGATCT GGAATGCCAT CCGTTTCGGC AGCGTCCTGG AGAATGTGAT GGTAGACCCC GATTGCCGAA TCATTGACTA CGACAGCGAT GCCCTGACGG AAAACACCCG CGCTGCCTAC CCGGTAGATT TTATCCCTAA CGCCGTCATC CCCGGGGTGG GTGGCCATCC CCAGACGGTG GTTTTTCTCA CCGCTGACGC CTTTGGCGTT ATGCCGCCGA TAGCCAAACT CACCCGGGAA CAGGCCATGT ACTATTTCCT GTCCGGTTAT ACCAGCAAGC TAGCCGGTAC CGAGCGGGGG GTTACCGAGC CCAAGGCGAC TTTCTCGACT TGTTTCGGGG CACCCTTCCT GCCTCGGTCG CCCATGGTTT ACGCCAACCT CCTGGGGGAA AGGATAGCCA GGCATAACGC CAGCGTTTAC CTGGTCAATA CCGGCTGGAC AGGGGGGCCC TATGGCACTG GCCGGCGTAT GAGCCTGCCC TATACTCGGG CCATGGTCAG GGCGGCTTTA AACGGTGAAC TGGATAAGGT GGAATTTACC CCCGACCCTG TTTTCGGCTT CCTGGTACCT AAAGCCTGCC CCGGAGTCCC GGCTGAAATT CTCAATCCAC GCAACACCTG GGCAGAAACG GAAAAATATG ATGCCATGGC TCGCAAGCTA GCCAGCCTCT TCAGGGAGAA CTTTGCCAAA TTTAAGGACG TACCGGTCAG CATCCAGGAG GCCGGAGTGG TTGGTTGA
|
Protein sequence | MSNTYGLERL GIINPGTIYR NLPMARLVEI ALARGEGLLA SNGALSVNTG KYTGRSPHDR YIVDTPAVHD SISWGAVNQP VSEATFERLY SRLTAYLQGK DLFVFDGFVG ADPAYRMPIR IVNEYAWQNL FVHQLFIRPT AEELAGHEPR FTVICAPGFK AIPEEDGTRS EAFIILNFDR RLVIIGGTSY AGEMKKSIFT VMNYLLPEQG VCPMHCSANM GPAGDTALFF GLSGTGKTTL SADPERYLIG DDEHGWSDKG IFNFEGGCYA KCIKLSAEHE PQIWNAIRFG SVLENVMVDP DCRIIDYDSD ALTENTRAAY PVDFIPNAVI PGVGGHPQTV VFLTADAFGV MPPIAKLTRE QAMYYFLSGY TSKLAGTERG VTEPKATFST CFGAPFLPRS PMVYANLLGE RIARHNASVY LVNTGWTGGP YGTGRRMSLP YTRAMVRAAL NGELDKVEFT PDPVFGFLVP KACPGVPAEI LNPRNTWAET EKYDAMARKL ASLFRENFAK FKDVPVSIQE AGVVG
|
| |