Gene Information |
Locus tag | Moth_1500 |
Symbol | |
ID | 3831727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1545678 |
End bp | 1546847 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829432 |
Product | phosphopentomutase |
Protein accession | YP_430352 |
Protein GI | 83590343 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1015] Phosphopentomutase |
TIGRFAM ID | [TIGR01696] phosphopentomutase |
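
The coordinates above agree with the reported lengths, assuming inclusive 1-based positions and a single stop codon that counts toward the gene length but not the protein length. A minimal Python sketch of that arithmetic follows; the function name `cds_lengths` is hypothetical and not part of the source database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are inclusive and 1-based, and the CDS ends
    with exactly one stop codon that is not part of the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


# Start bp and End bp as listed for Moth_1500
print(cds_lengths(1545678, 1546847))  # (1170, 389)
```

The result matches the listed Gene Length (1170 bp) and Protein Length (389 aa).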
|
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTAG ATCGGGTCAT CATCATCGTT CTTGATAGCG TTGGCGTCGG GGCGCTGCCT GACGCTGCCC AATACGGGGA CGAAGGCAGT AACACCCTGG CTCATATTGC CGCTACTGTA GACCTCAGGC TACCCAATCT TACCCGCCTG GGGGTGGGTA ACATCACTCC CCTGCGGGGT ATTCCCCCGG TAGGCACTCC GGCTGCCGCC TGGGGTAAGA TGGCTTCCCA AACGGCCGGC AAGGACACCA CCAGCGGGCA CTGGGAACTG GCCGGGTTGA TTTTAGAACG ACCCTTTCCC CTCTACCCCC ATGGCTTCCC GCCGGAGATA ATTGAGCCTT TTGAAAAGGC CATCGGCCGC CGGGTCCTGG GTAATAAACC GGCTTCGGGT ACGGTAATAA TCGAGGAACT TGGAGCCGAA CATATGCGCA CCGGTAATCC CATTGTCTAC ACCTCGGCCG ATAGCGTCTT CCAGATTGCC GCCCATGAAG AGGTCATTCC GGTGGAAGAG CTCTACCGCT ATTGTAAGAT TGCCCGGCGG CTGTTGACCG GGGAACATGC CGTCGGCCGG GTAATTGCCA GGCCCTTTGT CGGGGAGCCG GGGCATTTTA TCCGTACCGA CCGGCGTCAG GACTTTTCCC TGGAGCCGCC CCGGCCAACC CTGCTGGACG CCGTCATCGC CGCCGGCCTG GAGGTAATGG CCGTAGGTAA AATAAAGGAT ATTTTTGCGG GCAGGGGTAT CAGCCGCTGG ATCCACACCC ACGACAACAT GGACGGCGTC GACCAGACCC GCAACTTTAT GCGCGAGGGC GAGCGGGGCC TCATTTTTAC CAATCTGGTG GACTTTGATA TGCGTTACGG CCACCGCAAC GACGTGGCCG GTTATGCCGC TGCCCTGGAG GCCTTTGACC GGCGCCTGCC GGAGCTCCTG GACGCCCTGG AAACCAGCGA CGCCCTGATC CTGACCGCCG ATCATGGCTG TGACCCGACC ACCCCGAGTA CCGATCACTC CCGGGAATAT GTGCCCCTCT TAATTTACGG GAAACGCATC CGGCCACTGA ATATCGGTGT TCGCCCGACC TTCGCCGACC TGGGCGCCAC GGTGGCCGAC CTTTTGGGCG TGCCTTATGA CCTGCCCGGG AAAAGCTTTG CTTCCATGTT GCTGGAGTAA
|
Protein sequence | MKLDRVIIIV LDSVGVGALP DAAQYGDEGS NTLAHIAATV DLRLPNLTRL GVGNITPLRG IPPVGTPAAA WGKMASQTAG KDTTSGHWEL AGLILERPFP LYPHGFPPEI IEPFEKAIGR RVLGNKPASG TVIIEELGAE HMRTGNPIVY TSADSVFQIA AHEEVIPVEE LYRYCKIARR LLTGEHAVGR VIARPFVGEP GHFIRTDRRQ DFSLEPPRPT LLDAVIAAGL EVMAVGKIKD IFAGRGISRW IHTHDNMDGV DQTRNFMREG ERGLIFTNLV DFDMRYGHRN DVAGYAAALE AFDRRLPELL DALETSDALI LTADHGCDPT TPSTDHSREY VPLLIYGKRI RPLNIGVRPT FADLGATVAD LLGVPYDLPG KSFASMLLE
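
The gene sequence as listed begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. Under that assumption, the protein can be reproduced by translating codon by codon. The sketch below is illustrative only: `translate` is a hypothetical helper, and it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in the set of permitted start codons).

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...)
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Spaces (such as the 10-base grouping used above) are ignored.
    """
    seq = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above
print(translate("ATGAAGTTAG ATCGGGTCAT CATCATCGTT"))  # MKLDRVIIIV
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence should yield the full 389-residue protein if the assumptions hold.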