Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0150 |
Symbol | |
ID | 3832380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 144256 |
End bp | 145338 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637828083 |
Product | shikimate/quinate 5-dehydrogenase |
Protein accession | YP_429031 |
Protein GI | 83589022 |
COG category | [R] General function prediction only |
COG ID | [COG5322] Predicted dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000000131066 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCATAAAT TCGCCTTCAT GATCCACCCC CTGGACATCC ACGACGTTAC CCGTAAATTC CCTGTTGCCC GCCACCTGCC TCCGGCCCTG CTGGAGAAAG CGGTACGCTA TCTGCCGCCC ATCAAGGCTT CCCATATTAC CGGCGTCCGC TCCGCCTACG CTGAAACCGA GGGGTGGTTT GTGGCCTGCT CCCTGACAAG CCGCCAGATG CTCTCTCTGC CCCAGGATCT GGTCATCAAA AAGCTAATCC GTACCGGTCG CTTGGCCGAA AAACTGGGAG CCGAGATCCT GGGCCTGGGG GCCATGACCT CCGTAGTGGG CGATGCCGGC ATCACCATTG CCCGCCACCT GAACATAGCC GTCACCACCG GCAACAGCTA CACGGTAGCC ACTGCCCTCG AAGCCACCGC CAAGGCTGCT GCAATGATGG ACATCGACCT GACCCGGGCC GAGATAGCGA TTATGGGGGC CACGGGCTCC ATTGGCGCCG TCTGTGCCCG GATTCTGGCC CGCAACTGCC GGCACCTGAC CCTGATTGCC CGTAATGAAG AAAAACTGGC CCGCCTGGCG CATCAGATTA AAGAAGAAAC CGGTCTTAAG GCCCGGGTAA CCAATCATTC CCGGGAAGCC CTGCGCCGGG CCGATGTCAT TATTACCGTC ACCTCGGCGG TAGATACCGT GATTGAACCC GAGGACCTGA AACCAGGTGC CGTAGTTTGC GACGTCGCCC GGCCCCGGGA TGTCTCGCGG CGGGTAGCCG AGGTGCGCGA CGACGTCCTG GTTATCGACG GCGGTGTCGT CCAGGTCCCC GGGGATGTCG ACTTCCATTT TAACTTTGGC TATCCCCCGG GCCTCTCCTA CGCCTGTATG GCCGAAACCA TGATCCTGGC CCTGGAGGGC AGGATTGAAA ACTTTACCCT GGGCCGGGAG TTGACGGTAG AACAAATCGA CACCATTAAC CGGCTGGCCG CCAAGCACGG CTTTCAGGTT GCCGGCTTCC GCAGCTTTGA ACTACCTGTT TCCGAGGAGC AGGTGGCGGC CATCAGGGAG CGAGCACGGC AGCGGGCCGC CCTGGCCCGT TAA
|
Protein sequence | MHKFAFMIHP LDIHDVTRKF PVARHLPPAL LEKAVRYLPP IKASHITGVR SAYAETEGWF VACSLTSRQM LSLPQDLVIK KLIRTGRLAE KLGAEILGLG AMTSVVGDAG ITIARHLNIA VTTGNSYTVA TALEATAKAA AMMDIDLTRA EIAIMGATGS IGAVCARILA RNCRHLTLIA RNEEKLARLA HQIKEETGLK ARVTNHSREA LRRADVIITV TSAVDTVIEP EDLKPGAVVC DVARPRDVSR RVAEVRDDVL VIDGGVVQVP GDVDFHFNFG YPPGLSYACM AETMILALEG RIENFTLGRE LTVEQIDTIN RLAAKHGFQV AGFRSFELPV SEEQVAAIRE RARQRAALAR
|
| |