Gene Information |
Locus tag | Moth_1558 |
Symbol | |
ID | 3832191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1601250 |
End bp | 1602137 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637829490 |
Product | shikimate dehydrogenase |
Protein accession | YP_430410 |
Protein GI | 83590401 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0169] Shikimate 5-dehydrogenase |
TIGRFAM ID | [TIGR00507] shikimate 5-dehydrogenase |
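
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon. Both assumptions match the values in this record but are not stated in it. The helper names `gene_length` and `protein_length` are hypothetical, chosen for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1601250, 1602137)
print(length_bp, "bp")                   # gene length
print(protein_length(length_bp), "aa")   # protein length
```

For this record, the two printed values should agree with the Gene Length and Protein Length fields (888 bp and 295 aa).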
|
|
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.523819 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATACAGG TTAAAGCTTC CACCGGGCTG GTTGCCCTCC TGGGACACCC GGTGCAACAC TCCCTTTCGC CTCTTATGCA TAATGCCGCC TTTGCGGCCG GCGGCCAGAA CCTGGTCTAC CTGGCCTTTG ATGTTAAACC GGGTGATTTA GCTGCCGCCC TGGCCGGATT AAAGGCCCTG GGTTTCCGCG GGGCCAACGT CACCGTACCC CATAAGGAAG CGATAATCCC CTACCTGGAT GCAGTCGACC CGGTAGCAGC CAGGATCGGG GCCGTGAATA CTATCGTCAA TGAGGACCGG TGCCTGAAAG GCTACAACAC CGACGGCAGC GGTTTTTTGC GTTCCCTGGA GGAGGCCGGT TTTGACCCGG CCGGGAAGAG GGCAGTAATC CTGGGTGCAG GTGGCGCCGC CAGGGCGGTG GCCTTCGCCC TGGCGACGGC CGGCTGTGGG AGCCTGGTCC TGGCCAACCG GACCCCGGAA CGGGCCACGG AACTGGCCGG TGCCCTGGCA GGAGCCGGCC TGCCGGCGCC CGTAGTTTAC CGGCTGGGAG ATGCCGGGAT GCGGTCCGAA GTGGAGGCCG CCGACCTGGT ACTCAATACC ACCAGCCTGG GTATGTGGCC CCGGGTCGAA GAAACGCCGC TGCCACCGGA CTGGTTCCGG CCCGGACAAT GGGTCTACGA CCTGGTTTAC AACCCCCTGG AAACCAAATT CCTGGCAGGT GCCCGGCGCC GGGGCTGCCG GGTGATCTCC GGCCTGGATA TGCTCCTCTA CCAGGGGGCC GCGGCCTTTA CTCTCTGGAC GGGCCGCGAA GCCCCGGTAG CAGTCATGGA CAGGGTCCTC CGGGAGGCCA TGGGGGCGAG TTCAGGCGGG CCTGCTGCCG GCCGGTGA
|
Protein sequence | MIQVKASTGL VALLGHPVQH SLSPLMHNAA FAAGGQNLVY LAFDVKPGDL AAALAGLKAL GFRGANVTVP HKEAIIPYLD AVDPVAARIG AVNTIVNEDR CLKGYNTDGS GFLRSLEEAG FDPAGKRAVI LGAGGAARAV AFALATAGCG SLVLANRTPE RATELAGALA GAGLPAPVVY RLGDAGMRSE VEAADLVLNT TSLGMWPRVE ETPLPPDWFR PGQWVYDLVY NPLETKFLAG ARRRGCRVIS GLDMLLYQGA AAFTLWTGRE APVAVMDRVL REAMGASSGG PAAGR
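
The relationship between the gene sequence and the protein sequence above can be sketched as a codon-by-codon translation. This is a minimal illustration, not the pipeline the database uses. It assumes the gene sequence is already given in coding orientation (5' to 3', starting at ATG), which appears to be the case here even though the gene lies on the minus strand. The amino-acid assignments of translation table 11 are identical to those of the standard code, so a single 64-letter string in NCBI order (bases T, C, A, G) is used. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) and their amino acids.
# Translation table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored, so the space-separated 10-base blocks used in
    the record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGATACAGG TTAAAGCTTC CACCGGGCTG"))  # MIQVKASTGL
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in the record, apart from the spacing between 10-residue blocks.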
|
| |