Gene Information |
Locus tag | Moth_1104 |
Symbol | |
ID | 3833070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1131553 |
End bp | 1132680 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637829032 |
Product | L-threonine O-3-phosphate decarboxylase |
Protein accession | YP_429961 |
Protein GI | 83589952 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01140] L-threonine-O-3-phosphate decarboxylase; [TIGR01141] histidinol-phosphate aminotransferase |
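The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but it is consistent with the listed gene length) and that the protein length excludes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1131553
end_bp = 1132680
listed_gene_length = 1128      # bp
listed_protein_length = 375    # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in whole codons; the final codon is the stop and is not
# counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

assert remainder == 0, "CDS length should be a multiple of 3"
assert gene_length == listed_gene_length
assert protein_length == listed_protein_length

print(gene_length, codons, protein_length)  # 1128 376 375
```

Under these assumptions the listed values agree: 1128 bp is 376 codons, one of which is the stop, leaving 375 amino acids.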
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTTCATG AAAGGGCAAG CAACAGCAAT GGCCGGGAAC TGACCGGGGT CATTCACGGC GGAGACTGGC AGGGAGCCGT AGATCGCTAC GGCTGGAAGG TGGAAGAGAT ACTGGACTTC AGCGCCAACA TCAATCCCCT GGGGCCGCCG GTCGGGGTAC TGGAAACCTT GAAGGAAAAC TTGCCGGCCG TTCAGCGTTA CCCTGACCCG GCAAGCCGGC GTTTGAAAGA AGCACTGGCA GATCAACTGC ATGTAGATAC CGGCGCTATA ATTATTGCTA ACGGGGCGGT AGAATTAATT TACCTTATAA TGCAGGTTCT AAAGCCTGAT CGGGTGCTGG TAGTCGAACC CACCTTCGGC GAATACCGCC GGGCAGCTAC CATCGCCGGG GCCGAAGTGT TGCCTGTCTA CCTTGATCCC GCTACCGGTT TTACCTTTGA TTTCGACCGC TGGCGCCCGG AACTGCAGCG GTCGCAGGTA GCCTTTATTT GCAATCCCAA TAATCCCACC GGCCGCCTCC TGAACCCGGA TATTTTACAT CGGGCAGCAA GCCTATGCCG GGAACAGGGG GTTTTCCTGG TGATGGATGA ATCCTTTCTG GACTTTGTCC CCGACGGGGA TAAGTTTTCT CTGGTACCCC AGGCGGCCGC CGGGCCGGGG ATATTTATCC TGCACTCTTT GACGAAGATT TTTGCCCTGC CGGGATTGCG CCTTGGTTAC GGTGTCGGCT GTCCGGATAT GGTACGCAGG CTGGAGAACA GCCGGGACCC CTGGAGCGTC AATATCCTGG CCCAGATGGC AGGCGTAGCC GCCCTGGCCG ATAAGGAGTA TTTAAAGAAA ACCCGGGAAC TAATCAAGCG GGAAAAGGAG TATCTTTTCC ATAACCTGTC CAGGCTGGCA GGATTCCGGC CCTATTACCC CGAAGTTAAT TTTATCCTGA CTAACATTCA GAACGGCTGC CTGACGGTAT CCCGGTTGGC TGAACTCCTG GCCCGGAAGC GCATCTTAAT CCGTGACTGT TCTTCTTTTC CCGGTCTTGG ACCGGCCTAC TTCCGGGTTG CCGTACGTGA CCACCGGGCC AATAAGAGAC TGGTGGCTGC CTTAAAGGAG ATAATGGAGG AGAGTTAA
Protein sequence | MVHERASNSN GRELTGVIHG GDWQGAVDRY GWKVEEILDF SANINPLGPP VGVLETLKEN LPAVQRYPDP ASRRLKEALA DQLHVDTGAI IIANGAVELI YLIMQVLKPD RVLVVEPTFG EYRRAATIAG AEVLPVYLDP ATGFTFDFDR WRPELQRSQV AFICNPNNPT GRLLNPDILH RAASLCREQG VFLVMDESFL DFVPDGDKFS LVPQAAAGPG IFILHSLTKI FALPGLRLGY GVGCPDMVRR LENSRDPWSV NILAQMAGVA ALADKEYLKK TRELIKREKE YLFHNLSRLA GFRPYYPEVN FILTNIQNGC LTVSRLAELL ARKRILIRDC SSFPGLGPAY FRVAVRDHRA NKRLVAALKE IMEES