Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1998 |
Symbol | |
ID | 3832331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2082267 |
End bp | 2083511 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829927 |
Product | L-threonine synthase |
Protein accession | YP_430837 |
Protein GI | 83590828 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0498] Threonine synthase |
TIGRFAM ID | [TIGR00260] threonine synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 51 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAATA TAACCCAACT ACGCTGTATA AACTGCAGCC GTACCTATCA ACCCCGGCCC GGCTTTTATA CCTGCCCCAT CTGCGGGTCG ACGGACGGCA TCCTGGATGT CGAATACGAT TACGATTATA TAAACAAATC CATCTCCAGG CAGAGCCTGG CCAGCAACCG GGAACATTCC CTCTGGCGTT ACCGTCCCTT CTTGCCGGTG GCCCAGGAGG GACCGCTGCC CCCTCTGCGG GTGGGTTGGA GCCCCCTGTA CCGGGCCCGG CGCCTGGGGG ACGAACTGGG GTTGAAAGAG CTATATATCA AGGACGATGG GATTAATCCC ACGGGGTCTT TAAAAGACAG GCCTTCGGCC GTGGCGGTGG CCCGAGCCCT GGCCGAAGGA GCTAAGGTGG TCGCCTGCTC TTCAACCGGC AACGCTGCTT CCTCCCTGGC GGGGGCAGCA GCTTCCGTGG GCCTGAAGGC GGTAATCTTT GTACCGGGAC GGGCACCGCA AGGGAAGGTA GCCCAACTCT TGATTTTCGG GGCCACCGTT ATCAGTGTTC AGGGATCCTA TGAGGATGCC TTCAAGCTTT CGGCGGCGGC CATAGCCGAG CACGGCTGGT ACAACCGCAA TGCCGCCATT AACCCTTATC TGGTAGAAGG CAAAAAGACA GTTTGCCTGG AAGTGGCGGA ACAGCTAAAC TGGGAGGTAC CCGACTGGGT GGTCCTTTCC GTGGGCGACG GTTGTACCAT GGCCGGTGCC TGGAAGGCCT GGGTGGACCT GAAAAAGGCC GGTTGGATTG ACAAACTGCC GCGGATGCTG GGGGTCCAGG CCGAGGGTTG CTGCCCCATT ACCAGGGCCT TCCGGGAGGG AACCAGGGTA AAACCCATGC CCGAAAATAC CCTGGCGGAC AGCATCGCCG TCGGCGTGCC CCGCAATCCA GAAAAGGCTC TGCGGGCCGT AAGGGATTCC GGGGGTACGA TGATCAACGT CAGCGATGAA GAAATCCTTG CAGCCATGCG CACCCTGGGG CGGACCAGCG GCATCTTCGG CGAACCTGCC GGTGCAGCCG GGACCGCCGG CCTTATCAAA GCCGTCCAGG AAGGAATAAT TAAATCCGGA GATAAGGTGG TCGTCCTGGT AACAGGCAAC GGCCTCAAGG ATGTGGCCAA TGCCATCAAA GCGGCCGGAG AGCCTATCCG GGTGGAGCCC TCCCTGGAGG CCTTAAAGGA AGCCCTGGCT CGGTACGGGA AATGA
|
Protein sequence | MSNITQLRCI NCSRTYQPRP GFYTCPICGS TDGILDVEYD YDYINKSISR QSLASNREHS LWRYRPFLPV AQEGPLPPLR VGWSPLYRAR RLGDELGLKE LYIKDDGINP TGSLKDRPSA VAVARALAEG AKVVACSSTG NAASSLAGAA ASVGLKAVIF VPGRAPQGKV AQLLIFGATV ISVQGSYEDA FKLSAAAIAE HGWYNRNAAI NPYLVEGKKT VCLEVAEQLN WEVPDWVVLS VGDGCTMAGA WKAWVDLKKA GWIDKLPRML GVQAEGCCPI TRAFREGTRV KPMPENTLAD SIAVGVPRNP EKALRAVRDS GGTMINVSDE EILAAMRTLG RTSGIFGEPA GAAGTAGLIK AVQEGIIKSG DKVVVLVTGN GLKDVANAIK AAGEPIRVEP SLEALKEALA RYGK
|
| |