Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2390 |
Symbol | |
ID | 3832029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2511980 |
End bp | 2513230 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637830309 |
Product | serine hydroxymethyltransferase |
Protein accession | YP_431215 |
Protein GI | 83591206 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0112] Glycine/serine hydroxymethyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0832592 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCTGG AAACAGTAGC CAAGGTTGAT CCCGAAATAG TAGCCGCCGT CAGGGGCGAG CTTCAACGCC AGCGGACCCA CCTGGAACTA ATCGCCTCAG AGAACTTCGT CAGCCAGGCC GTCATGGAGG CCTATAGCTG CGTCCTGACC AATAAATACG CCGAAGGGTA TCCCGGCAAA CGCTATTATG GCGGTTGTGA ATGGGCCGAC GTAGTCGAAA ACTTGGCCCG GGAAAGGGCC AAAGCCCTCT TCGGGGCCGA GCACGCCAAT GTCCAGCCCC ATTCCGGCAG CCAGGCCAAC ACCGCCGTCT ACCTGGCCGT TCTGAACCCG GGCGATAAGG CCCTGGGCAT GAACCTGGCC CACGGCGGCC ATTTGACCCA CGGTAGCCCG GTAAGCCTTT CCGGCAAATA CTATAACTTC TGTTTCTACG GCGTCGATGC CAAAACCGGC CGTATTGACT ATGACGCCGT AGCCCGCATC GCCCGTGAGG AGCGGCCCAG ACTGATCGTC GCCGGGGCCA GCGCCTACCC GCGGGTAATC GACTTTGCCC GTTTCCGCGA GATTGCCGAC GAGGTCGGCG CCCTGCTGAT GGTGGATATG GCCCATATCG CCGGACTGGT GGCCGCCGGT ATCCATCCCA ACCCGGTACC CTACGCCCAC TTCGTGACCA CCACCACCCA TAAGACCATG CGCGGTCCGC GGGGTGGGAT TATCCTGACC ACCAGGGAAT ACGCCCGGGA TATCGATAAG GCCGTCTTCC CCGGTGTCCA GGGCGGTCCC CTGATGCACG TCATCGCCGC CAAGGCTGTG GCCTTAAAGG AGGCCATGCT GCCGGAGTTC AAACGTTACC AGGAGCAAAT AGTTACCAAT GCCCGTACCC TGGCCGACGC CCTCATGGGC TACGGATTTA ACCTGGTTTC CGGCGGTACC GACAACCACC TAATGCTCGT CGACCTACGC AATAAGAATA TAACCGGCCG GGAGGCCGAA GATATCCTGG CCTCGGTCCA GATTACCGTC AACAAGAACG CCATCCCCTT CGACCCGCAG AAGCCCAGCG TAACCAGCGG CATCCGCCTG GGAACGGCCG CTCTGACCTC CCGGGGCATG GACGCCGACG CCATGGTCCA GGTGGCCCGG GCCATCGACC TGGCCCTCTC CTACGGTCCG GATGAGAAGA AACTGGAGGA AGCCAGGGGT ATCGTCGCCG AACTCTGCCG GGCCTTCCCC CTCTACCAGG AGCTCGACTA A
|
Protein sequence | MNLETVAKVD PEIVAAVRGE LQRQRTHLEL IASENFVSQA VMEAYSCVLT NKYAEGYPGK RYYGGCEWAD VVENLARERA KALFGAEHAN VQPHSGSQAN TAVYLAVLNP GDKALGMNLA HGGHLTHGSP VSLSGKYYNF CFYGVDAKTG RIDYDAVARI AREERPRLIV AGASAYPRVI DFARFREIAD EVGALLMVDM AHIAGLVAAG IHPNPVPYAH FVTTTTHKTM RGPRGGIILT TREYARDIDK AVFPGVQGGP LMHVIAAKAV ALKEAMLPEF KRYQEQIVTN ARTLADALMG YGFNLVSGGT DNHLMLVDLR NKNITGREAE DILASVQITV NKNAIPFDPQ KPSVTSGIRL GTAALTSRGM DADAMVQVAR AIDLALSYGP DEKKLEEARG IVAELCRAFP LYQELD
|
| |