Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0110 |
Symbol | |
ID | 3832000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 108857 |
End bp | 110065 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637828044 |
Product | dihydropteroate synthase |
Protein accession | YP_428992 |
Protein GI | 83588983 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0294] Dihydropteroate synthase and related enzymes |
TIGRFAM ID | [TIGR01496] dihydropteroate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTGGC ATGACGAGAT AGAAGTTATT ACCGTAGCCG ATGCCGCCAC TGCCGCCAGG TTGATGCAGG AAGTAGGTGC TGACGCGGGA GGGATCGACC TGATGCTTCC TAAGGCCTGC TTCTACTGCC TCAAACTCAA GGATGTACCG GCCCGGGCGG CCAATATCCT CAAGCAGGAG ATCCTGGCCC GGGGCGGCGA GGCCGCCATG GCTGCCGGGG TAGCCGGCTG GTCGGTAGAT AAGACCGATG TGCTCATTTT CGCCACCCGG CGCCAGCTGG AACTCCTGGT GGCGAAACTT CCCGTCCAGG GCTTCGGCTT AGATAAACTT GCGGCTGGTA TTAAAACTAC CCTGGCCAAC TGGGAAAGGA ACGACGTCCG GAAAATAGGT TGTCCCCGGG GTCCTCTGGT TTTGGGTAAG AGGACCCTGG TTATGGGCAT CGTCAATGTC ACCCCGGATT CCTTTTCCGA CGGTGGCCAA TTCTATGACC CGGGTGCGGC GGTGGAACAC GCCCATCAGC TGGTCGCTGA AGGGGTTGAC ATTATCGATG TCGGCGGGGA ATCCACCCGT CCCGGTCATG AACCGGTGAG TGCCGCAGAG GAATGGCGGC GCCTGGAGCC CGTCCTGACA CGCCTGACCC GGGAAATCAA GGTGCCCATA TCGGTAGATA CCTATAAAGC CAGTACTGCT CGCCGGGCAC TGGAGATGGG AGTTGACATA ATCAATGATA TCTGGGGCTT CCGGGCTGAC CCGGAAATGG CCCGGGTTTG CGGCCAGTAC CAGGCGGCAG TTGTCCTTAT GCATAACCAG GAGGGGACCA GCTACAAGGA CCTGATGTTT GACATCCTTA CCTTCCTGCG CCAGAGCATC CGAATGGCTG AGGACAACGG CGTTCCCCCG GAAAAAATTA TTGTCGATCC GGGTATCGGT TTTGGAAAGG ACCTGGACCA GAACCTGGAG GTGATGGGGC GTCTGGAGGA GTTTAAGGTC CTGGGGAAAC CGGTCCTGCT GGGAACTTCC CGTAAATCAA TGATCGGCAG GACCCTGGAC CTGCCCGTTG ATCAGCGCCT GGAGGGAACG GCAGCCACCG TCGCCCTGGG TATTGCCCGG GGCGTGGATA TCGTTCGCGT CCACGACGTC CGGGCCATGG TGCGGGTAGC CCGGATGACC GACGCTATAG TTCGCCGAAA AAAGGGCTCT ACCCGATGA
|
Protein sequence | MSWHDEIEVI TVADAATAAR LMQEVGADAG GIDLMLPKAC FYCLKLKDVP ARAANILKQE ILARGGEAAM AAGVAGWSVD KTDVLIFATR RQLELLVAKL PVQGFGLDKL AAGIKTTLAN WERNDVRKIG CPRGPLVLGK RTLVMGIVNV TPDSFSDGGQ FYDPGAAVEH AHQLVAEGVD IIDVGGESTR PGHEPVSAAE EWRRLEPVLT RLTREIKVPI SVDTYKASTA RRALEMGVDI INDIWGFRAD PEMARVCGQY QAAVVLMHNQ EGTSYKDLMF DILTFLRQSI RMAEDNGVPP EKIIVDPGIG FGKDLDQNLE VMGRLEEFKV LGKPVLLGTS RKSMIGRTLD LPVDQRLEGT AATVALGIAR GVDIVRVHDV RAMVRVARMT DAIVRRKKGS TR
|
| |