Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1103 |
Symbol | |
ID | 3833069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1130684 |
End bp | 1131595 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829031 |
Product | GHMP kinase |
Protein accession | YP_429960 |
Protein GI | 83589951 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4542] Protein involved in propanediol utilization, and related proteins (includes coumermycin biosynthetic protein), possible kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGGGC GGGCTCGGGT ACCAGGAGCC TGCGGGGAAC TGGTCCAGGG AATGATCGAC GGTGATTATT TTCTCATTAC CTGCCCCATT AACCTGGGGT CTGAGGTCAG GGTTTCCCTG CAGCCAGGGG GCCAGGTTAC CGGTCCGGGA GAAAAAGGGA AGGCCCTGCG GGCTGTCCGC CTGACCCTGG ATCACCTGGC CGCCCCCTGG GGTGCGCGGG TGGATATCTA CAACCCCCTG CCTCCAGGTA AAGGACTGGC CAGTAGTACG GCTGACGTAG TCGCCGCGGC GGTGGCCACG GCGGAAGCCC TGGGGACGGG ACTCTCCCTG GAAACGATAA CCGAGATCGC CCTGGCCGTT GAACCAAGCG ACGGGACCTT TTTGCCGGGC ATTGTCTGTT TCGATCACCT TCAAGGTAAA CGATGGGAGT ACCTGGGGCA GCCGCCACCC ATGGACGTGC TGATCGTCGA TCCCGGGGGT ATGGTGGACA CCGTTCTTTT TAACCGGCGC CGGGACCTGG TGGCCCTTAA CCTGGCTAAG GAAGAAAAAG TCAGGCAGGC AGTAAAGCTG GTCAAGGAAG GCCTGGCCCG GGGAAGGGCA GATTTAATAG CCAGGGGGGC TACCATTAGC GCCTTGGCCA ATCAGGATAT TCTCTCCAAA CCGGAACTGG AAACCATTCT GGAACTGGCT ACCAGCAGGG GAGCCCTGGG GGTAAATACG GCCCATAGTG GTACGGTAAT AGGTATCCTT TACCGGCCGG GGGAGGTAGA CGTGGACGCC CTGGAGGCAA GCATACGAAC GACCTTTCCC TATGTCGATT TTATCCGGGC CACCATGGTC GGCGGCGGGG TGGAGGGTAG AGACGGCCCA TGGTTCATGA AAGGGCAAGC AACAGCAATG GCCGGGAACT GA
|
Protein sequence | MYGRARVPGA CGELVQGMID GDYFLITCPI NLGSEVRVSL QPGGQVTGPG EKGKALRAVR LTLDHLAAPW GARVDIYNPL PPGKGLASST ADVVAAAVAT AEALGTGLSL ETITEIALAV EPSDGTFLPG IVCFDHLQGK RWEYLGQPPP MDVLIVDPGG MVDTVLFNRR RDLVALNLAK EEKVRQAVKL VKEGLARGRA DLIARGATIS ALANQDILSK PELETILELA TSRGALGVNT AHSGTVIGIL YRPGEVDVDA LEASIRTTFP YVDFIRATMV GGGVEGRDGP WFMKGQATAM AGN
|
| |