Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1248 |
Symbol | |
ID | 3833043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1289275 |
End bp | 1290837 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637829184 |
Product | uroporphyrinogen-III C-methyltransferase / uroporphyrinogen-III synthase |
Protein accession | YP_430105 |
Protein GI | 83590096 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0007] Uroporphyrinogen-III methylase [COG1587] Uroporphyrinogen-III synthase |
TIGRFAM ID | [TIGR01469] uroporphyrin-III C-methyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.472521 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAAAATA CCAGGGGTAA AGTTTTTTTA GTTGGTGCCG GCCCCGGCGA CCCGGGCCTG TTGACTGTCA AAGGACGGGA GTGCCTGGCC CGGGCCGGGG CTGTAGTCTA CGATCGCTTG CTCAACCCTG CTTTACTGGA ATATGCCCCG CCGGAGGCTG TTAAGATCTA CGTCGGTAAA GCGCCGGATC GCCATGCCTT AAGCCAGGAC GAGATCAACG ACCTCCTGGT GGACCTGGCG CGGCAGGGGA AACAGGTGGT GCGCCTTAAA GGGGGCGACC CCTTTGTCTT TGGCCGCGGC GGGGAAGAAG CCCTGGCCCT ACGAGCTGCC GGCATCCCCT TCGAAGTAGT GCCAGGGGTC ACGGCCGCCG TGGCCGTGCC GGCCTATGCC GGTATTCCGG TAACCCACCG GGGCCTGGCT TCAACGGCGG CCTTCATAAC TGGCAACGAA GACCCCCGGA AAGGAAACAG CGCCATTAAC TGGGAGGGTC TGGCCCGGGC CGTTGACACC CTGGTTTTTT TAATGGGTAT GGCCAACCTG CCCTACATTG TCAGCCGCCT CCTGGCCTGC GGTCGCTCTC CCGATACACC GGTGGCTCTC ATCCGCTGGG GCACCAGGGC CGAGCAGGAG ACGCTGACCG GCACCCTGGC GGATATCGAG GGCCGGGCCC AGGAAGCCGG CTTCCGCAAC CCGGCCATTA TCATCATCGG CCAGGTGGTC AACCTGCGTT CAACCCTGGC CTGGCTGGAG GATAAACCCC TCTTTGGCCG GCGGGTGATC GTTACCCGCC CACGGGCCCA GGCAGAAGGC TTGACCCGAA GCCTGGCGGA CCTGGGGGCG GAAGTCATAA ATTTCCCGGT GATTAGGACG GAACCTCCGG CCGACTGGCA CCCCTTGGAT ACCGCCCTGG ACGCTATCGG GGAGTTTGAT TGGATCATAT TTACCAGTGC CAACGGTGTC CGTTACTTCT GGCGACGACT TCTGGAACGA CACCAGGATA TCCGTTCCCT GGCCGGAATC AGGATTGCGG CCATAGGGCC GGCCACTTCC CGTGCCCTGA AGGAACGGGG CCTCCTGACC GATTGGCAAC CCCGGGAATA TGTAGCCGAA GCCGTGGCTT CCGGACTGGG ACCCCGGGTC AGGGGCCGGC GGGTTCTCCT ACCCCGGGCT GATATCGCCC GGCCCTTCCT GGCCGTGGAC CTCCGCCGCC AGGGGGCAGA AGTAACGGAG GTAACGGCCT ACCGGACGGT AAAGAATGAA GAAAACGCCG GGTCCCTTAA AGAAATGCTG GCCGCCGGTA AAGTAGCCGC CGTCACCTTT ACCAGTTCTT CGACGGTACG GGCCTTCCTT GACCTGCTCG GGGACGGGGC CCTGGATTTA GTGCAAGGAA TAGACGTTTT TTGCCTCGGC CCGGTCACGG CGGCCACAGC CCGGGAGGCC GGCCTGCAGG TAGCCGCCAC GGCCGGCGAG TATACAGAAG AGGGGCTGGT GCGGGCCATG GAAAACTATT ATACAACCAT AAGGGCCGGG GGCGGAGATA GCAACCAATC CCGGAGCCTT TAA
|
Protein sequence | MENTRGKVFL VGAGPGDPGL LTVKGRECLA RAGAVVYDRL LNPALLEYAP PEAVKIYVGK APDRHALSQD EINDLLVDLA RQGKQVVRLK GGDPFVFGRG GEEALALRAA GIPFEVVPGV TAAVAVPAYA GIPVTHRGLA STAAFITGNE DPRKGNSAIN WEGLARAVDT LVFLMGMANL PYIVSRLLAC GRSPDTPVAL IRWGTRAEQE TLTGTLADIE GRAQEAGFRN PAIIIIGQVV NLRSTLAWLE DKPLFGRRVI VTRPRAQAEG LTRSLADLGA EVINFPVIRT EPPADWHPLD TALDAIGEFD WIIFTSANGV RYFWRRLLER HQDIRSLAGI RIAAIGPATS RALKERGLLT DWQPREYVAE AVASGLGPRV RGRRVLLPRA DIARPFLAVD LRRQGAEVTE VTAYRTVKNE ENAGSLKEML AAGKVAAVTF TSSSTVRAFL DLLGDGALDL VQGIDVFCLG PVTAATAREA GLQVAATAGE YTEEGLVRAM ENYYTTIRAG GGDSNQSRSL
|
| |