Gene Moth_1248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1248 
Symbol 
ID3833043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1289275 
End bp1290837 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content63% 
IMG OID637829184 
Producturoporphyrinogen-III C-methyltransferase / uroporphyrinogen-III synthase 
Protein accessionYP_430105 
Protein GI83590096 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0007] Uroporphyrinogen-III methylase
[COG1587] Uroporphyrinogen-III synthase 
TIGRFAM ID[TIGR01469] uroporphyrin-III C-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.472521 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAAATA CCAGGGGTAA AGTTTTTTTA GTTGGTGCCG GCCCCGGCGA CCCGGGCCTG 
TTGACTGTCA AAGGACGGGA GTGCCTGGCC CGGGCCGGGG CTGTAGTCTA CGATCGCTTG
CTCAACCCTG CTTTACTGGA ATATGCCCCG CCGGAGGCTG TTAAGATCTA CGTCGGTAAA
GCGCCGGATC GCCATGCCTT AAGCCAGGAC GAGATCAACG ACCTCCTGGT GGACCTGGCG
CGGCAGGGGA AACAGGTGGT GCGCCTTAAA GGGGGCGACC CCTTTGTCTT TGGCCGCGGC
GGGGAAGAAG CCCTGGCCCT ACGAGCTGCC GGCATCCCCT TCGAAGTAGT GCCAGGGGTC
ACGGCCGCCG TGGCCGTGCC GGCCTATGCC GGTATTCCGG TAACCCACCG GGGCCTGGCT
TCAACGGCGG CCTTCATAAC TGGCAACGAA GACCCCCGGA AAGGAAACAG CGCCATTAAC
TGGGAGGGTC TGGCCCGGGC CGTTGACACC CTGGTTTTTT TAATGGGTAT GGCCAACCTG
CCCTACATTG TCAGCCGCCT CCTGGCCTGC GGTCGCTCTC CCGATACACC GGTGGCTCTC
ATCCGCTGGG GCACCAGGGC CGAGCAGGAG ACGCTGACCG GCACCCTGGC GGATATCGAG
GGCCGGGCCC AGGAAGCCGG CTTCCGCAAC CCGGCCATTA TCATCATCGG CCAGGTGGTC
AACCTGCGTT CAACCCTGGC CTGGCTGGAG GATAAACCCC TCTTTGGCCG GCGGGTGATC
GTTACCCGCC CACGGGCCCA GGCAGAAGGC TTGACCCGAA GCCTGGCGGA CCTGGGGGCG
GAAGTCATAA ATTTCCCGGT GATTAGGACG GAACCTCCGG CCGACTGGCA CCCCTTGGAT
ACCGCCCTGG ACGCTATCGG GGAGTTTGAT TGGATCATAT TTACCAGTGC CAACGGTGTC
CGTTACTTCT GGCGACGACT TCTGGAACGA CACCAGGATA TCCGTTCCCT GGCCGGAATC
AGGATTGCGG CCATAGGGCC GGCCACTTCC CGTGCCCTGA AGGAACGGGG CCTCCTGACC
GATTGGCAAC CCCGGGAATA TGTAGCCGAA GCCGTGGCTT CCGGACTGGG ACCCCGGGTC
AGGGGCCGGC GGGTTCTCCT ACCCCGGGCT GATATCGCCC GGCCCTTCCT GGCCGTGGAC
CTCCGCCGCC AGGGGGCAGA AGTAACGGAG GTAACGGCCT ACCGGACGGT AAAGAATGAA
GAAAACGCCG GGTCCCTTAA AGAAATGCTG GCCGCCGGTA AAGTAGCCGC CGTCACCTTT
ACCAGTTCTT CGACGGTACG GGCCTTCCTT GACCTGCTCG GGGACGGGGC CCTGGATTTA
GTGCAAGGAA TAGACGTTTT TTGCCTCGGC CCGGTCACGG CGGCCACAGC CCGGGAGGCC
GGCCTGCAGG TAGCCGCCAC GGCCGGCGAG TATACAGAAG AGGGGCTGGT GCGGGCCATG
GAAAACTATT ATACAACCAT AAGGGCCGGG GGCGGAGATA GCAACCAATC CCGGAGCCTT
TAA
 
Protein sequence
MENTRGKVFL VGAGPGDPGL LTVKGRECLA RAGAVVYDRL LNPALLEYAP PEAVKIYVGK 
APDRHALSQD EINDLLVDLA RQGKQVVRLK GGDPFVFGRG GEEALALRAA GIPFEVVPGV
TAAVAVPAYA GIPVTHRGLA STAAFITGNE DPRKGNSAIN WEGLARAVDT LVFLMGMANL
PYIVSRLLAC GRSPDTPVAL IRWGTRAEQE TLTGTLADIE GRAQEAGFRN PAIIIIGQVV
NLRSTLAWLE DKPLFGRRVI VTRPRAQAEG LTRSLADLGA EVINFPVIRT EPPADWHPLD
TALDAIGEFD WIIFTSANGV RYFWRRLLER HQDIRSLAGI RIAAIGPATS RALKERGLLT
DWQPREYVAE AVASGLGPRV RGRRVLLPRA DIARPFLAVD LRRQGAEVTE VTAYRTVKNE
ENAGSLKEML AAGKVAAVTF TSSSTVRAFL DLLGDGALDL VQGIDVFCLG PVTAATAREA
GLQVAATAGE YTEEGLVRAM ENYYTTIRAG GGDSNQSRSL