Gene Information |
Locus tag | Tmz1t_4080 |
Symbol | hemE |
ID | 7873307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4480947 |
End bp | 4482017 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643701011 |
Product | uroporphyrinogen decarboxylase |
Protein accession | YP_002891034 |
Protein GI | 237654720 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0407] Uroporphyrinogen-III decarboxylase |
TIGRFAM ID | [TIGR01464] uroporphyrinogen decarboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCCGCC TGAAGAACGA CACCTTCCTG CGCGCCCTGC TGCGCCAGCC CACCGAATAC ACCCCCGTCT GGCTGATGCG CCAGGCCGGG CGCTACCTCC CCGAATATTG CGAGACGCGC AAGCGCGCGG GCAGCTTCCT GCAGCTGTGC AAGAGCCCGG CGATGGCCTG CGAGGTGACC CTGCAGCCGC TCGCGCGCTA CGACCTCGAC GCCGCGATCC TGTTCTCGGA CATCCTCACC GTGCCCGACG CGATGGGCCT CGGCCTGTAC TTCGCCGAGG GCGAGGGCCC GCGCTTCGAG CGCCCGCTCA AGGACGAGTG GGAGATCCGC AACCTCAGCG CGCCGGACCC CCACGCCGAG CTGCAGTACG TGATGGATGC GGTCGCCGAG ATCCGCCGTG CGCTCGACGG CAGCGTGCCG CTGATCGGTT TCTCGGGCAG CCCGTGGACC CTGGCCTGCT ACATGGTCGA GGGCGGCTCC TCCGACGACT ACCGCAAGGT CAAGTCGCTG GCCTACAGCC GCCCCGACCT GATGCACCAC ATCCTCGACG TCACCGCGCA GGCGGTGGTG AAGTACCTCA ACGCGCAGAT CGAGGCCGGC GCGCAGGCGG TGATGGTGTT CGACTCCTGG GGCGGTGTGC TGTCCGAGGC CGCGTACAAG GAGTTCTCGC TGCCTTACCT GGAACAGGTC GTAGCAGGCC TGATCCGTGA GCGCGACGGC CAGCGCGTGC CCAGCATCGT GTTCACCAAG GGCGGCGGCC TGTGGCTGGA GTCGATCGCC GCGATCGGTT GCGACGCGGT CGGCCTCGAC TGGACCATGG ACATCGGCCG CGCGCGTCGC CTGGTGGGCG ACAAGGTGGC GCTGCAGGGC AACCTCGACC CCAACGTGTT GTTCGCCCCG CCCGAGGCGG TCGCGACGGA GACGCGCCGG GTGCTCGACG CCTTCGGCAA CCATCCGGGA CACGTCTTCA ATCTCGGCCA TGGCATCTCG CAGTACACCC CTCCGGAGAG CGTGAGCGTG CTGGTAGACA CCGTGCACGC GCACAGCCGG GCGATCCGCG CCGGGGCTTG A
|
Protein sequence | MSRLKNDTFL RALLRQPTEY TPVWLMRQAG RYLPEYCETR KRAGSFLQLC KSPAMACEVT LQPLARYDLD AAILFSDILT VPDAMGLGLY FAEGEGPRFE RPLKDEWEIR NLSAPDPHAE LQYVMDAVAE IRRALDGSVP LIGFSGSPWT LACYMVEGGS SDDYRKVKSL AYSRPDLMHH ILDVTAQAVV KYLNAQIEAG AQAVMVFDSW GGVLSEAAYK EFSLPYLEQV VAGLIRERDG QRVPSIVFTK GGGLWLESIA AIGCDAVGLD WTMDIGRARR LVGDKVALQG NLDPNVLFAP PEAVATETRR VLDAFGNHPG HVFNLGHGIS QYTPPESVSV LVDTVHAHSR AIRAGA
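The coordinate and length fields in this record are related by simple arithmetic, which the short Python sketch below checks. The function names are hypothetical and belong to no database API. The sketch assumes 1-based inclusive coordinates and a CDS length that includes the stop codon. Both assumptions are consistent with the listed values (4482017 - 4480947 + 1 = 1071 bp; 1071 / 3 - 1 = 356 aa).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its length includes the stop codon."""
    if gene_len_bp < 3 or gene_len_bp % 3 != 0:
        raise ValueError("CDS length must be a positive multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(4480947, 4482017)
    print(length)                  # compare with "Gene Length"
    print(protein_length(length))  # compare with "Protein Length"
```

`gc_fraction` accepts the gene sequence exactly as printed above, spaces included, so its result can be compared against the listed GC content of 69%.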