Gene Tmz1t_4080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_4080 
SymbolhemE 
ID7873307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4480947 
End bp4482017 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content69% 
IMG OID643701011 
Producturoporphyrinogen decarboxylase 
Protein accessionYP_002891034 
Protein GI237654720 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID[TIGR01464] uroporphyrinogen decarboxylase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCCGCC TGAAGAACGA CACCTTCCTG CGCGCCCTGC TGCGCCAGCC CACCGAATAC 
ACCCCCGTCT GGCTGATGCG CCAGGCCGGG CGCTACCTCC CCGAATATTG CGAGACGCGC
AAGCGCGCGG GCAGCTTCCT GCAGCTGTGC AAGAGCCCGG CGATGGCCTG CGAGGTGACC
CTGCAGCCGC TCGCGCGCTA CGACCTCGAC GCCGCGATCC TGTTCTCGGA CATCCTCACC
GTGCCCGACG CGATGGGCCT CGGCCTGTAC TTCGCCGAGG GCGAGGGCCC GCGCTTCGAG
CGCCCGCTCA AGGACGAGTG GGAGATCCGC AACCTCAGCG CGCCGGACCC CCACGCCGAG
CTGCAGTACG TGATGGATGC GGTCGCCGAG ATCCGCCGTG CGCTCGACGG CAGCGTGCCG
CTGATCGGTT TCTCGGGCAG CCCGTGGACC CTGGCCTGCT ACATGGTCGA GGGCGGCTCC
TCCGACGACT ACCGCAAGGT CAAGTCGCTG GCCTACAGCC GCCCCGACCT GATGCACCAC
ATCCTCGACG TCACCGCGCA GGCGGTGGTG AAGTACCTCA ACGCGCAGAT CGAGGCCGGC
GCGCAGGCGG TGATGGTGTT CGACTCCTGG GGCGGTGTGC TGTCCGAGGC CGCGTACAAG
GAGTTCTCGC TGCCTTACCT GGAACAGGTC GTAGCAGGCC TGATCCGTGA GCGCGACGGC
CAGCGCGTGC CCAGCATCGT GTTCACCAAG GGCGGCGGCC TGTGGCTGGA GTCGATCGCC
GCGATCGGTT GCGACGCGGT CGGCCTCGAC TGGACCATGG ACATCGGCCG CGCGCGTCGC
CTGGTGGGCG ACAAGGTGGC GCTGCAGGGC AACCTCGACC CCAACGTGTT GTTCGCCCCG
CCCGAGGCGG TCGCGACGGA GACGCGCCGG GTGCTCGACG CCTTCGGCAA CCATCCGGGA
CACGTCTTCA ATCTCGGCCA TGGCATCTCG CAGTACACCC CTCCGGAGAG CGTGAGCGTG
CTGGTAGACA CCGTGCACGC GCACAGCCGG GCGATCCGCG CCGGGGCTTG A
 
Protein sequence
MSRLKNDTFL RALLRQPTEY TPVWLMRQAG RYLPEYCETR KRAGSFLQLC KSPAMACEVT 
LQPLARYDLD AAILFSDILT VPDAMGLGLY FAEGEGPRFE RPLKDEWEIR NLSAPDPHAE
LQYVMDAVAE IRRALDGSVP LIGFSGSPWT LACYMVEGGS SDDYRKVKSL AYSRPDLMHH
ILDVTAQAVV KYLNAQIEAG AQAVMVFDSW GGVLSEAAYK EFSLPYLEQV VAGLIRERDG
QRVPSIVFTK GGGLWLESIA AIGCDAVGLD WTMDIGRARR LVGDKVALQG NLDPNVLFAP
PEAVATETRR VLDAFGNHPG HVFNLGHGIS QYTPPESVSV LVDTVHAHSR AIRAGA