Gene Tmz1t_4026 details

Gene Information

Locus tag: Tmz1t_4026
Symbol:
ID: 7873672
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 4423940
End bp: 4425163
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 71%
IMG OID: 643700963
Product: coproporphyrinogen III oxidase
Protein accession: YP_002890986
Protein GI: 237654672
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase
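
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes a single stop codon (neither is stated in this record); the variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 4423940
end_bp = 4425163

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under those assumptions the results agree with the listed Gene Length (1224 bp) and Protein Length (407 aa).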


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.799175
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACCACCC GCCGCATCAT CCCGCTGACG CCGGCCACCG GGGCCGCGCG CGCCCTGCCC 
CTGCCCGAAC GCCCGCAGCT CGAGGCCCCG CCGCCGCTCT CCCTGTACGT GCATTTCCCG
TGGTGCGTGA AGAAGTGCCC CTACTGCGAC TTCAACTCCC ACGCCCCACG CGAGGGCGGC
ATCCCCGAGC AGGCCTGGCT AGGCGCGGTG CTCGCCGACC TGCAGCACGC CCTGCCCCAG
GTCTGGGGCC GGCGGGTGCA TAGCGTCTTC ATCGGCGGCG GCACGCCCAG CCTGATGTCC
GCGACCACGC TCGACGGCCT GCTCACCGGC ATCCGCATGC TGCTGCCGCT CGACCCGCTC
GCCGAGATCA CGCTCGAGGC CAACCCGGGC ACGGTCGAGG CCGGACGCTT CCGCGACTAC
CGCGCCGCCG GCGTCAATCG CCTGTCGCTC GGCATCCAGA GCTTCGACGA CGCCATGCTG
GCGAAGATCG GCCGCATCCA CGGCGGCGAG GAGGCGCGGC GCGCGATCGA GGCCGCGCGC
ACCCACTTCG AGCGCGTCAA CCTCGATCTC ATGTATGCCC TGCCCGGCCA GACCCTCGAC
ATGGCGCTCG CCGACCTGGA GACCGCGATC GGCTTCGGCG TGCAGCACCT GTCGTGCTAC
CACCTCACCC TCGAACCCAA CACCCCCTTC GCCCACGACC CGCCGCCGCT GCCCGACGAC
GACACCGCCG CCGACATGCA GGAGGCCATC GAGGCCCGCC TCGCCGCCGC CGGCTTCACG
CACTACGAGA CCTCCGCCTT CGCCCGCCCG CACGAGCAGA GCCGGCACAA CCTCAACTAC
TGGACCTTCG GCGACTATCT CGGCCTCGGC CCGGGCGCGC ACGGCAAGCT CTCCAGCCAC
GAGGGCATCC GCCGCGAGAT GCGCCACAAG CACCCGGGGC GTTACCTCGA AGGCGCGGCG
CGCAGCGACT TCATCCAGGA AGCGCGCGAG GTGTCGGTGG CCGAGTTGCC CTTCGAGTTC
ATGATGAACG CGCTGCGGCT CACCGAGGGC GTTCCGGCGA AGCTGTTCGC GGCGCGCACG
GGGGTGCCGA TCGAGGCCAT CACCGACGAA CTGGCGCGGG CGCGCGAACG CGGGCTGCTG
GACACTGCGG ACGGAAAACT GCGGCCGACG CTGCAGGGTC GGCGGTTTCT GAACGAGTTG
CTGCAGGGGT TTCTAAAGGA CTGA
 
Protein sequence
MTTRRIIPLT PATGAARALP LPERPQLEAP PPLSLYVHFP WCVKKCPYCD FNSHAPREGG 
IPEQAWLGAV LADLQHALPQ VWGRRVHSVF IGGGTPSLMS ATTLDGLLTG IRMLLPLDPL
AEITLEANPG TVEAGRFRDY RAAGVNRLSL GIQSFDDAML AKIGRIHGGE EARRAIEAAR
THFERVNLDL MYALPGQTLD MALADLETAI GFGVQHLSCY HLTLEPNTPF AHDPPPLPDD
DTAADMQEAI EARLAAAGFT HYETSAFARP HEQSRHNLNY WTFGDYLGLG PGAHGKLSSH
EGIRREMRHK HPGRYLEGAA RSDFIQEARE VSVAELPFEF MMNALRLTEG VPAKLFAART
GVPIEAITDE LARARERGLL DTADGKLRPT LQGRRFLNEL LQGFLKD
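
The protein sequence can be related to the gene sequence through the translation table listed in the record (table 11). The following is a minimal sketch, assuming the gene sequence is given 5' to 3' on the coding strand and that an alternative start codon such as GTG is read as methionine at the first position. The function name `translate_cds` and the start-codon set are illustrative assumptions, not part of this record.

```python
# Codon table built in the conventional T, C, A, G base order.
# Table 11 uses the same amino-acid assignments as the standard code;
# it differs in which codons may act as translation starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: start codons accepted for table 11 (illustrative set).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk complete codons only; a trailing partial codon is ignored.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # An initiator codon is read as methionine.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_block = "GTGACCACCC GCCGCATCAT CCCGCTGACG"
print(translate_cds(first_block))  # MTTRRIIPLT
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function allows the comparison to be repeated over the whole record.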