Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_4026 |
Symbol | |
ID | 7873672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4423940 |
End bp | 4425163 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643700963 |
Product | coproporphyrinogen III oxidase |
Protein accession | YP_002890986 |
Protein GI | 237654672 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.799175 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCACCC GCCGCATCAT CCCGCTGACG CCGGCCACCG GGGCCGCGCG CGCCCTGCCC CTGCCCGAAC GCCCGCAGCT CGAGGCCCCG CCGCCGCTCT CCCTGTACGT GCATTTCCCG TGGTGCGTGA AGAAGTGCCC CTACTGCGAC TTCAACTCCC ACGCCCCACG CGAGGGCGGC ATCCCCGAGC AGGCCTGGCT AGGCGCGGTG CTCGCCGACC TGCAGCACGC CCTGCCCCAG GTCTGGGGCC GGCGGGTGCA TAGCGTCTTC ATCGGCGGCG GCACGCCCAG CCTGATGTCC GCGACCACGC TCGACGGCCT GCTCACCGGC ATCCGCATGC TGCTGCCGCT CGACCCGCTC GCCGAGATCA CGCTCGAGGC CAACCCGGGC ACGGTCGAGG CCGGACGCTT CCGCGACTAC CGCGCCGCCG GCGTCAATCG CCTGTCGCTC GGCATCCAGA GCTTCGACGA CGCCATGCTG GCGAAGATCG GCCGCATCCA CGGCGGCGAG GAGGCGCGGC GCGCGATCGA GGCCGCGCGC ACCCACTTCG AGCGCGTCAA CCTCGATCTC ATGTATGCCC TGCCCGGCCA GACCCTCGAC ATGGCGCTCG CCGACCTGGA GACCGCGATC GGCTTCGGCG TGCAGCACCT GTCGTGCTAC CACCTCACCC TCGAACCCAA CACCCCCTTC GCCCACGACC CGCCGCCGCT GCCCGACGAC GACACCGCCG CCGACATGCA GGAGGCCATC GAGGCCCGCC TCGCCGCCGC CGGCTTCACG CACTACGAGA CCTCCGCCTT CGCCCGCCCG CACGAGCAGA GCCGGCACAA CCTCAACTAC TGGACCTTCG GCGACTATCT CGGCCTCGGC CCGGGCGCGC ACGGCAAGCT CTCCAGCCAC GAGGGCATCC GCCGCGAGAT GCGCCACAAG CACCCGGGGC GTTACCTCGA AGGCGCGGCG CGCAGCGACT TCATCCAGGA AGCGCGCGAG GTGTCGGTGG CCGAGTTGCC CTTCGAGTTC ATGATGAACG CGCTGCGGCT CACCGAGGGC GTTCCGGCGA AGCTGTTCGC GGCGCGCACG GGGGTGCCGA TCGAGGCCAT CACCGACGAA CTGGCGCGGG CGCGCGAACG CGGGCTGCTG GACACTGCGG ACGGAAAACT GCGGCCGACG CTGCAGGGTC GGCGGTTTCT GAACGAGTTG CTGCAGGGGT TTCTAAAGGA CTGA
|
Protein sequence | MTTRRIIPLT PATGAARALP LPERPQLEAP PPLSLYVHFP WCVKKCPYCD FNSHAPREGG IPEQAWLGAV LADLQHALPQ VWGRRVHSVF IGGGTPSLMS ATTLDGLLTG IRMLLPLDPL AEITLEANPG TVEAGRFRDY RAAGVNRLSL GIQSFDDAML AKIGRIHGGE EARRAIEAAR THFERVNLDL MYALPGQTLD MALADLETAI GFGVQHLSCY HLTLEPNTPF AHDPPPLPDD DTAADMQEAI EARLAAAGFT HYETSAFARP HEQSRHNLNY WTFGDYLGLG PGAHGKLSSH EGIRREMRHK HPGRYLEGAA RSDFIQEARE VSVAELPFEF MMNALRLTEG VPAKLFAART GVPIEAITDE LARARERGLL DTADGKLRPT LQGRRFLNEL LQGFLKD
|
| |