Gene Information |
Locus tag | Tmz1t_2608 |
Symbol | |
ID | 7873349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2813426 |
End bp | 2815069 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643699531 |
Product | cytochrome d1 heme region |
Protein accession | YP_002889587 |
Protein GI | 237653273 |
COG category | [C] Energy production and conversion |
COG ID | [COG2010] Cytochrome c, mono- and diheme variants |
TIGRFAM ID | |
|
|
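The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the Start bp and End bp coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions agree with the values listed in this record, but the record itself does not state them. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2813426
end_bp = 2815069

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases each.
codon_count = gene_length // 3

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length)      # 1644, matches "Gene Length | 1644 bp"
print(codon_count)      # 548
print(protein_length)   # 547, matches "Protein Length | 547 aa"
```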
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.213874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATCC GCGCCTCGAC CCTGACCGGC CTGTTCGCAC TCGCCGGTGC ACTGGTCGTC TTGCCTGCCG CCGCTTCGCC GGACAAGGCC GTCGCTCGGG CCGCGTCCGG GGCAGCGGCG TCGCCGGCTG CCATCGACGT ACCCGCGCTC TACGCGCAGC ATTGCGCGGC CTGCCATGCG CCCAATCGCC TCGGAGCGAT GGGGCCCGCG TTGCTGCCCG ACAACCTGGG CCGCCTGCGC AAGAGCGAGG CTGCCAAGGT GATCGCCGAG GGCCGGCCCG CGACCCAGAT GGCCGGCTTC GCCGCGCAGC TGGGCAAGGA CGAGATCGCC GCGCTCACCG AGTGGATCTA CACCCCGGTG GTGCCCGAGC CGCGCTGGAC CGACGAGGAC ATCCGCGCCT CGCGCGTCGT CCATGTCGAC ACCGCGACCC TTTCCGACAA GCCGGTGTTC GACGCCGACC CGCTCAACCT CTTCCTGGTG GTGGAGGCGG GCGACCACCA CGTCAGCGTG CTCGACGGCG ACAGGCTCGA GCCGATCCAC CGCTTCCAGT CGCGCTTCGC GCTGCATGGC GGGCCCAAGT TCGCCGCCAA CGGGCGCTAC GTGTTCTTCG CCTCGCGCGA CGGCTGGATC ACCAAGTTCG ACCTGTGGAA CCTGAAGGTC GTGGCCGAGG TGCGTGCCGG CATCAACACC CGCAACGTCG CCGCATCGCC CGACGGCACC CACGTCGCGG TGGCCAACTA CCTCCCCAAC ACGCTGGTGC TGCTCGACGG CGAGCTGAAC CTGGTCAAGA CCATCCCGGC GATGGACCGC GACGGCAAGA AGAGCTCGCG GGTGTCGGCG GTGTACGACG CCACGCCGCG CCAGTCCTTC ATCGCCGCGC TCAAGGACGT CGGCGAGGTG TGGGAGATCA GCTACACCAA GGCGGTCGAG GACATCCCGG TCAGCTACAT CCACGACTAC ACACAGCGCG AAGGCAGCTT CATTCCCGGT TACCTCAACC CGCGCCGCAC CATCCTCGCC GAGGTGCTCG ACGACTTCTT CTTCACCCAG GACTATTCCG AGCTGATGGG GGCCTCGCGC GAGGGGGTCG GACAGGTGGT GAACCTCGAC GTGCGCAAGA AGATCGCCAA TCTGCCGCTT GACGGCATGC CGCACCTGGG CTCTGGCATC ACCTGGGAAT GGCAGGACCC CGCCGCCGGT CCGGGCGCCA GGCCGCGCAC GGTGATGGCG AGCACCAACC TCAAGGCGGG CGAGGTCACC GTGATCGACA TGAAGACCTG GGAGGTGGTG AAGCGCATCC CCACCCGCGG CCCGGGCTTC TTCCTGCGCA GCCACTCCAG CAGCCGCTAT GCGTTCGTCG ATTCGATGAT GAGCGCGGAG GCCAAGCACA TCCTGCAGGT CATCGACAAG CAGACGCTGG AGGTGGTGCG CGAGATCACC GGTGAGCCGG GCAAGACGCT CGCCCACGTC GAGTTCACCC GCGATGGCCG CCATGCGCTC GCCAGCCTGT GGGAGGACGA TGGCGCGGTG ATCGTCTATG ACGCGCGGAC CCTCGAGGAG GTCAAGCGCC TGCCGATGAG AAAGCCTGTG GGCAAGTACA ACGTATGGAA CAAGATCACG CGCGAGGAAG GCACCAGCCA CTGA
|
Protein sequence | MKIRASTLTG LFALAGALVV LPAAASPDKA VARAASGAAA SPAAIDVPAL YAQHCAACHA PNRLGAMGPA LLPDNLGRLR KSEAAKVIAE GRPATQMAGF AAQLGKDEIA ALTEWIYTPV VPEPRWTDED IRASRVVHVD TATLSDKPVF DADPLNLFLV VEAGDHHVSV LDGDRLEPIH RFQSRFALHG GPKFAANGRY VFFASRDGWI TKFDLWNLKV VAEVRAGINT RNVAASPDGT HVAVANYLPN TLVLLDGELN LVKTIPAMDR DGKKSSRVSA VYDATPRQSF IAALKDVGEV WEISYTKAVE DIPVSYIHDY TQREGSFIPG YLNPRRTILA EVLDDFFFTQ DYSELMGASR EGVGQVVNLD VRKKIANLPL DGMPHLGSGI TWEWQDPAAG PGARPRTVMA STNLKAGEVT VIDMKTWEVV KRIPTRGPGF FLRSHSSSRY AFVDSMMSAE AKHILQVIDK QTLEVVREIT GEPGKTLAHV EFTRDGRHAL ASLWEDDGAV IVYDARTLEE VKRLPMRKPV GKYNVWNKIT REEGTSH
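The relationship between the two sequence fields can be sketched in a few lines of Python. The gene sequence is shown in space-separated blocks of ten bases and, although the gene lies on the "-" strand, it appears to be given in coding orientation (it begins with ATG and ends with TGA). The example below strips the spacing, counts G and C bases, and translates the first 30 bases of the gene sequence. The function names are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which this sketch does not model).

```python
# Codon table laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_count(seq):
    """Count G and C bases in a spaced or unspaced sequence."""
    seq = clean(seq)
    return seq.count("G") + seq.count("C")


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the gene sequence, copied from the record.
fragment = "ATGAAGATCC GCGCCTCGAC CCTGACCGGC"

print(translate(fragment))   # MKIRASTLTG, the first block of the protein sequence
print(gc_count(fragment))    # 20 of the 30 bases are G or C
```

Applying the same functions to the full gene sequence would give the GC fraction and the complete translation for comparison with the GC content and Protein sequence fields.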