Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1889 |
Symbol | |
ID | 7085658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2131321 |
End bp | 2132619 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643698914 |
Product | hypothetical protein |
Protein accession | YP_002355536 |
Protein GI | 217970302 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGGGC CGGGCCTCTC CTACGACGAC ACGCCGCCGT TCTCGGCACC GATCCGCTTC TTCCTCACCG CCCCGCTGTT CGGCGTGGCG GCGGGGCTGA CGCTGATCTT TGGCGGCGAG ATCCTCGTCT CGCGGTGGAC GCCCGGCGCA CTCGCCATCA CGCACCTCTT CGCCGCCGGC TTCATGTTGC AGGTGATGCT CGGCGCGTTG CTGCAGGTCA TGCCGGTGGT GGCGGGCGCG AGCATGCCCG CGCCGCTGCG CATCGCCGGC ATCACTCACC TGGCGATGGC GCTCGGCGCC GCGGGGCTCG CCTTCGGACT GGGTGCCGGC GCACCCGGCG TCCTGCTCGG CGGCGCCGTG CTGCTCGTGG GCGGACTCTT CGTGTTCCTC TCCGCCGCCG CGCTGGGCCT CAGCCGCGGC GCACGCACGG GCAACAACAC CCGCACCCCG CGCGATCTGC GCATGGCCTT CGCCGGCCTC GTCGTCACCG CGGTCCTCGG CTTCAGCCTG GTGCTCGTGC TCACGCGCGG CGTCGCCCTG CCGGTGCCGC TGCCGACCGT GGTGAACCTG CACGCCGGCT GGGGCTGGAT GGGCTGGGCC GCAGTGCTGC TCGCCGCCGC GAGCTGGGTC GTGGTACCGA TGTTCCAGAT CACCGCTGCC TACCCGCAGC GCTTCACCAC GCTGTGGGCA CCGGCGGTCA CCGCCACCCT CGTGTTGTGG ACCCTCGCCG AGTATCTCGC AATCGATGCC GCGCGCTTCA TCGCCATCCT CGCACTCGGG CTGCTCGGCG CCGGCTACGC GGGCACCACG CTGTACCTGC AGGCACACTC GCGGCGCAGC AAGCCCGACA CGCCCTTCCT CGCCTTCCGC GAGGCTATGT ACTCGGCGCT CGCCGGCGTC CTGGTGCTGG CGATCTCGCT GTGGTCCGAC GCCGACTGCT GGCCGATCCT CGCCGGCGTG CTGATCCTGC ACGGCGGTTT CGGCGGCACC ATCACCGCGA TGCTCTACAA GATCGTGCCC TTCCTCGCCT GGCTGCACCT CACCCAGGCC GGGCTGAAGG CACCGAACAT GAAGAAGCTG CTGCCCGACA CCCCGATCCG CCGCCAGCTG CGCGTGCGCA CCACGAGTCT CGCCACGCTC TGCGTCGCGG TCTTCGTGCC GGTGCTCGCC CCGCTCGCCG GCGTCGTGCT GGCGGTCGAG TTCGGTTGGC TCTTCGCCAA CCTGCTGCGT GTGGTGCGCG CCCGCCGCGA TGCAGCGAAA AACGCCGGGC CGCGCCCCGG AAGCCACGCC AGCGCGTGA
|
Protein sequence | MMGPGLSYDD TPPFSAPIRF FLTAPLFGVA AGLTLIFGGE ILVSRWTPGA LAITHLFAAG FMLQVMLGAL LQVMPVVAGA SMPAPLRIAG ITHLAMALGA AGLAFGLGAG APGVLLGGAV LLVGGLFVFL SAAALGLSRG ARTGNNTRTP RDLRMAFAGL VVTAVLGFSL VLVLTRGVAL PVPLPTVVNL HAGWGWMGWA AVLLAAASWV VVPMFQITAA YPQRFTTLWA PAVTATLVLW TLAEYLAIDA ARFIAILALG LLGAGYAGTT LYLQAHSRRS KPDTPFLAFR EAMYSALAGV LVLAISLWSD ADCWPILAGV LILHGGFGGT ITAMLYKIVP FLAWLHLTQA GLKAPNMKKL LPDTPIRRQL RVRTTSLATL CVAVFVPVLA PLAGVVLAVE FGWLFANLLR VVRARRDAAK NAGPRPGSHA SA
|
| |