Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0697 |
Symbol | |
ID | 7083926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 780374 |
End bp | 781549 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643697723 |
Product | protein of unknown function DUF185 |
Protein accession | YP_002354365 |
Protein GI | 217969131 |
COG category | [S] Function unknown |
COG ID | [COG1565] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.141855 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTCAT TGCCCCAGCC TTCCGCCGAT GCGCTCGCCC AGAGCGCTCG CCTGCTCGAA CACATCGAGG CCGAGCTGGC CGCGGCCGCT GGCTGGATCC CGTTCGCGCG CTACATGGAG CTCGCACTCT ACGCGCCGGG GCTGGGGTAT TACAGCGGTG GCGCGCGAAA GTTCGGCCCC GGCGGCGATT TCATCACCGC GCCCGAGCTT ACCCCGCTTT TCGGCCAGGC GCTCGCCGCC CAGGTCGAGC AGGTGATGCG CGCGAGCACG CCCGCGCTGA TCGAGGTCGG CGCCGGTACC GGCCTGCTCG CCGCCGACCT GCTGCTCGAG CTCGAACGCC GCGGCTGCCT GCCCGAGCGC TACGGCATCC TCGAGCTCTC GGGCGAATTG CGCGAACGCC AGTTCGACAC CCTGGCCGCC AAGGTTCCTC ACCTGGCAGC GCGCGTGCAT TGGCTGGACG CGCTGCCTGA GCGCTTCTCC GGTGCAGTGG TGGCCAACGA GGTGCTCGAC GTGATGCCGG TGCATCTGCT GGTGTCGCGC GCCGAGGGGC TCTTCGAGCG CGGCGTCGCC ATCGCCACCG ATGCCGCGGG GATACGCCGG CTGTGCTGGG CGGACGTGCC GGCGGCGGGC GCGGTGGCGG AAGGAGCGCG GGCGCTCGCC CTGCCGGTGC CGCAGAGCGG GGAATACGTC ACCGAGCTGA ACCTCGCCGG CAAGGCCTGG GTGGCGGCCT GGGCCGAGCG CCTGCACGCG GGCGCCCTGC TGCTGATCGA CTACGGCTAT CCGCGCGCCG AGTACTACCT GCCCTCGCGT TCGGGCGGCA CCCTGCTGTG CTACTACCGC CACCATGCCC ACGGCGACCC TTTCCTGTGG CCGGGGCTCA ACGACATCAC CGCCTTCGTG GACTTCACCG CGGTGGCCGA GGCCGGCTTC GAGGCCGGGC TGGACGTGCA GGGCTACACC ACGCAGGCGC AGTTCCTCTT CAACTGCGGC GTGCTGGAAT GCCTGGAGCG GCGCGGCGCC CGCGAGAGCG CGGACTACAT CCGCGCCGCG CGCGCGGTGC AGCGCCTGAC CGCGCCGCAG GAGATGGGGG AGCTCTTCAA GGTGATCGCG CTGTCGCGCG CGATCGACGG ACCGCTGCTC GGCTTCGCGC GCGGCGATCG TACGCACGCG CTCTGA
|
Protein sequence | MSSLPQPSAD ALAQSARLLE HIEAELAAAA GWIPFARYME LALYAPGLGY YSGGARKFGP GGDFITAPEL TPLFGQALAA QVEQVMRAST PALIEVGAGT GLLAADLLLE LERRGCLPER YGILELSGEL RERQFDTLAA KVPHLAARVH WLDALPERFS GAVVANEVLD VMPVHLLVSR AEGLFERGVA IATDAAGIRR LCWADVPAAG AVAEGARALA LPVPQSGEYV TELNLAGKAW VAAWAERLHA GALLLIDYGY PRAEYYLPSR SGGTLLCYYR HHAHGDPFLW PGLNDITAFV DFTAVAEAGF EAGLDVQGYT TQAQFLFNCG VLECLERRGA RESADYIRAA RAVQRLTAPQ EMGELFKVIA LSRAIDGPLL GFARGDRTHA L
|
| |