Gene Information |
Locus tag | Tmz1t_0514 |
Symbol | |
ID | 7085128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 579390 |
End bp | 580445 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643697542 |
Product | hypothetical protein |
Protein accession | YP_002354184 |
Protein GI | 217968950 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
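The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length; both assumptions are consistent with the values listed in this record. The function names are illustrative only and do not belong to any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(579390, 580445)
print(length, protein_length_aa(length))  # prints: 1056 351
```

Under these assumptions, the result agrees with the Gene Length (1056 bp) and Protein Length (351 aa) fields.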
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.241085 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTCA CCGCGTCCAG TGTCGAGCTC CAGTCCTCGC ACCGTTTCGC CCAGTCGCTG GAGACCCGTG AGCGCCTCGA AGTCTGGCGA GGCACGCGCG ATTCCGGCTC CGCAGCGGAG AGCACGCAGC CCCCGGTGGT GGCGCTCTCC GCCGCGGGCC AGAACGCCCT GGCGGCCGAG AGCGTGACTG CGGCGGTGGA CGCGGTCGCC GAGGAGGCGG TCGAAGACGA TCCGCGCCTC GCCATGCTGA TCCGCATGAT CGAGTTCCTC ACCGGCGAGC CGGTGCGGCG CTTCAGCATG CGCGACCTGC AGTCTGCCGA CGCCGCCGAG CCTGGCTATG AGACCGGCGC AAACCCGGGT ACGAGCGGCC AGGCTGGCGC ACGGCGCGCC GGCTTCGGGC TCGAATACGA CTTCAGTAGC CGCTTCAGCG AGACCGAGAC GATTGCCTTC GGTGCCGCCG GCGTGGTGCG CACCGCGGAC GGTGCTGAGA TCCGTTTCGA GCTCGGCTTC GAGATGTCGC GCAGCTACAC CGAGTCGGTG GCGGTCAGTG TGCGCGCCGG CGACCAGCGC CTGAAGGATC CACTGGTGCT CGACTTCGGC GGCCCCGCGG CGGCGCTCTC GGAGGTGCGC TTCGACTTCG ATCTGGATAG CGACGGCACC AGGGAGAAGC TTCCGCTGGT GAGCGGCAGC GGCTTCCTCG CCTTCGACCG TAACGCCAAT GGCCGCATCG ACGATGGCCG CGAGCTCTTC GGACCCGCCA GCGGCGACGG CTTTTCCGAA CTCGCCCGCC TCGACGACGA TGCCAACGGC TGGATCGACG AGGCCGATGC CGCCTGGTCG CAGTTGCGCG TGTGGCGGCC GGATGCGGCG GGCAAGGGCA GCGTGCGGTC GCTGAGCGAG GCGGGCGTCG GCGCCCTCCA CCTCGGCCGC GTGGCGACAC CGTTCAGCCT GAAGGGCGCT GCGAACGACA CCCAGGGTGT GATGCGCGCG AGCGGCGTCT ACCTGCGCGA GGACGGTGGG GTCGGCACGC TGAGCCAGGT CGACCTGAAG GTCTGA
|
Protein sequence | MKVTASSVEL QSSHRFAQSL ETRERLEVWR GTRDSGSAAE STQPPVVALS AAGQNALAAE SVTAAVDAVA EEAVEDDPRL AMLIRMIEFL TGEPVRRFSM RDLQSADAAE PGYETGANPG TSGQAGARRA GFGLEYDFSS RFSETETIAF GAAGVVRTAD GAEIRFELGF EMSRSYTESV AVSVRAGDQR LKDPLVLDFG GPAAALSEVR FDFDLDSDGT REKLPLVSGS GFLAFDRNAN GRIDDGRELF GPASGDGFSE LARLDDDANG WIDEADAAWS QLRVWRPDAA GKGSVRSLSE AGVGALHLGR VATPFSLKGA ANDTQGVMRA SGVYLREDGG VGTLSQVDLK V
|
| |
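The sequences in this record are printed in space-separated blocks of ten characters. As a minimal sketch of how a GC content figure can be recomputed from such a field, the hypothetical helper below strips the whitespace and returns the fraction of G and C bases. It assumes the input contains only unambiguous nucleotide letters, and it may not match how the database itself rounds or computes the reported percentage.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G/C bases in a nucleotide string; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()  # remove the 10-base block spacing
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Usage: paste the full "Gene sequence" field as the argument.
example = "ATGC GGCC"  # toy input, not taken from the record
print(f"{gc_fraction(example):.0%}")
```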