Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1702 |
Symbol | |
ID | 7084122 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1910119 |
End bp | 1911354 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643698723 |
Product | protein of unknown function DUF399 |
Protein accession | YP_002355353 |
Protein GI | 217970119 |
COG category | [S] Function unknown |
COG ID | [COG3016] Uncharacterized iron-regulated protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0381555 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTCTT CCTCCTGGCT GCGCGCGTGC GCGCTCACGA GCGGTCTCCT GTGCACGCTG CCGACGCTGG CGAACCCGGC GCAGAGCCAG GCCCCGTCCG CCGCGCCCGC CTGCCTCGCG CCCGCGAGCT GGGCGCGCAC CGACGGGGTG ACGCCGCAGG TGGTGGACGC GGGCGGGCTG CTGACCGCGA TGGCCGGGCG TGCGGTCGTG CTGCTCGGCG AGCAGCACGA CGACATGGAC CACCACCGCT GGCAGCTGCA GACCCTCTCC ACGCTGCATG CGCTGCGCCC ACGGATGGTG ATCGGTTTCG AGATGTTCCC GCGCCGGGCG CAGCCGGTGC TCGACCGCTG GGTGGCGGGC GAGCTGACCG AGGCGGAGTT CCTGCAGCAG TCGGAATGGG GCAAGGTGTG GAGCATGCCG GCGGAGCTCT ACCTGCCGCT CTTCCACTTC GCGCGCATGA ACCGTATCCC GATGCTCGCG CTCAACATCG ACGCGGAGCT CACCCGGCGG ATCGCCGACA AGGGCTGGGA CGGCGTGCCC GAGGGCGAGC GCGAGGGCGT CGGTCGGCCG GCCGCGGCGG TGTCGGCTTA CGAGGACTTC CTCTTCGAGA TCCACAGCCA GCACGCCCAG ATGCGCCAGC ACGGCAAGGA CCAGGGCAAG CCGGCGCGCG GCGATGCCGC TTTCCGCAAC TTCGTGGATT CCCAGCTCGC CTGGGACCGT GCGATGGCCG AAGCCTTGCT CGCCGGCCGC GCGCGCCACG CCGCCGCGGA CGGCAGCCTG CCGTTCGCAG TGGGCATCAT GGGGAGCGGG CATGTGCGCC ACGGCCACGG CGTGGCCCAT CAGCTCGCCG CGCTCGGCGA GCCGAGCATC GGCCAGCTGA TGCCGGTCGA GGCGGCGACG CCTTGCGTGG AACTCCCCGC CGGCCTCGCC GACGCGGTGT TCGCCGTACC GGCGCTGCGC CAGCCGCCGC CTCCGCCGCC GCGGCTCGGG GTGGGGCTCG ACGACACCGA CGGCGGCATC CGCATCGCCG AGCTCACGCC GGGCAGCCTC GCCGAGCGCA GCGGCCTGCG CCGCGGCGAC CTGATCACCG AGGCGGCGGG TGCGCCGGTG CGGCGCTCCG CGCAGCTGAT CGGCCTGATC CGCCGCCAGC CGGCGGGCAC CTGGTTGCCG CTGCAGGTGA TGCGCGAGGG CAAGGCGGTC GAGGTGGTGG TGCGCTTCCC GCGCGAGGCG CGCTGA
|
Protein sequence | MSSSSWLRAC ALTSGLLCTL PTLANPAQSQ APSAAPACLA PASWARTDGV TPQVVDAGGL LTAMAGRAVV LLGEQHDDMD HHRWQLQTLS TLHALRPRMV IGFEMFPRRA QPVLDRWVAG ELTEAEFLQQ SEWGKVWSMP AELYLPLFHF ARMNRIPMLA LNIDAELTRR IADKGWDGVP EGEREGVGRP AAAVSAYEDF LFEIHSQHAQ MRQHGKDQGK PARGDAAFRN FVDSQLAWDR AMAEALLAGR ARHAAADGSL PFAVGIMGSG HVRHGHGVAH QLAALGEPSI GQLMPVEAAT PCVELPAGLA DAVFAVPALR QPPPPPPRLG VGLDDTDGGI RIAELTPGSL AERSGLRRGD LITEAAGAPV RRSAQLIGLI RRQPAGTWLP LQVMREGKAV EVVVRFPREA R
|
| |