Gene Information |
Locus tag | Tmz1t_1896 |
Symbol | |
ID | 7085665 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2140092 |
End bp | 2141375 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643698921 |
Product | cysteine desulfurase, SufS subfamily |
Protein accession | YP_002355543 |
Protein GI | 217970309 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
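The coordinates in the table can be cross-checked against the reported gene and protein lengths. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and does not belong to any database tool.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1      # inclusive interval
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1    # drop the stop codon
    return gene_length, protein_length


# Values taken from the record for locus tag Tmz1t_1896
gene_bp, protein_aa = cds_lengths(2140092, 2141375)
print(gene_bp, protein_aa)
```

Under those assumptions the result agrees with the listed 1284 bp and 427 aa.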
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACGCTC CGTGTGACCC CGGCGCGCAG ACGCCCATCC TGGACCTTGC CGCCGACTTC CCCATCTTGT CGCGGCCGTT GCATGGCCGC CGCCTGGCCT ACCTGGACAA CGGCGCCACC ACGCAGAAGC CCGCCGCGGT GATCGAGGCC GAGGCGCGCT TCTACCGCGA GTCCAACGCC AACATCCACC GCGGCGTGCA CTGGCTGTCG CAGCACGCCA CCGAACTCTA CGACGGCGCC CGCGCCACGG TGCAGCGCTT CCTCAACGCC GCGCGCGCCG ACGAGATCGT GTTCACCCGC GGCACCACCG AGGCGATCAA CCTGGTGGCG CAGAGCTGGG GCCGGCCACG GCTCGCGGCC GGTGACGAGA TCCTGCTCAG CACCATGGAG CACCACTCAA ACATCGTGCC CTGGCAGCTC GTGTGCGAGC AGACCGGCGC GGTGCTCAAG GTGATTCCGG TGCGGGACAA CGGCGAGCTC GACATGGCGG CCTTCGCGGG CCTGCTCGGC GAGCGCACGC GGCTGCTGGC GATCACCCAT GTGTCGAACG CGCTCGGCAC GGTCAATCCG GTGGCCGAGA TGACGCGGCG CGCGCACGAG ATGGGCGCGG TCGTGCTCGT GGACGGCGCG CAGGCGGTGG CCCACCAGGC GGTGGACGTG CAGGCGATCG GATGCGACTT CTACGCCTTC TCGGGCCACA AGCTCTACGG CCCAACCGGC GTCGGCGCGC TCTACGGCCG CGCCGAGCTG CTGCGCCACA TGCCCCCCTG GCAGGGTGGC GGCGACATGA TCCGTACCGT CGCCTTCGAC AACACCACCT TCGCCCCGCC GCCGCAGCGC TTCGAGGCCG GCACGCCCAA CATCGCCGGG GCGATCGCGC TGGCTGCGGC GATCGACTAC GTCCAAGGCG TCGGGCTCGC GCGCATCCAC GTCCACGAGC AGGCCCTGCT CGACTACGGC ACGCGCGCGC TCGCCGCCAT CCCCGGGGTA CGCCTGGTCG GCACCGCGGA GCGCAAAGCC GGCATCCTGT CCTTCCTGGT CGACGGCATC CACCCGCACG ACCTCGGCAC CATCCTCGAC GCCGAGGGTG TGGCGATCCG CGCCGGGCAC CACTGCGCGA TGCCACTGAT GACGCGCTTC GGCATCCCCG GCACCGCGCG CGCCTCGCTC GGCCTCTACA ACGGCCTCGC CGACCTCGAC GCGCTGGTCG CGGCGATCCA CAAGGCGCAG GAACTGTTCG GCACGCGCGG CGGCACGGCG TCTGGCAACG GGAGAGTGGC ATGA
|
Protein sequence | MNAPCDPGAQ TPILDLAADF PILSRPLHGR RLAYLDNGAT TQKPAAVIEA EARFYRESNA NIHRGVHWLS QHATELYDGA RATVQRFLNA ARADEIVFTR GTTEAINLVA QSWGRPRLAA GDEILLSTME HHSNIVPWQL VCEQTGAVLK VIPVRDNGEL DMAAFAGLLG ERTRLLAITH VSNALGTVNP VAEMTRRAHE MGAVVLVDGA QAVAHQAVDV QAIGCDFYAF SGHKLYGPTG VGALYGRAEL LRHMPPWQGG GDMIRTVAFD NTTFAPPPQR FEAGTPNIAG AIALAAAIDY VQGVGLARIH VHEQALLDYG TRALAAIPGV RLVGTAERKA GILSFLVDGI HPHDLGTILD AEGVAIRAGH HCAMPLMTRF GIPGTARASL GLYNGLADLD ALVAAIHKAQ ELFGTRGGTA SGNGRVA
|
| |
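The protein sequence can be reproduced from the gene sequence with translation table 11, which uses the standard codon assignments but allows alternative start codons such as GTG to encode the initial methionine. The sketch below is illustrative only: the helper name `translate_table11` is hypothetical, the set of start codons is a deliberately small subset, and the sequence is assumed to be given in coding orientation (as listed, it begins with GTG and ends with TGA).

```python
BASES = "TCAG"
# Standard codon assignments in NCBI order (first base T, C, A, G; same for 2nd, 3rd)
AMINOS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Assumption: only a subset of the start codons permitted by table 11
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_table11(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()        # drop the spaces between 10-mers
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")               # initiator codon reads as Met
            continue
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino = AMINOS[index]
        if amino == "*":                      # stop codon ends translation
            break
        protein.append(amino)
    return "".join(protein)


# First three 10-base groups of the gene sequence above
print(translate_table11("GTGAACGCTC CGTGTGACCC CGGCGCGCAG"))  # MNAPCDPGAQ
```

Applied to the full gene sequence, the same function should give the 427-residue protein listed above, provided the assumptions hold.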