Gene Information |
Locus tag | Tmz1t_1998 |
Symbol | |
ID | 7083753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2260183 |
End bp | 2261193 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643699023 |
Product | dihydrouridine synthase DuS |
Protein accession | YP_002355645 |
Protein GI | 217970411 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0042] tRNA-dihydrouridine synthase |
TIGRFAM ID | |
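The coordinate and length fields in this table can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. Both assumptions reproduce the listed values; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2260183
end_bp = 2261193

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1011 336
```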
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.774292 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGCGCCTGC TGCTCGCGCC CATGGAGGGC GTGGCCGATT TCGTGCTTCG CGACGTGCTG ACCGCGATCG GCGGCTACGA CCTCGCGGTG TCGGAATTCG TCCGCGTCTC CGGCACCCTG CTGCCGCGGC GCATGTTCGA GCGCATCTGC CCGGAGATCC TCGCCGGCAG CCGCACCGCC GCGGGCACGC CGGTGGCGGT GCAACTGCTC GGCAGCGATC CCGTCCTGCT CGCGGAGAAC GCGGCGCGCC TGGCCGCGCT GAGCCCGCAC GGCATCGACC TCAACTTCGG CTGCCCGGCC AAGGTGGTCA ATCGCCATGG TGGCGGTGCG ATGCTGCTCG ACCGCCCGCG CCTGCTGCAC GACATCGCCG CCGCAGTGCG TGCCGCCGTG CCCGCAGCGA TCCCGGTGAG CGCCAAGATG CGCCTGGGCA TCGCCGACCC CTCGCGCGCG CTCGACTGTG CGCAGGCGCT CGCGGACGGT GGCGCGCAGG CCATCGTCGT CCATGCGCGC ACCCGCGAGC AGGGCTACCG CGCGCCCGCG CACTGGGAGT GGGTGGCGCG CATCGGCGAC GCGGTGCGCG TGCCGGTGAC CGCCAACGGC GAGGTGTGGA GCGTGCGCGA CTGGGCACGC TGCCGCGAGG TCTCCGCAGC CGAGGATGTG ATGATCGGCC GCGGCGCGGT CTCCGACCCC TGGCTCGCGC GCCGCATCCG CGGCGAACGC GGCACCGAGC CGGACGAGCG CGACTGGGTC GAACTGCACC CGCACCTCCT GCGCTACTGG GAGGGCGTGC AGGGGCGCGC CGCCCCGGTG CATGCCTGTG GCCGGCTCAA GCTGTGGCTG GGCATGCTGC GGCGCAACTA CCCCGCGGCG GGCCGGCTGC ACGAGGCGGT GCGCCGCGTC GTCCACCCCT CGCGCATGAA CGAGGAACTC CGCCGCCACG GCATTCCCGC GTGCGACCAC CCGCCATCTG CCCGGGCCGA CGGGCCCCGG CCGACGAGTC CCGACCGATG A
Protein sequence | MRLLLAPMEG VADFVLRDVL TAIGGYDLAV SEFVRVSGTL LPRRMFERIC PEILAGSRTA AGTPVAVQLL GSDPVLLAEN AARLAALSPH GIDLNFGCPA KVVNRHGGGA MLLDRPRLLH DIAAAVRAAV PAAIPVSAKM RLGIADPSRA LDCAQALADG GAQAIVVHAR TREQGYRAPA HWEWVARIGD AVRVPVTANG EVWSVRDWAR CREVSAAEDV MIGRGAVSDP WLARRIRGER GTEPDERDWV ELHPHLLRYW EGVQGRAAPV HACGRLKLWL GMLRRNYPAA GRLHEAVRRV VHPSRMNEEL RRHGIPACDH PPSARADGPR PTSPDR
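The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a hedged sketch in plain Python, not the tool that produced this record. Translation table 11 uses the standard codon assignments and differs in which codons may initiate translation, which is why the leading GTG codon appears as M in the protein. The `START_CODONS` set below is an assumed subset of common bacterial initiator codons rather than the full table 11 list, and the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Only the first 30 bases of the gene sequence are used, so the GC fraction computed here describes that fragment and not the whole gene.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order: first base varies slowest, bases ordered T, C, A, G.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Assumption: a subset of initiator codons; a start codon is read as Met.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the whitespace used to group bases into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence from its first base up to the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three ten-base blocks of the gene sequence in this record.
head = "GTGCGCCTGC TGCTCGCGCC CATGGAGGGC"
print(translate(head))  # prints: MRLLLAPMEG
print(round(gc_fraction(head), 3))
```

Passing the full gene sequence to `translate` instead of `head` is the natural next step for comparing against the complete protein sequence listed above.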