Gene Information |
Locus tag | Tmz1t_1425 |
Symbol | |
ID | 7083508 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1588931 |
End bp | 1589800 |
Gene Length | 870 bp |
Protein Length | 289 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643698443 |
Product | modification methylase, HemK family |
Protein accession | YP_002355080 |
Protein GI | 217969846 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG2890] Methylase of polypeptide chain release factors |
TIGRFAM ID | [TIGR00536] HemK family putative methylases; [TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific |
|
|
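The coordinate and length fields in the record above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; the record does not state either convention, but both agree with its numbers. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1588931, 1589800)
print(length, protein_length(length))  # 870 289
```

Both results match the Gene Length (870 bp) and Protein Length (289 aa) fields.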
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.949307 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCCCG CCGCGCCGTC CGCCGACGGG ATCGACATCG CCGGCGCGCT CGCCTGGGCA CGCGCGCGCA TCGACCAGAT GGACGCGCGC GTGCTGCTGC GCCACGTGCT GCAGTGCCCG GCCGCGCGTC TGGTGGCGTG GCCGGAGCAG AAGCTTGCTG CGGAGGACTG GGCCGAGTAC CGCGCCCTCG TCGAGCGCCG CGCCGCCGGC GTGCCGGTGG CCTATCTCAC CGGCACGCGC GAGTTCTACG GGCGCGAGTT CCTCGTCACC CCGGCGGTGC TGATCCCGCG CCCCGAGACC GAGCTGCTGG TCGAGCTCGC GCTCGCGCAC TTTCCCGGAC GGCGCGGGCT GCGCGTGCTC GACCTCGGCA CGGGCAGCGG GGCGCTCGCG GTGACCCTGG CGCTCGAGCT CGAGGCGGCC GAGGTGGTCG CGCTCGACCG CTCGCGCGAG GCGCTGTGGG TGGCGATGGC CAATGCCGCC AGGCTGGGCG CGAGCGTGTC CTTCGTGCAG AGCGACTGGT TCGGCGCGCT CGGCGACGAG CACTTCGAGC TCATCGTGTC CAATCCGCCC TACGTGGCTG CGGGCGACCC GCACCTCGAG CAGGGCGACG TGCGCTTCGA GCCGCGTGGC GCGCTCGCCG CCGGGCCGCA GGGCCTGGAC GACCTCGCCG AGATCGTCGC CGGCGCGCCG GCACGCCTGG TCGATGGCGG CTGGCTCTTC CTCGAGCACG GCTACGACCA GGCGGCGTCG GCGCGCGGCC TGCTCGCCGA CGCCGGCTTT GCCGCGATCG CCTCGTGGGC CGACCTCGCC GGCATCGAGC GCGTCTCGGG TGGGCGCTGG CTGGGGCGCG CAGCGCGCGA TTCGCGTTGA
|
Protein sequence | MSPAAPSADG IDIAGALAWA RARIDQMDAR VLLRHVLQCP AARLVAWPEQ KLAAEDWAEY RALVERRAAG VPVAYLTGTR EFYGREFLVT PAVLIPRPET ELLVELALAH FPGRRGLRVL DLGTGSGALA VTLALELEAA EVVALDRSRE ALWVAMANAA RLGASVSFVQ SDWFGALGDE HFELIVSNPP YVAAGDPHLE QGDVRFEPRG ALAAGPQGLD DLAEIVAGAP ARLVDGGWLF LEHGYDQAAS ARGLLADAGF AAIASWADLA GIERVSGGRW LGRAARDSR
|
| |
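The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the portal's own method, of how one might strip the spacing, compute a GC fraction, and confirm that a coding sequence ends in a stop codon. The helper names are hypothetical; the stop codons listed are those of translation table 11. The demonstration uses only the first two and the last block of the gene sequence.

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence in this record.
prefix = clean_sequence("ATGAGTCCCG CCGCGCCGTC")
print(len(prefix), gc_fraction(prefix))  # 20 0.75

# Last block of the gene sequence in this record.
print(ends_with_stop(clean_sequence("TTCGCGTTGA")))  # True
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content field.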