Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3389 |
Symbol | hisS |
ID | 7873880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3707727 |
End bp | 3709022 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643700328 |
Product | histidyl-tRNA synthetase |
Protein accession | YP_002890360 |
Protein GI | 237654046 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0124] Histidyl-tRNA synthetase |
TIGRFAM ID | [TIGR00442] histidyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGA CCTTGCAGGC CGTGCGCGGG ATGAACGACA TCCTGCCGGC CGACGCCGAA ACCTGGGAAT ACTTCGAAGA CATCGTGCGC GACTGGCTGC AGAGCTACGG TTACCGCCCG ATCCGCATGC CGCTGGTCGA GCCGACGCCG CTGTTCAAGC GCGCCATCGG CGAGGTCACC GACATCGTCG AGAAGGAGAT GTACTCCTTC GAGGACGCGC TCAACGGCGA GCACCTCACG CTGCGCCCCG AAGGCACGGC CTCCTGCGTG CGCGCCGCGA TCCAGCACAA CCTGATCCCC GCGGGCGGCC CGCAGCGGTT GTACTACTAC GGCCCGATGT TCCGCCACGA GCGTCCGCAG AAGGGCCGCT ACCGCCAGTT CCACCAGATC GGCGTGGAGG CACTCGGCTT CGCCGGAGCC GATACCGACG CCGAGCTGAT CCTGATGTGC GCGCGGCTGT GGGAGGACCT CGGCCTGGAG GATGTCGCGC TCGAGATCAA CTCGCTCGGC TCGCCCGAGG AGCGCGCGCA GCACCGCGCC GCGCTGATCG CCCACCTCGA GCAGCATCAG GACAAGCTCG ACGAGGACGG CAAGCGCCGC CTGTACACCA ACCCGCTGCG CATCCTCGAC ACCAAGAATC CCGAACTGCA GGCGATCGTC GAAGCCGCGC CCAGGCTCGC CGACTATCTC GGCGACGAAT CGAAGGCGCA CTTCGAGGCG GTGCAGGTCT TCCTCAAGGA CGCCGGCATC CCGTATCGCA TCAACCACCG CCTGGTGCGC GGCCTGGACT ACTACAACCG CACGGTGTTC GAGTGGGTCA CCACGCGCCT GGGCGCGCAG GGCACGATCT GCGCCGGCGG GCGCTACGAC GGCCTGTTCG AGCAGCTCGG CGGCAAGCCG CAGCCGGCCG CGGGCTTCGC GATCGGCATC GAGCGCCTGC TGCTGCTGTG GCAGGCCTGC GGTGGCGAGG CCGAGCGTCC GGTGCCCGAC GTGTATGTGG TGAGCGTGGG CGAGGCCGCG CAGCGCCTCG GTTTCCGCGC CGCCGAGACC TTGCGCGAGC ACGGCTTCGC GGTGCTGATG CATTGCGGTG GCGGAAGCTT CAAGTCGCAG ATGAAGAAGG CCGACGCCAG CGAGGCGCCG GTGGCGATCG TGATCGGAGA GGACGAGGCC GCGGCGGGGG AGGTCGGCCT CAAGCCCCTG CGCGTCGCGG GCGCCCAGCA GCGCGTGGCG ATCGACGACC TGGTCGAGGC GATGGCCGCC CTGATGTTCC CCGAAGAAGA AGACGAAGAG GTTTGA
|
Protein sequence | MSQTLQAVRG MNDILPADAE TWEYFEDIVR DWLQSYGYRP IRMPLVEPTP LFKRAIGEVT DIVEKEMYSF EDALNGEHLT LRPEGTASCV RAAIQHNLIP AGGPQRLYYY GPMFRHERPQ KGRYRQFHQI GVEALGFAGA DTDAELILMC ARLWEDLGLE DVALEINSLG SPEERAQHRA ALIAHLEQHQ DKLDEDGKRR LYTNPLRILD TKNPELQAIV EAAPRLADYL GDESKAHFEA VQVFLKDAGI PYRINHRLVR GLDYYNRTVF EWVTTRLGAQ GTICAGGRYD GLFEQLGGKP QPAAGFAIGI ERLLLLWQAC GGEAERPVPD VYVVSVGEAA QRLGFRAAET LREHGFAVLM HCGGGSFKSQ MKKADASEAP VAIVIGEDEA AAGEVGLKPL RVAGAQQRVA IDDLVEAMAA LMFPEEEDEE V
|
| |