Gene Tmz1t_3389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3389 
SymbolhisS 
ID7873880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3707727 
End bp3709022 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content69% 
IMG OID643700328 
Producthistidyl-tRNA synthetase 
Protein accessionYP_002890360 
Protein GI237654046 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGA CCTTGCAGGC CGTGCGCGGG ATGAACGACA TCCTGCCGGC CGACGCCGAA 
ACCTGGGAAT ACTTCGAAGA CATCGTGCGC GACTGGCTGC AGAGCTACGG TTACCGCCCG
ATCCGCATGC CGCTGGTCGA GCCGACGCCG CTGTTCAAGC GCGCCATCGG CGAGGTCACC
GACATCGTCG AGAAGGAGAT GTACTCCTTC GAGGACGCGC TCAACGGCGA GCACCTCACG
CTGCGCCCCG AAGGCACGGC CTCCTGCGTG CGCGCCGCGA TCCAGCACAA CCTGATCCCC
GCGGGCGGCC CGCAGCGGTT GTACTACTAC GGCCCGATGT TCCGCCACGA GCGTCCGCAG
AAGGGCCGCT ACCGCCAGTT CCACCAGATC GGCGTGGAGG CACTCGGCTT CGCCGGAGCC
GATACCGACG CCGAGCTGAT CCTGATGTGC GCGCGGCTGT GGGAGGACCT CGGCCTGGAG
GATGTCGCGC TCGAGATCAA CTCGCTCGGC TCGCCCGAGG AGCGCGCGCA GCACCGCGCC
GCGCTGATCG CCCACCTCGA GCAGCATCAG GACAAGCTCG ACGAGGACGG CAAGCGCCGC
CTGTACACCA ACCCGCTGCG CATCCTCGAC ACCAAGAATC CCGAACTGCA GGCGATCGTC
GAAGCCGCGC CCAGGCTCGC CGACTATCTC GGCGACGAAT CGAAGGCGCA CTTCGAGGCG
GTGCAGGTCT TCCTCAAGGA CGCCGGCATC CCGTATCGCA TCAACCACCG CCTGGTGCGC
GGCCTGGACT ACTACAACCG CACGGTGTTC GAGTGGGTCA CCACGCGCCT GGGCGCGCAG
GGCACGATCT GCGCCGGCGG GCGCTACGAC GGCCTGTTCG AGCAGCTCGG CGGCAAGCCG
CAGCCGGCCG CGGGCTTCGC GATCGGCATC GAGCGCCTGC TGCTGCTGTG GCAGGCCTGC
GGTGGCGAGG CCGAGCGTCC GGTGCCCGAC GTGTATGTGG TGAGCGTGGG CGAGGCCGCG
CAGCGCCTCG GTTTCCGCGC CGCCGAGACC TTGCGCGAGC ACGGCTTCGC GGTGCTGATG
CATTGCGGTG GCGGAAGCTT CAAGTCGCAG ATGAAGAAGG CCGACGCCAG CGAGGCGCCG
GTGGCGATCG TGATCGGAGA GGACGAGGCC GCGGCGGGGG AGGTCGGCCT CAAGCCCCTG
CGCGTCGCGG GCGCCCAGCA GCGCGTGGCG ATCGACGACC TGGTCGAGGC GATGGCCGCC
CTGATGTTCC CCGAAGAAGA AGACGAAGAG GTTTGA
 
Protein sequence
MSQTLQAVRG MNDILPADAE TWEYFEDIVR DWLQSYGYRP IRMPLVEPTP LFKRAIGEVT 
DIVEKEMYSF EDALNGEHLT LRPEGTASCV RAAIQHNLIP AGGPQRLYYY GPMFRHERPQ
KGRYRQFHQI GVEALGFAGA DTDAELILMC ARLWEDLGLE DVALEINSLG SPEERAQHRA
ALIAHLEQHQ DKLDEDGKRR LYTNPLRILD TKNPELQAIV EAAPRLADYL GDESKAHFEA
VQVFLKDAGI PYRINHRLVR GLDYYNRTVF EWVTTRLGAQ GTICAGGRYD GLFEQLGGKP
QPAAGFAIGI ERLLLLWQAC GGEAERPVPD VYVVSVGEAA QRLGFRAAET LREHGFAVLM
HCGGGSFKSQ MKKADASEAP VAIVIGEDEA AAGEVGLKPL RVAGAQQRVA IDDLVEAMAA
LMFPEEEDEE V