Gene Hmuk_0665 details

Gene Information

Locus tag: Hmuk_0665
Symbol:
ID: 8410178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 626298
End bp: 627920
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 68%
IMG OID: 645019000
Product: tryptophanyl-tRNA synthetase
Protein accession: YP_003176504
Protein GI: 257386731
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0180] Tryptophanyl-tRNA synthetase
TIGRFAM ID: [TIGR00233] tryptophanyl-tRNA synthetase
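
The coordinate and length fields above are related by simple arithmetic. The following minimal Python sketch shows that relationship. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Start bp and End bp fields of the record.
start_bp = 626298
end_bp = 627920

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1623 0 540
```

Under these assumptions the result agrees with the Gene Length (1623 bp) and Protein Length (540 aa) fields.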


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.383548
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.528912
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGACC CAGATCGCAC CCGCGACGAG GGATCGCCCG ACGACCACAG CGCCGCGTCG 
GGTTCGGACT GGCGCGAGCG CTCGGGGACC TCCGTCCGAA CGGACGGTGG AACGGAGGCA
GAAGGCCCCG ATGCTGTCGC TCTCGATCCG TGGGGCTCCT CGACGGTCTC GGACTACCGG
AAGCTGTTCG ACCAGTTCGG CATCGAGTCC TTCGACGAGG TCCTGCCCGA GGTCCCGGAG
CCCCACTACC TGATGCGACG GGGCGTCATC TTCGGGCACC GCGACTACCG CCCCGTGGCT
CGCGCGATGC GCGAGGGCGA GCCGTTCGCC GCGCTGTCGG GGTTCATGCC GACGGGCGAC
CCTCACATCG GTCACAAACT CGTCTTCGAC GAGATCATCT GGCACCAGGA GCGGGGCGGC
GACGCGTACG GACTGATCGC CGACCTGGAG GCCCACAGCG CCCGCGGCCT GACCTGGGAG
GAGATCGACG AGCACGCGAG GAACTACGTT CTCTCGCTGC TGGCGCTGGG CTTCGATCCC
GAGGACGGAG AGCTGTACCG CCAGTCCGAG AACCGCGACG TACAGGATCT CGCCTTCGAA
CTCGGTGCCG AGGCGAACTT CTCGGAGCTC CAGGCGATCT ACGACTTCGA CGGCGAGACC
GACGTGAGTC ACATGCAGTC GGTCGTCACC CAGATGGCCG ACATCCTCTA CCCGCAACTC
GACGGGCCGA AGCCGACGGT GATCCCGGTC GGTCCCGACC AGGACCCACA CATGCGACTC
GCCCGCGACC TGGCGGCCCG GATGGGCTAT TTCGGCGTCA CTCGCGCGTA CGCGAGCTTC
GAGGTCGACG ACGCCGAGCG GGAGCTGCTC GGTGCCGCCC ACGAGGCCCT GCAGGCCGAC
CGTTCGGCCG ACGAGCGCGT CCGCTGTGAA GCGGCCGCCG ACTGGATCGA GGCCGAAATC
GCGCCCGACG ACGTGCGCCA GCGCACCGTC GAGAAGCTCC GCGCTGCGGG CAAGGAACCA
CTTCGCCCTC GGACTCGCTT CCTCGATCGC AACGCCACCG ACGAGGCCTT CGAGGCGCTG
ATCGAGGCGG TCGAGGGCGA GAAGCGCGTC TACGACGAGC ACGTCGACGC CTTCGACATG
GACCGCGAGG CCGCCACGGA ACTGGCCCGC GAGATCGCGA TCGACCACGG CGGCTACGGC
TTTCGCGCAC CGTCGTCGAT CTACCACCGC TTCATGACGG GGTTGACCGG CGGCAAGATG
TCCTCCTCGA TTCCCGCCTC ACACATCTCG CTGCTGGACG ACCCCGAAGA CGGCTACGAC
AAGGTGAAGT CGGCGACGAC CGGCGGCCGA TCGACCGCCG AGGAACAGCG CGAGAAGGGC
GGCAAGGCCG ACGAGTGCCC GGTCTACGAG CTGTACGCCT ACCTGCTCGC CGGTGACGAC
GACGAGTTCG CCAAGGAGGT CTACGACGAG TGCGTCGGGG GCGAACGGCT CTGTGGGGGC
TGCAAGGAAC AGGCCGCCGA ACTGATGGAG TCGTTCCTCG AAGACCACCA GGAGAAACGC
GCCGAGTGGG AAGCGAAACT CGACGAGCTC GACATCGACC TCGACAGCGA TCGCAAGCGC
TGA
 
Protein sequence
MSDPDRTRDE GSPDDHSAAS GSDWRERSGT SVRTDGGTEA EGPDAVALDP WGSSTVSDYR 
KLFDQFGIES FDEVLPEVPE PHYLMRRGVI FGHRDYRPVA RAMREGEPFA ALSGFMPTGD
PHIGHKLVFD EIIWHQERGG DAYGLIADLE AHSARGLTWE EIDEHARNYV LSLLALGFDP
EDGELYRQSE NRDVQDLAFE LGAEANFSEL QAIYDFDGET DVSHMQSVVT QMADILYPQL
DGPKPTVIPV GPDQDPHMRL ARDLAARMGY FGVTRAYASF EVDDAERELL GAAHEALQAD
RSADERVRCE AAADWIEAEI APDDVRQRTV EKLRAAGKEP LRPRTRFLDR NATDEAFEAL
IEAVEGEKRV YDEHVDAFDM DREAATELAR EIAIDHGGYG FRAPSSIYHR FMTGLTGGKM
SSSIPASHIS LLDDPEDGYD KVKSATTGGR STAEEQREKG GKADECPVYE LYAYLLAGDD
DEFAKEVYDE CVGGERLCGG CKEQAAELME SFLEDHQEKR AEWEAKLDEL DIDLDSDRKR
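
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is one way to reproduce that mapping in plain Python, and it is not the database's own pipeline. Table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may initiate translation. This gene starts with ATG, so the sketch ignores alternative start codons. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest over T, C, A, G).
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 33 bases of the gene sequence, copied from the record.
first_block = "ATGAGTGACC CAGATCGCAC CCGCGACGAG"
last_block = "GACATCGACC TCGACAGCGA TCGCAAGCGC TGA"

print(translate(first_block))  # MSDPDRTRDE
print(translate(last_block))   # DIDLDSDRKR
```

Both fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `translate` would allow the same check over all 540 residues.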