Gene Hmuk_0901 details


Gene Information

Locus tag: Hmuk_0901
Symbol: leuS
ID: 8410416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 864125
End bp: 866884
Gene Length: 2760 bp
Protein Length: 919 aa
Translation table: 11
GC content: 69%
IMG OID: 645019235
Product: leucyl-tRNA synthetase
Protein accession: YP_003176737
Protein GI: 257386964
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0495] Leucyl-tRNA synthetase
TIGRFAM ID: [TIGR00396] leucyl-tRNA synthetase, eubacterial and mitochondrial family
            [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
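
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(864125, 866884)
print(length)                     # 2760
print(protein_length_aa(length))  # 919
```

Both values match the Gene Length and Protein Length fields reported for Hmuk_0901.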


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.102143
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGAACC GCTACAACCA CGCCCGCGTC CAGGAGTACT GGCAACACGT CTGGGACCGC 
GAGGGCGTGT ACGAACTCGA CGACGCGGCG ATCGACCGCG GCGAGGGGAC CTACGTGCTG
GGGATGTTCC CCTACACGTC GGGCTCGCTG CACATGGGCC ACATCCGCAA CTACGCGATC
ACCGACGCCG TCGCTCGCTA CCGGCGGATG AACGGCGAGG ACGTGCTCCA CCCGATGGGG
TGGGACGCCT TCGGGCTCCC GGCGGAGAAC GCCGCCTACG AGCGCGACAC CGACCCGGAG
TCGTGGACTC GAACGTGCAT CGAGCAGATG CGCGACGAAC TCGAACGGAT GGGGTTTGGC
TACGACTGGT CTCGCGAGAT CACGACCTGT GAGCCGGAGT ACTACCGGTG GAACCAGTGG
CTCTTTACCC GCTTCTACGA GAACGATCTC GTCGAGTACA CCGGCGCGCA GGTCAACTGG
TGCCCGGACT GCGAGACCGT CCTGGCCGAC GCACAGGTCG ACGAGGGCGA CGGTCACGCC
GACGTGCGCG ACGACACGCC GGGACACGCC GACGGCGAGG GGGTCTGCTG GCGCTGTGGC
ACCGCGGTCG AGACCCGCGA ACTCGACCAG TGGTTCTTCA CGATCACCGA CTACGCGGAC
GAACTCGCGG ACGGCCTGGA TGCCCTCGAC GGGTGGCCCG AAAGCGTCCG CGAGATCCAG
CGCAACTGGA TCGGCCGCCG CGAGGGTGCC CAGATCACCT TCGAGATCGA CGGCGATCCG
ATCGACGTGT TCACGACCCG GCCAGACACC GTCTTCGGGG CGACGTATCT GGCCCTCGCG
CCGGGCCACG GGGTCGCCCG CGAACTCGCC CAGCGCACGT CCTCGGGAGC GAGCGAGGCG
GTCGACGACT TCCACGACCG CGCCGCGGAC GGCCGAGCCG AGGGAACGGC CGGCGTCGAC
ACTGGCCTCA CCGCGACGAA CCCGGTCACC GGCGAGGAGA TCCCCGTCTA CGTCGCCAGC
TACGTCCTCT CTGACGTGGG GACGGGCGCG GTGATGGGCG TGCCGGCCCA CAACGAGCGC
GACCACGCCT TCGCCACGGC ACACGACCTC CCGGTCGAGC GAGTCGTCGC GCCCAACGAC
GGCGAACACG GAGAACGTAA CGGCCAGCCC CTCCCGTACA CCGACGAGGG GATGATCGTC
GCCGACGGGG CCTACGAGGG GCTGGCGAGC GCGGCCGCCC GCGAGCGCTT GCTGGACCAC
GACGCCGTGA CGGCGGCGAC AACCTACCGG CTCCGGGACT GGCTCATCTC ACGACAGCGC
TACTGGGGGA CGCCGATCCC GATGGTCCAC TGCGAGGAGT GTGGCGCGGT GCCGGTGCCC
GACGAGGACC TCCCCGTCGA ACTTCCCGAG TTCGTCCAGA CCACGGGGAA CCCACTGGCC
GCGAGCGACG AGTTCGTCCA GACGACCTGT CCCGACTGTG GCGGCCCCGC CGAACGCGAG
ACCGACACCA TGGACACCTT CGTCGACTCC TCCTGGTACT TCCTGCGGTA CCTGAACCCC
GACCTCTCGA CGGCTCCCTT CGAGAACGAG GTCGCCGACG ACTGGCTCCC GGTCGACGTG
TACGTCGGTG GCGACGAGCA CGCGGTGCTT CACCTGCTGT ACATCCGCTT TTTCACCCGC
GCGCTGGCGG ACCTGGGTCT GCTCGACCGG CGCGAGCCCG TCGACCGGCT GGTCAACCAG
GGGACGGTGT TGCACGGCGG CGAGAAGATG TCCAAGTCGA AGGGCAACGA CGTGGCTCCC
CACGAGTACG GCGCGGAGAC CACGCGGCTG TTCGTCCTCT CGGCGGCACA CCCCGGCCAG
GACTTCGAGT GGACGGCGAC GGACGTGAGC AACGCCTACG AGTTCCAGCA GGACGTGTAC
CGGATGGTGC TGTCGTTCAC CGACGGTGCC GGCGGTCGGA CCGAGAGCAG ACCGCACGAC
GCCTACCTCG AACGGGAGAT CGATCGGACG ATCGCCGCGG TCACCGACGA GTACGACCGC
TTTCGCTTCC ACCGGGTCGT CACCGAACTC CAGCAGTTCG CCCAGTTGCT GCGACGCTAC
CGCGAGTACG AGACCCCGTA CCGCTTCGCG TACAGTCGGG GGCTGCGCGT GCTGACGAAA
CTGCTCGCGC CGCTCGCCCC CTATCTGGCC GAGGAGCTGT GGTCCGCGCT GGACGGCGAG
GGGCTGATCG CCGGTGCCGA CTGGCCCACG CCGCTGCACG ACATCCCCGA CTACCGCACC
GAGCGCGAAC TCGTCCGGAC GACGCTCGCT GACGTGCGCG AGATCACCGA CGTGGTCGAG
ATCACCGATC CAGACGAGAT CGAGCTCGTC GTCGCGCCCG ACTGGAGCTA TCGCGCCTAC
GAGATCGCTC GCGAGGTGAA CCAGGAGCCA GACTCGACCG CGAGCGAGGG GAGCCGTCGC
TCTCGGGACG GCAGTACACC CGCGGGACAG CCGGCAGACG GGGCGGTCGT CGGTCGTATC
ATGGACGCCG ACGCCGTGCC CGGCACCGAG GCGGCGGCCG ACTACGCGGC CGAGCTGGCC
GACCGTTCGG GCGGGTTCGA GCCAGTGCTG GCTCCCGAGC GGGAGCTGAC GGTGCTGGAA
CAGGCGTCGT GGCTGTTCGC CGAGGAGTTC GACGCGGACG TGGTCGTCAG GCGAGCCGAG
CCCGACGGCG AGCAGGCACA CAGAGCCCGG CCGAACAAGC CCGCCATCCA CATCACGTGA
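
The gene sequence above is printed in blocks of 10 bases, so the spacing has to be removed before it can be analysed. The following is a minimal sketch of that clean-up and of a GC-content calculation comparable to the GC content field (reported as 69% for this gene). The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of 10 bases).
first_line = "ATGTCGAACC GCTACAACCA CGCCCGCGTC CAGGAGTACT GGCAACACGT CTGGGACCGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full 2760 bp sequence through the same two functions gives the whole-gene value to compare with the reported figure.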
 
Protein sequence
MSNRYNHARV QEYWQHVWDR EGVYELDDAA IDRGEGTYVL GMFPYTSGSL HMGHIRNYAI 
TDAVARYRRM NGEDVLHPMG WDAFGLPAEN AAYERDTDPE SWTRTCIEQM RDELERMGFG
YDWSREITTC EPEYYRWNQW LFTRFYENDL VEYTGAQVNW CPDCETVLAD AQVDEGDGHA
DVRDDTPGHA DGEGVCWRCG TAVETRELDQ WFFTITDYAD ELADGLDALD GWPESVREIQ
RNWIGRREGA QITFEIDGDP IDVFTTRPDT VFGATYLALA PGHGVARELA QRTSSGASEA
VDDFHDRAAD GRAEGTAGVD TGLTATNPVT GEEIPVYVAS YVLSDVGTGA VMGVPAHNER
DHAFATAHDL PVERVVAPND GEHGERNGQP LPYTDEGMIV ADGAYEGLAS AAARERLLDH
DAVTAATTYR LRDWLISRQR YWGTPIPMVH CEECGAVPVP DEDLPVELPE FVQTTGNPLA
ASDEFVQTTC PDCGGPAERE TDTMDTFVDS SWYFLRYLNP DLSTAPFENE VADDWLPVDV
YVGGDEHAVL HLLYIRFFTR ALADLGLLDR REPVDRLVNQ GTVLHGGEKM SKSKGNDVAP
HEYGAETTRL FVLSAAHPGQ DFEWTATDVS NAYEFQQDVY RMVLSFTDGA GGRTESRPHD
AYLEREIDRT IAAVTDEYDR FRFHRVVTEL QQFAQLLRRY REYETPYRFA YSRGLRVLTK
LLAPLAPYLA EELWSALDGE GLIAGADWPT PLHDIPDYRT ERELVRTTLA DVREITDVVE
ITDPDEIELV VAPDWSYRAY EIAREVNQEP DSTASEGSRR SRDGSTPAGQ PADGAVVGRI
MDADAVPGTE AAADYAAELA DRSGGFEPVL APERELTVLE QASWLFAEEF DADVVVRRAE
PDGEQAHRAR PNKPAIHIT
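
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the usual compact 64-letter layout (bases ordered T, C, A, G). Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons, which this sketch does not model. The function name is illustrative, and the input is assumed to contain only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied as displayed.
print(translate("ATGTCGAACC GCTACAACCA CGCCCGCGTC"))  # MSNRYNHARV
```

The output matches the first 10 residues of the protein sequence; the final codons of the gene (ATC ACG TGA) likewise give the terminal residues I and T followed by a stop.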