Gene Hhal_0048 details

Gene Information

Locus tag: Hhal_0048
Symbol:
ID: 4710333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 49687
End bp: 51174
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 65%
IMG OID: 639854506
Product: lysyl-tRNA synthetase
Protein accession: YP_001001645
Protein GI: 121996858
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1190] Lysyl-tRNA synthetase (class II)
TIGRFAM ID: [TIGR00499] lysyl-tRNA synthetase, eukaryotic and non-spirochete bacterial
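
The start and end coordinates, gene length, and protein length in this record are mutually consistent, which a short calculation can confirm. The sketch below assumes 1-based inclusive coordinates and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(49687, 51174)
print(length_bp)                  # 1488
print(protein_length(length_bp))  # 495
```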


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGATA GCGACGAGAA CAAGCTCATC GCCCAGCGCC GCGAGAAGTT GCAGGGCCTG 
CGCGAGACCG GCCAGGCCTT CCCGAACGAT TTCCGCCGCG ATGCGCTGGC CGCCGACCTT
CACGAGCGCT ACGGCCCGCT GGATAGCGAG GCCCTGGAGC AGGAGGGCGC CCGGGTGACA
GTTGGCGGAC GCATGATGGC CAAACGGGTG ATGGGCAAGG CGAGCTTCGC GCAGCTGCAG
GACAGCTCTG GCCGGATCCA GCTCTATCTG CGTCGGGACG ACCTGCCGGA AGGTGCTTAC
GCCGCCTTCA AGGGCTGGGA TGTGGGCGAT ATCGTCGGTG CCACCGGGAC GATCTTTCGC
ACCAAGAAGG GGGAGCTATC GATCCAGTGC GAGGAGATCC GGCTCCTGAC CAAATCCCTG
CGACCGCTGC CGGAGAAGTG GCACGGGCTG AACGATCACG AGATCCGCTA CCGCCAGCGC
TACTTGGATC TGATCGTCAA TGAGGAGAGT CAACGGGTCT TCGCCCTGCG CACCCGGGTG
TTGGCTGCCA TGCGCCGCTT CCTCGACGAG CGCGGCTTCA TGGAAGTCGA GACCCCGATG
ATGCAGCCGA TCCCCGGGGG CGCTGCGGCC CGGCCCTTCG TGACCCACCA CAACGCCCTG
GGGCGGGATC TGTATCTGCG CATCGCGCCG GAGCTCTACC TCAAGCGGCT GGTGGTCGGC
GGGTTCGAAC GCGTCTACGA AGTCAACCGC AACTTCCGGA ATGAAGGCGT ATCGACCCGG
CACAATCCGG AATTCACCAT GCTGGAGTTC TATCAGGCCT TCGCCGATTA TCGAGACCTG
ATGGACCTCA CTGAAGAGAT GCTCCGCCAC CTGGCCGAAC AGGCGCTGGG CTCCGGGCAG
GTGACCTGGC AGGGTGAAAC TTACGACTTT GCATCAAGTT TTGAGCGTAT ATCACTGATT
GAAGCGGTAC TTCGCTTCAA TTCGGACCTC GGCATCGAAG ATCTGACTGA GCGCGAGCGG
GCGGCCCAGG CGGCGCAGGC GCGCGGGATC GAGGTGCCTC CCGGCGACGG GCTGGGCAAG
ATCCAGGCGG CCCTGTTCGA GGAGACAGCC GAGCCCCGCC TGCACGGCCC CGTGTTCGTC
ACCGACTACC CGAAGGAGGT GTCGCCGCTG GCCCGTCCGC TCGACGACGA CCCGTGCTAC
ACCGAGCGCT TCGAGCTGAT CGTCGGTGGC CGCGAAATCG CCAACGGCTT CTCGGAGTTG
AACGACGCCG AGGACCAGGC CGCGCGTTTC CGGGCCCAGG CGGCGGAGCG GGATGCCGGC
GACCAGGAGG CGATGCACTA CGACGCGGAC TATATCCGTG CCCTGGAGTA CGGCCTGCCG
CCGACCGCCG GCGAGGGGAT CGGGATCGAT CGGCTGGTCA TGCTGCTCGC CGACTGCGAT
TCGATCCGCG ATGTCCTTCT GTTCCCCGCC ATGCGCCCCG AGTCTTAA
 
Protein sequence
MSDSDENKLI AQRREKLQGL RETGQAFPND FRRDALAADL HERYGPLDSE ALEQEGARVT 
VGGRMMAKRV MGKASFAQLQ DSSGRIQLYL RRDDLPEGAY AAFKGWDVGD IVGATGTIFR
TKKGELSIQC EEIRLLTKSL RPLPEKWHGL NDHEIRYRQR YLDLIVNEES QRVFALRTRV
LAAMRRFLDE RGFMEVETPM MQPIPGGAAA RPFVTHHNAL GRDLYLRIAP ELYLKRLVVG
GFERVYEVNR NFRNEGVSTR HNPEFTMLEF YQAFADYRDL MDLTEEMLRH LAEQALGSGQ
VTWQGETYDF ASSFERISLI EAVLRFNSDL GIEDLTERER AAQAAQARGI EVPPGDGLGK
IQAALFEETA EPRLHGPVFV TDYPKEVSPL ARPLDDDPCY TERFELIVGG REIANGFSEL
NDAEDQAARF RAQAAERDAG DQEAMHYDAD YIRALEYGLP PTAGEGIGID RLVMLLADCD
SIRDVLLFPA MRPES
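
The gene and protein sequences above are printed in space-separated blocks of ten residues. As a minimal sketch, the blocks can be joined and the DNA translated to compare against the listed protein. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares. Table 11's alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG ("*" = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq_text.split()).upper()


def translate(dna_text: str) -> str:
    """Translate in frame 1 up to (not including) the first stop codon."""
    dna = clean(dna_text)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[pos:pos + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna_text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna_text)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence, copied from the record above.
first_row = "ATGTCCGATA GCGACGAGAA CAAGCTCATC GCCCAGCGCC GCGAGAAGTT GCAGGGCCTG"
print(translate(first_row))  # MSDSDENKLIAQRREKLQGL
```

Pasting the full gene sequence into `translate` should reproduce the protein sequence listed above, and `gc_percent` on the same input can be compared with the reported GC content.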