Gene Hlac_0018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0018 
SymbolvalS 
ID7401366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp15448 
End bp18126 
Gene Length2679 bp 
Protein Length892 aa 
Translation table11 
GC content65% 
IMG OID643707072 
Productvalyl-tRNA synthetase 
Protein accessionYP_002564694 
Protein GI222478457 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.910453 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.161561 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAGTG GTGAGTACGA CCCAGACACC GTCGAGCCGC GGTGGCAGCG GCGATGGGTC 
GACGAGGAGA CGTACGCCTA CCCCGACGAC GACCCGGTCG ATCCGAACAC GGTCTTCTCC
ATCGACACGC CCCCGCCGAC GGTATCGGGG AGCCTCCACA TGGGCCACCT GTACGGCTTT
ACCCTCCAGG ACTTCGTCGC CCGATTCGAG CGCATGAACG GTGGAGAGAC GTTCTTCCCG
TTCGGCTACG ACGACAACGG TATCGCCTCC GAGCGGCTCA CCGAGGACGA ACTCGATATC
CGCCACCAGG ACTTCGAACG CCGGGAGTTC CAGGCGAAAT GTCGGGAGGT CTGTGCCGAG
TACGAGGCGC AGTTCACCGA GAACGTCCAG TCGCTCGGCG TCTCAGTCGA CTGGGACCAC
ACTTACCAGA CGATCGAGCC GCGCGTCCAG CGCATCTCCC AGCTGTCGTT CCTCGACCTG
TACGATCAGG GTCGCGAGTA CCGCGAGAAG GCGCCCGCGA TCTGGTGTCC CGAGTGCGAG
ACCGCCATCT CGCAGGTCGA GACCGAGGAC GACGAGCAGG CCAGCCACTT CCACGATATC
GCCTTCCCCG TGGTCGGAGG CGACGCAACC GATGACGGCG CCGAGGAGTT CGTCATCTCG
ACGACGCGCC CCGAACTTCT CCCCGCGTGC GTTGCCGTCT TCGTCCACCC CGACGACGAC
GAGAACCAGA ACCTCGTCGG TGAGTCCGCC GAGGTCCCGC TGTTCGGCCA CGAGGTGCCG
ATCATCGCCG ACGAGCGCGT CGACATGGAG ACGGGTTCCG GCATCGTGAT GTGCTGTACG
TTCGGCGACC AGAACGATAT CGAGTGGTAC CAGGTCCACG ACCTGAACCT CCGGGTCGCT
ATCGACGAGT CCGGACATAT GACCGACGTC GCCGAGGGGT ACGAAGGGAT GCACGCAGAC
GAGGCCCGCG AGGCTATCGT CGAAGACCTC GACGGGGCGG GCGCGCTGCT GGACCGCCGC
GATATCACCC ACACCGTCAA CGTTCACGAG CGCTGCGGGA CGAGCGTCGA GTTCCTCGTC
ACCGAGCAGT GGTACGTCGA GATGCTCGAC AAGACCGACG AGTACCTCGA GATCGGCCGG
GAGATGGAGT GGTCTCCGGA GAAGATGTTC ACCCGGTACG AGCACTGGGT CGAGGGGCTC
CAGTGGGACT GGCTCATCTC TCGTCAGCGC TCCTCGGGCA TCCCGTTCCC GGTGTGGTAC
TGCGAGGACT GCGGGGAGAT CGTCGTCGCC GAGAAGGCCG ACCTGCCCGT CGACCCCCTC
TCGGACGACC CGCCGGTCGA CGCCTGTCCC GAGTGCGGCC ACGACGAGTT CGAACCTGAA
GACGACGTGC TCGACACGTG GGCCACCTCC AGCCTGACCC CGCTCATCAA CGCCGGCTGG
GACTGGGACG AGGACGCCGG GGAGTTCACC ATGGAACACC CCGAACTGTA CCAATTCGAC
CTCCGACCAC AGGGCCACGA TATCATCAGC TTCTGGCTGT TCCACACGCT GGTGAAGTGC
TACGAGCACA CCGGTGAGGT TCCGTTCGAG GAGACGATGA TCAACGGCCA CGTCCTCGAC
GAGAATCGGG AGAAGATGTC GAAGTCCGTC GGTAACGTCG TCGAGCCGGA GGCGGTGCTG
GCGGAGTTCC CGGTCGACGC CACGCGCTAC TGGGCCGCCG GGACCGCCGT CGGCGACGAC
TTCCCGTTCA AAGAGAAGGA CCTCCGCGCG GGCGAGAAGC TGATCCGCAA GCTGTGGAAC
GCCTCTAAGC TCGTCGAGTC GCTGGCGCCG GAGCCGTACC CGGATACGCC GGCCGACGAA
GACCTCCGAG AGCTCGACCG CTGGCTGCTC GCCGAGCTCG ACGACCGAAT CGAGCGACTC
ACTGGGCTCT TCGAGGATCG CGCGTTCTCG AAGGCCCGCG ACGAACTCCG GAGCTTCTTC
TGGAACACGT TCTGTGACGA CTACCTCGAG ATCGTCAAAC AGCGCGAGGA CGACGCCGCG
GCGTACACGC TCCGGACGGC ACACCGGCGG TTCCTGAAGT TGTTTGCCCC GCTGCTCGCG
CACGTTACCG AAGAGCTCTG GCACGACATG TACGCCGATG GAGCGAGCGA CCCCGACGCG
GTCGACGCCG CCGTCGCCGA CGGCGCGCGC GACTCGATCC ACCTCGCCGA CTGGCCCGAG
CCGCTCGGTC TGGAGGCAGA CCACGAGGCT GGTGCGGCCG CCACGGCAGT CGTCGGTGCC
CTCCGAAAGT ACAAGAGCAA GAACCAGCTC CCGCTGAACG CCGAGCTCGA TGCCGTCGAG
GTGTACGCCG ATGTCCGCGG GTTTGAGGAG GACATCACGG GCGTGATGCA TGTTGCGGAC
CTTACCGTCC ATCCCGATGA GGACGCCCCG GTCGAGACGG TGATCACCGG GATCGACCTC
GACTACGCCA CCGTTGGGCC GAAGTACGGT GATCAGGTCG GGGACATCGA GGCGGCGCTC
GCGAAAGAGG AGTACGAGAT CGACGGCGAG GAGCTCCACG TTACGGGTGT TACGCTTGTC
GAGGAGGAGT TCGCGATCGA GGAGGAGCGA CAGTACCAGG GCGACGGCGA GCTGTTAGAG
GCGGACGACG TGGTCGTTAT CGTCCGTAAC GAGGCGTAG
 
Protein sequence
MPSGEYDPDT VEPRWQRRWV DEETYAYPDD DPVDPNTVFS IDTPPPTVSG SLHMGHLYGF 
TLQDFVARFE RMNGGETFFP FGYDDNGIAS ERLTEDELDI RHQDFERREF QAKCREVCAE
YEAQFTENVQ SLGVSVDWDH TYQTIEPRVQ RISQLSFLDL YDQGREYREK APAIWCPECE
TAISQVETED DEQASHFHDI AFPVVGGDAT DDGAEEFVIS TTRPELLPAC VAVFVHPDDD
ENQNLVGESA EVPLFGHEVP IIADERVDME TGSGIVMCCT FGDQNDIEWY QVHDLNLRVA
IDESGHMTDV AEGYEGMHAD EAREAIVEDL DGAGALLDRR DITHTVNVHE RCGTSVEFLV
TEQWYVEMLD KTDEYLEIGR EMEWSPEKMF TRYEHWVEGL QWDWLISRQR SSGIPFPVWY
CEDCGEIVVA EKADLPVDPL SDDPPVDACP ECGHDEFEPE DDVLDTWATS SLTPLINAGW
DWDEDAGEFT MEHPELYQFD LRPQGHDIIS FWLFHTLVKC YEHTGEVPFE ETMINGHVLD
ENREKMSKSV GNVVEPEAVL AEFPVDATRY WAAGTAVGDD FPFKEKDLRA GEKLIRKLWN
ASKLVESLAP EPYPDTPADE DLRELDRWLL AELDDRIERL TGLFEDRAFS KARDELRSFF
WNTFCDDYLE IVKQREDDAA AYTLRTAHRR FLKLFAPLLA HVTEELWHDM YADGASDPDA
VDAAVADGAR DSIHLADWPE PLGLEADHEA GAAATAVVGA LRKYKSKNQL PLNAELDAVE
VYADVRGFEE DITGVMHVAD LTVHPDEDAP VETVITGIDL DYATVGPKYG DQVGDIEAAL
AKEEYEIDGE ELHVTGVTLV EEEFAIEEER QYQGDGELLE ADDVVVIVRN EA