Gene Hlac_2073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2073 
Symbol 
ID7400593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2061563 
End bp2062933 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content67% 
IMG OID643709144 
Productseryl-tRNA synthetase 
Protein accessionYP_002566721 
Protein GI222480484 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0172] Seryl-tRNA synthetase 
TIGRFAM ID[TIGR00414] seryl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.400527 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.190866 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTCTC GACAGTTCGT CCGGGAGAAC CCCGAGGTCG TTCGCGAGGC GCTCGACAAC 
AAGGGCGTCG ACGTGGACCT CGACCGGATA CTCGACGTTG ACGAGGAGTG GCGGGAGCTG
AAGAGCCGTG GCGACGATTT GCGCCACGAG CGCAACGAGG TCTCCTCGAC GATCGGCGAG
CTGAAACAGG CCGGCGAGGA GGAGGCGGCC CAAGAGGCGA TCGAGCGCTC ACAGGAGGTC
AAGTCGGAGC TGCAGGAGAT CGAAGAGCGC GCCGACGAGC TGGAGGCCGA ACTGGAGGAG
TCCCTGCTCG AACTCCCCCA GATCCCCCAC GAGTCGGTGC CGGTCGGGGC CGACGAGTCG
GAGAACGTCG AGCGACGCCG CGAGGGGTTC GACGACCTGC GCGAGGTTCC CGACAACGTG
GAGCCCCACT ACGATCTGGG CGAGGAGCTG GAGATCCTCG ACTTCGAGCG CGGCGCGAAA
GTCGCCGGCG GCGGCTTCTA CGTCGCGAAG GGCGACGGCG CCCGGCTGGA GCATGCGCTG
ATCCAGTTCA TGCTCGACGT GCATCGCGAG CAGGATTACC GTGACGTGTT CCCGCCGATC
GCGGTCAACT CCACGTCGAT GCGCGGCACC GGCCAGCTCC CGAAGTTCAC CGAGGACGCC
TACCGGATCG AGGGGACCAA CGAGGACGCG TACGACGACG ACGACCTCTG GCTGCTCCCG
ACCGCGGAGG TGCCCGTCAC GAACCTCCAC CGCGACGAGA TCCTGCTCGG CGAGGACCTC
CCGCTCAAGT ACCAGGCGTA CACGCCGAAC TTCCGGCAGG AGGCGGGTGA GCACGGCACC
GAAACGCGCG GGATCGTCCG CGTCCACCAG TTCAACAAGG TGGAGATGGT GAACTTCGTC
CGGCCCGAGG AGAGCCACGA GCGCTTCGAG GGCCTCGTCG ATGAGGCCGA GGAGGTGCTT
CGCCGCCTCG AACTTCCCTA CCGCATCCTG GAGATGTGTA CCGGCGATCT GGGGTTCACG
CAGGCGAAGA AGTACGACCT TGAAGTCTGG GCGCCGGCCG ACGACATGGA CGAGGGCCCC
GCAGAGGGCG GCCGCTGGCT GGAGGTCTCC TCCGTCTCGA ACTTCGAGGA ATTCCAGGCG
CGCCGTGCCG GGATCCGGTA CCGCGAGGAG CACCACGAGT CCGCGGAGTT CCTCCACACC
CTGAACGGTT CGGGGCTCGC CGTCCCGCGG ATCGTCGTCG CGATCTTGGA GTACTACCAG
AACGACGACG GCACCGTCAC CGTCCCCGAG GCGCTGCGCC CGTACATGGG CGGCACAGAG
GTGATCGAGG GTCACGACGC GGTCGGCGAG ACGAAGCTCG GCGGGGAGTA G
 
Protein sequence
MLSRQFVREN PEVVREALDN KGVDVDLDRI LDVDEEWREL KSRGDDLRHE RNEVSSTIGE 
LKQAGEEEAA QEAIERSQEV KSELQEIEER ADELEAELEE SLLELPQIPH ESVPVGADES
ENVERRREGF DDLREVPDNV EPHYDLGEEL EILDFERGAK VAGGGFYVAK GDGARLEHAL
IQFMLDVHRE QDYRDVFPPI AVNSTSMRGT GQLPKFTEDA YRIEGTNEDA YDDDDLWLLP
TAEVPVTNLH RDEILLGEDL PLKYQAYTPN FRQEAGEHGT ETRGIVRVHQ FNKVEMVNFV
RPEESHERFE GLVDEAEEVL RRLELPYRIL EMCTGDLGFT QAKKYDLEVW APADDMDEGP
AEGGRWLEVS SVSNFEEFQA RRAGIRYREE HHESAEFLHT LNGSGLAVPR IVVAILEYYQ
NDDGTVTVPE ALRPYMGGTE VIEGHDAVGE TKLGGE