Gene Pars_0707 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0707 
Symbol 
ID5055266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp629292 
End bp630752 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content61% 
IMG OID640468264 
Productlysyl-tRNA synthetase 
Protein accessionYP_001152945 
Protein GI145590943 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1190] Lysyl-tRNA synthetase (class II) 
TIGRFAM ID[TIGR00499] lysyl-tRNA synthetase, eukaryotic and non-spirochete bacterial 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.282019 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGAGGC AGAGTGTTGA GAAGGTGGGG GAGTGGAGGC ACCTTTTGGC CTCTCTGCGG 
GGGGCTGGGG TGGAGCCCTA CCCCCACTCC TTTACTGTGG AGCACAGCAT AAAGGCGCTT
AACGAGCTTA GGCGTCAGGC CCTCCTAGAT CCGTGGCTGG GCGCCACTAT CAGAACCGCG
GGGAGGGTTA CAGACGTGAG GCGGCACCCC AACGTGGTTT TTATCGACCT CTACGAGGAC
GGGGCGCGGT TCCAGGTGAT GGCGGATCCG AAGCTCCCGG TTCTTGAGCA CGTATGGCGC
GGGGACTTTA TCGGAGTGGA GGGGCCTCTT GTGAAGACCC AGCGGGGGGA CTACGCAGTT
AAGGCCTCCT CAATCGTCCT CTTGGCTAAG GCGGTTCAGC CTCTGCCGGA GTGGGGGAAG
GTGGACCGAT CCTCCCCGTT CTATATGCGT TACCGCTCGG TGGCGATGGT TCTCGACCTT
CAGTTGCGGT GGCGGGTGGC GGCCCGGGCG CGGCTGATAC AGGCGTTTAG GGAGGCGATG
TGGAGGCGGG GGTTTTTGGA GATCCCCACC CCCGTCCTCC AGCCTATATA CGGCGGGGCG
GCGGCGCGGC CCTTCACGAC TAAGATCTGG GCTATAGACG AGGAGTGGTA TCTCCGCATC
TCGCCGGAGC TCTACCTCAA GCGGTACATA ATCGCCGGCT TCCCGAAGGT CTTCGAAATT
GGCCCCCAGT TCCGGAACGA AGATATAGAC GCCCTTCACA ACCCGGAGTT TTGGTCGCTG
GAGGCCTACC AGGCCTACGC CGACTATAAG GATATTATGA GGCTGACCGA GGAGGTGGTG
TATGAGGCTG TCAGGGCCGT CTTGGGCACC GGCGTGGTTA AGTACAGGGA GTGGAGCATA
AACTTCTCGC CTCCGTGGCG GAGGGTTACG CTTCACGACG CGTTGCGGGA GTTCGCCGGG
GTTGACCCCG ACAGGCTTAC AGACGACGAC ATAAAGGAAA GACTGAGGGA ACTCCAGGTG
CCGCTTAGGG TGTACAACAG GGGGGTGGCC CTGGTTAAGC TCTTCGAGAA GCTGGTGGAG
AAGAAGCTGG TGCAACCCAC CTTCGTCTTG GACTACCCCG AGGAGTCCAC CCCCTTGTGT
AAGCCGCACC GGGAGAAGGC CGGCCTTGTG GAGCGCTTCG AGGCCTTTGT GGGGGGTCTC
GAAATTGCAA ACGCCTACAC TGAGCTGAAC GACCCGGTGA AGCAGTACGA GTTTTTTGCC
CGGGAGGAGG AGCTGTTTCC CAAAGACGAG GCGCACCCCT TGGACTGGGA CTTCGTGGAG
GAGCTGTCCT TTGGCATGCC CCCGACCGGC GGCGTGGGGA TTGGGGTGGA TAGGCTTGCG
ATGATTATTA CAAACGCCGA GTCTATTAAA GATGTTATCC CGTACCCGAT TGTGTCGCGT
CGCTCCTTGG CGGAGGGCTA G
 
Protein sequence
MERQSVEKVG EWRHLLASLR GAGVEPYPHS FTVEHSIKAL NELRRQALLD PWLGATIRTA 
GRVTDVRRHP NVVFIDLYED GARFQVMADP KLPVLEHVWR GDFIGVEGPL VKTQRGDYAV
KASSIVLLAK AVQPLPEWGK VDRSSPFYMR YRSVAMVLDL QLRWRVAARA RLIQAFREAM
WRRGFLEIPT PVLQPIYGGA AARPFTTKIW AIDEEWYLRI SPELYLKRYI IAGFPKVFEI
GPQFRNEDID ALHNPEFWSL EAYQAYADYK DIMRLTEEVV YEAVRAVLGT GVVKYREWSI
NFSPPWRRVT LHDALREFAG VDPDRLTDDD IKERLRELQV PLRVYNRGVA LVKLFEKLVE
KKLVQPTFVL DYPEESTPLC KPHREKAGLV ERFEAFVGGL EIANAYTELN DPVKQYEFFA
REEELFPKDE AHPLDWDFVE ELSFGMPPTG GVGIGVDRLA MIITNAESIK DVIPYPIVSR
RSLAEG