Gene Hlac_2671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2671 
SymbolpheS 
ID7400877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2658391 
End bp2659899 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content68% 
IMG OID643709744 
Productphenylalanyl-tRNA synthetase subunit alpha 
Protein accessionYP_002567312 
Protein GI222481075 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0016] Phenylalanyl-tRNA synthetase alpha subunit 
TIGRFAM ID[TIGR00468] phenylalanyl-tRNA synthetase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.811124 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACTCC CGGAACGACA GCTCGCGGTC CTTGAGGCCG CGAGCGCGAC GGACGAACGG 
ACGATAGACG AGATCGCGGC GGAGACCGGT CTGAAGCCCG AGACGGTCAC CGGCGCTGCC
TTCGATCTCC GCGACGAGGG GCTCTGCTCC GTCGTCGAGA CCGCCGCCGA GACGCTCGGC
CTCACCGACG AGGGTCGACG GTACGTCGAC GACGGACTCC CCGAGACGCG GCTCTACCGT
GCCGGTCTCG CGCTCGACGC CGACGAGTCC GCGGTCTCGA TGGGGCAGGT CATCGGCGAG
GCCGACCTCG ACGGTCCCGA AGTCGACATC GCGCTATCGA ACTTCGCACG CAAGGGCTTC
GGGTCGATCG ACTCGGGCGA GCTTCGGGTC GACCCCGACG CCGACCCCGA CGCCGACTCC
GAGGCGGCCG CACTCGCGGC GCTCGCTGAC GGCGAGACTC CGGACGCCGC CGACGCGGTC
CTCGAACAGC TTGACTCCCG CGGGCTCGTC GACCACGGGG AGTCGGTGAC CCGATCGGTG
ACGCTCACCG ACGACGGCGT CGACGCGCTG ATGATGGGCA TCGAGGCGAC GGAGACGGTC
GCGCAGGTCA CCCCGGAACT GCTCGCCAGC GGCGAGTGGC GCGACGTGGA GTTCTCGGAA
TACAACGTCG AGGCCGACGC GCCGACGACG CGGGGCGGTC GGAAACACGT CCTCCGCCGG
ACCGCGGACC GCGTGAAAGA CGTCTTGGTC GGCATGGGCT TTCAGGAGAT GGAGGGCCCG
CACGCCGACA GCGACTTCTG GATCAACGAC TGCCTGTTCA TGCCACAGGA CCACCCGGCG
CGGACCCACT GGGACCGGTT CGCACTCGAC GTGGACCCGA TGGAAGACAT TCCCGACGAG
CTGATTCGCC GCGTCGAGTC GGCCCACCGC GACGGTTGGG GCACGGACGG CGACGGCTAC
CACTCGCCGT GGTCCGAGGA GTTCGCCCGC GAGATTGCCT TGCGCGGGCA CACCACGTCG
CTGTCGATGC GATACCTCTC GGGGATCGCG GGCGCAGAGC TTGAACCCCC ACAGCGGTAC
TTCTCCGTCG AGAAGGTGTA CCGCAACGAT ACGCTCGACC CGACGCACCT CCTCGAGTTC
TTCCAGATCG AGGGGTGGGT GATGGCCGAG GACCTCTCCG TGCGCGATCT GATGGGCACC
TTCGAGGAGT TCTACCGGCA GTTCGGGATC ACCGACATCC GGTTCAAGCC GCACTACAAC
CCGTACACGG AGCCGTCCTT CGAGCTGTTC GGGGAACACC CGGAGACCGG CGAGGAGATC
GAGATCGGTA ATTCGGGCGT CTTCCGCGAG GAGGTCACCG GTCCGCTCGG CGTCGACTGC
GACGTGATGG CGTGGGGGCT CGCCTTGGAA CGGCTCGCCA TGCTCACCAC TGGTGCGGAG
GACATCCGTG ATCTCCACGG AACCTTGGCT GACATCGAGT TCCTGCGAGA CGCGGAGGTG
AGCTACTGA
 
Protein sequence
MRLPERQLAV LEAASATDER TIDEIAAETG LKPETVTGAA FDLRDEGLCS VVETAAETLG 
LTDEGRRYVD DGLPETRLYR AGLALDADES AVSMGQVIGE ADLDGPEVDI ALSNFARKGF
GSIDSGELRV DPDADPDADS EAAALAALAD GETPDAADAV LEQLDSRGLV DHGESVTRSV
TLTDDGVDAL MMGIEATETV AQVTPELLAS GEWRDVEFSE YNVEADAPTT RGGRKHVLRR
TADRVKDVLV GMGFQEMEGP HADSDFWIND CLFMPQDHPA RTHWDRFALD VDPMEDIPDE
LIRRVESAHR DGWGTDGDGY HSPWSEEFAR EIALRGHTTS LSMRYLSGIA GAELEPPQRY
FSVEKVYRND TLDPTHLLEF FQIEGWVMAE DLSVRDLMGT FEEFYRQFGI TDIRFKPHYN
PYTEPSFELF GEHPETGEEI EIGNSGVFRE EVTGPLGVDC DVMAWGLALE RLAMLTTGAE
DIRDLHGTLA DIEFLRDAEV SY