Gene Hlac_0032 details


Gene Information

Locus tag: Hlac_0032
Symbol:
ID: 7401385
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 35998
End bp: 38073
Gene Length: 2076 bp
Protein Length: 691 aa
Translation table: 11
GC content: 65%
IMG OID: 643707091
Product: putative serine protein kinase, PrkA
Protein accession: YP_002564708
Protein GI: 222478471
COG category: [T] Signal transduction mechanisms
COG ID: [COG2766] Putative Ser protein kinase
TIGRFAM ID:
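
The Gene Length and Protein Length values are consistent with the Start bp and End bp coordinates, assuming those coordinates are 1-based and inclusive (an assumption; the record does not state its coordinate convention) and that the final codon is a stop codon. A minimal Python sketch of that check, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length = gene_length(35998, 38073)
print(length, protein_length(length))  # 2076 691
```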


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0177084
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAAACC GACACACGCT CGAACGGCTC AGCGAGGAGT ACCGGACGAA CGTCCCGGAC 
GATCTCCGGG AACCCCGCTC GTTCAACTGG TACCTCGACG CGCTGTACGA GGAGCCGCGG
ATCGCGCGGA ACGCTCACCA GCGCGTCGCG GACATGTTCG ACCACTACGG CACCCAATAC
GACGAGGAAC GCGGCGTCGT CGAGTACGCG CTCGCCGCGG ACGACCCCCT CCACGACGGC
GAGAACGTCT TCTACGGCCG AGAGGTCCAC GAGGCGATCC ACGAGTTCGT CAACAAGGTG
AAATCCGGCG CTCGGGGCCT CGGACCCGAA AAGCGGATCA AGCTTCTGCT CGGTCCTGTC
GGCTCCGGTA AGTCGCATTT CGACTGGCTG GTCCGTCGGT ACTTCGAGGC GTACACCCGC
GAGGACGCCG GTCGGATGTA CACCTTCCGC TGGGTGAACC TCTGTTCGGT GATCGACGAT
CAGGACCCCG GCGACGACAC CGTGCGCTCG CCGATGAACC AGGACCCGCT CGTGCTCATC
CCGCGCCCCC AGCGCGAGGG CGTGATCGAC GAGCTCAACG AGCGGCTCGA CGCACCCTAC
ACCCTCCGAA ACGACCAACA CCCGGACCCG GCCTCGGAGT TCTACCTGAA CGAACTGCTC
GCGCACTACG ACGACGACTT ACAGCGGGTC CTCGACAACC ACGTCGAGGT CGTCCGGCTC
GTCGCCGACG AGAACCGCCG GGAGTGCATC GAGACGTTCG AGCCGAAAGA CAAGAAGAAC
CAGGACGAAA CGGAACTCAC CGGCGACGTC AACTACGCGA AGCTCGCCGT CTACGGCGAG
TCAGATCCAC GCGCGTTCGA CTACGCCGGC GCGTTCTGTA ACGCCAACCG CGGGCTGTTC
TCCGGGGAGG AACTACTCAA ACTGCAGCGA GAGTTCCTCT ACGATTTCCT CCACGCCTCT
CAGGAGTCGA CGATCAAGCC GAAGAACAAC CCCCGGATCG ACATCGATCA GGTGATCGTC
GGGCGGACGA ACATGCCGGA GTACCGCGAG AAGACCGGCG ACGAGAAGAT GGAGGCGTTC
AACGACCGCA CCAAGCGGAT CGACTACCCC TACGTGTTGG AGTACGAGAG CGAGGCGAAA
ATCTACGAGA AGATGCTCAA CAACGCCGAC GTGCCCAACG TCCACGTCGA GCCGCACGCC
TTGGAGATGG CCGGCCTCTT CGGCGTTCTC ACCCGCTTAG AGGAGCCGGC CGACGAGACG
GTGAGCCTCC TCGATAAGGC GAAGGTGTAC AACGGCGAGC TAGAAGACGA GGAGATCGAC
CGTCGGAAGC TCCGCGAGGA CGCCGCCGAA TCCGCCGACG TGGGCGAGGG GATGGACGGG
ATCTCCGCCC GATTCGTCGG CGACGAGATC GCCGAGGCGA TCATGGACGC CACCCACCGC
GACCGCAGCT ACCTCTCGCC GCTGTCCGTC TTCGACCACT TCGAGGCGAA CCTCGGCGGC
CACGGCTCGA TCGCGGAAGC GGACCTCGAC CGCTACGAGC GGCTCTTAGA GACCGTCCGC
GAGGAGTACC GCGAGCGCGC CATCGAAGAC GTGCGCCACG CGTTGGCGTA CGACGTTGAC
GAGCTCCGCC GGCAGGGTGA GAAGTACATG GACCACGTGA TGGCGTACAT CGACGACGCC
ACCGTCGACG ACGAGCTCAC CGGCCGCGAG ACCGAGCCGG ACGAGACGTT CCTGCGCGCG
GTCGAAGAGC AGCTCGACGT GCCCTCCGAT CGCAAGGACG ACTTCCGACA GGAGGTGTCG
AACTGGGTCT CCCGGCGCGC TCGCGAGGGG CGCGGCTTCG ACCCGCGGGA GAACGACCGA
CTCCGGCGCG CGCTCGAACG CAAGCTGTGG GAGGACAAGA AGCACAACAT CAACTTCTCG
GCGCTGGTCT CCGCGACCGA CCTCGACGAC GAGGAGCGGA GCGCGTGGGT CGACGCCTTG
GTCGACCGCG GCTACTCCGA GGACGGCGCC GCGGAGGTGC TGGAGTACGC GGGCGCGGCC
GTCGCCCGCT CCGAGATCGA AAACGGCGGG GACTGA
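
The GC content field can in principle be recomputed from the gene sequence above. The sketch below assumes the sequence is pasted as plain text in the 10-base blocks shown; the helper names are illustrative, and the demo uses only the first two blocks rather than the full gene, so its result is not the gene-wide value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence, used as a small demo input.
demo = clean_sequence("ATGCCAAACC GACACACGCT")
print(len(demo), f"{gc_fraction(demo):.0%}")
```

Passing the full gene sequence to `clean_sequence` and then to `gc_fraction` would give the value to compare against the reported 65%.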
 
Protein sequence
MPNRHTLERL SEEYRTNVPD DLREPRSFNW YLDALYEEPR IARNAHQRVA DMFDHYGTQY 
DEERGVVEYA LAADDPLHDG ENVFYGREVH EAIHEFVNKV KSGARGLGPE KRIKLLLGPV
GSGKSHFDWL VRRYFEAYTR EDAGRMYTFR WVNLCSVIDD QDPGDDTVRS PMNQDPLVLI
PRPQREGVID ELNERLDAPY TLRNDQHPDP ASEFYLNELL AHYDDDLQRV LDNHVEVVRL
VADENRRECI ETFEPKDKKN QDETELTGDV NYAKLAVYGE SDPRAFDYAG AFCNANRGLF
SGEELLKLQR EFLYDFLHAS QESTIKPKNN PRIDIDQVIV GRTNMPEYRE KTGDEKMEAF
NDRTKRIDYP YVLEYESEAK IYEKMLNNAD VPNVHVEPHA LEMAGLFGVL TRLEEPADET
VSLLDKAKVY NGELEDEEID RRKLREDAAE SADVGEGMDG ISARFVGDEI AEAIMDATHR
DRSYLSPLSV FDHFEANLGG HGSIAEADLD RYERLLETVR EEYRERAIED VRHALAYDVD
ELRRQGEKYM DHVMAYIDDA TVDDELTGRE TEPDETFLRA VEEQLDVPSD RKDDFRQEVS
NWVSRRAREG RGFDPRENDR LRRALERKLW EDKKHNINFS ALVSATDLDD EERSAWVDAL
VDRGYSEDGA AEVLEYAGAA VARSEIENGG D
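
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a rough sketch, assuming the amino-acid assignments of translation table 11 match the standard genetic code (the two differ in permitted start codons, not in codon meanings), the first block of the protein can be reproduced from the first three blocks of the gene. The function and variable names are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon.

    Whitespace is ignored; codons with ambiguous bases raise KeyError.
    """
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCCAAACC GACACACGCT CGAACGGCTC"))  # MPNRHTLERL
```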