Gene Hlac_0318 details

Gene Information

Locus tag: Hlac_0318
Symbol:
ID: 7399708
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 340893
End bp: 342665
Gene Length: 1773 bp
Protein Length: 590 aa
Translation table: 11
GC content: 67%
IMG OID: 643707380
Product: putative PAS/PAC sensor protein
Protein accession: YP_002564992
Protein GI: 222478755
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
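
The length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(340893, 342665)
print(length)                  # 1773
print(protein_length(length))  # 590
```

Both results agree with the Gene Length and Protein Length values listed in the record.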


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.375606
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.106516
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGATT CGCCGGATCC TGCAGTCCTT TTGGATCTCG CAGGAGACAA GATCGCCGTC 
CTCGACGAAG ACGGGATCTT CCGGCACCTC AACGCGGCCG TCGCCGACTT GCTCGGGTTC
CACTCGGACG ACCTCGTCGG GACGGACGCA TTCGCGCTCG TCCACCCCGA CAACGAGGAG
CGGCTCCAAG AGACCTTCGC GCGGATCGTC TCCGGGGAGC TGACACCCGA CGAGCCACTG
GAGTACCGGT ATCGCACCGC CGACGGTGGG TGGGTATGGC TCCGGACGAC GGTGCACCCG
CCCGAGGAGA CGGAAATCGA CGGGTACGTC CTGACCTCGC GGGACATCAC GAGCGAGGTC
GAGTCCCGGC GCCGACTGGA GACGATCGCA TCGGCCTCGT CCGACGTGTT CTGGATGTTC
TCCGCCGAGT GGGACGAGCT GCTGTTCATC AGCGACATCG TCGAGGAGGT CTTCGGCGTG
TCGAGGGACA CGCTCGAACG GCAGCCAAAT CGGTTCCTCG ACGTTGTCCA TCCCGATGAC
CGCTCATACG TCGAGCGAGC GATGGACCGA CTCTCGAACG GCGAATCGAC GCTGATCGAC
TACCGACTGG GGTCCGCCGA CGGGACCACG AAGTGGGTCC GCGTGCCCGG CGAGCCAGTG
ATCGAGAACG GCACGGTCGT GGCGGTCACG GGCTTCGCCC GCGATGTCAC CGACGAGTAC
CGCCGCGAGC GACAGCTCGC CGTGATGGAC AACCTTCTGC GACACACGAT CCGCAACGAC
ATGAACATCG TCGACGGGAC CGCGGAGCGC ATCGTTGACG CCGTCGCTGC CGCGGACGCG
TTCGATCCGG AGGCGTGGGG CGACAGCGTC GCGGCCGCGG AGGGTAACGC CGAGATTGGT
CCTGACGCCC TCGCCGAACT CGGGGCGGAC CTACAGGAGC ACGCGGAGAC GATCCGACGG
ATCGCCTCCG ACCTGTTGGC GACGGCAGAG AAACAGCGCG GGGTGATCGA CCTGCTGCGA
CAGCGCGGGT CACCCCGAGC GGTCGAGGTG GCGCCCGTGG TCGAGGAGGC GCTCGGAATG
GTCGTCGACG ACTGCGACGA GACGGTCGAC GTCACCTACC GCGAGCCGGT CGACGGAGAG
GGCGCGAGGG AAGGCGAGCC CGGAGACGAG ACCGAGAACG TGGACGGGAT GGCGAGTGAG
GAGACGGCAG GCGAGGGGAC GACGAATGAG GAGACGGCAG GCGAGGAGAC GGCGGGCGAC
AACTCGACAC TCCCGCGGGT ATCGGTCTCG TACCCGCCGA ACGCGAAGGC GTTCACGCAT
CCGGAGCTCG ACTACGCGAT CGCGGAGTTG GTCGAGAACG CCCTCGAACA CGCGGAGTCG
ACGCCGCGGA TCCGGATCGA CGTGTGTACA ACCGACGAGT CGATCGAGGT GTCGATCCGC
GACAACTGCC CGCCGATCCC GGTCGAGGAG CGATACGTAA TCACCGACCG ATGGGAGATG
GACGACCTCC GTCACACCGG GGGGATGGGC TTGTGGCTGG TGTACTGGGT CGCAAACCGG
TCGGGCGGCG ACCTGACCTT CGACACCCAC GCCGACGGGA ACGTCGTGAC GCTCTCCGTT
CCGAACGCGA AGTGTGGCAC GATCAACGAG GATCCACGGG AGACGACCCT GTCAAACCGC
CCGATGACCG CCGCAGTCGA GGGGGCAGAC ACGCGCATTC GGACCGACGG ATCCACCACC
TCGGAACCGA AACGGCGCGA CGAGACGGAC TGA
 
Protein sequence
MSDSPDPAVL LDLAGDKIAV LDEDGIFRHL NAAVADLLGF HSDDLVGTDA FALVHPDNEE 
RLQETFARIV SGELTPDEPL EYRYRTADGG WVWLRTTVHP PEETEIDGYV LTSRDITSEV
ESRRRLETIA SASSDVFWMF SAEWDELLFI SDIVEEVFGV SRDTLERQPN RFLDVVHPDD
RSYVERAMDR LSNGESTLID YRLGSADGTT KWVRVPGEPV IENGTVVAVT GFARDVTDEY
RRERQLAVMD NLLRHTIRND MNIVDGTAER IVDAVAAADA FDPEAWGDSV AAAEGNAEIG
PDALAELGAD LQEHAETIRR IASDLLATAE KQRGVIDLLR QRGSPRAVEV APVVEEALGM
VVDDCDETVD VTYREPVDGE GAREGEPGDE TENVDGMASE ETAGEGTTNE ETAGEETAGD
NSTLPRVSVS YPPNAKAFTH PELDYAIAEL VENALEHAES TPRIRIDVCT TDESIEVSIR
DNCPPIPVEE RYVITDRWEM DDLRHTGGMG LWLVYWVANR SGGDLTFDTH ADGNVVTLSV
PNAKCGTINE DPRETTLSNR PMTAAVEGAD TRIRTDGSTT SEPKRRDETD
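
The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch of how such a listing could be normalized before further use, shown only on the first and last lines of the gene sequence. `normalize_sequence` is a hypothetical helper name, not part of any database tool.

```python
def normalize_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the result."""
    return "".join(text.split()).upper()


# First and last lines of the gene sequence, copied from the listing above.
first = normalize_sequence(
    "ATGTCCGATT CGCCGGATCC TGCAGTCCTT TTGGATCTCG CAGGAGACAA GATCGCCGTC "
)
last = normalize_sequence("TCGGAACCGA AACGGCGCGA CGAGACGGAC TGA")

print(len(first))               # 60
print(first.startswith("ATG"))  # True
print(last.endswith("TGA"))     # True
```

Applying the same helper to the full listing should give a string whose length matches the Gene Length field of the record.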