Gene Hlac_3482 details


Gene Information

Locus tag: Hlac_3482
Symbol:
ID: 7402328
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012030
Strand:
Start bp: 226640
End bp: 229624
Gene Length: 2985 bp
Protein Length: 994 aa
Translation table: 11
GC content: 67%
IMG OID: 643710023
Product: putative PAS/PAC sensor protein
Protein accession: YP_002567589
Protein GI: 222481353
COG category: [R] General function prediction only
COG ID: [COG3413] Predicted DNA binding protein
TIGRFAM ID: [TIGR00229] PAS domain S-box
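
The coordinate and length fields above agree with one another, which a few lines of arithmetic can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. Both are assumptions about how this record was produced, not something the record states, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 226640
END_BP = 229624


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one trailing stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


gene_len = gene_length(START_BP, END_BP)
print(gene_len)                  # 2985
print(protein_length(gene_len))  # 994
```

Under these assumptions the results match the reported Gene Length (2985 bp) and Protein Length (994 aa).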


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGGACT TGACGGCTGC ACTCCGCGAG ACGCTCGACA CGTTCGCGCC CGACGGGACG 
CCGCTGACGA CGAGCGAGGT GGCTGAGGCA CTCGACCTCG GTCGACGGAG CACGTACGAC
AGACTGGACC GACTCGTGGA CGCCGATGAA CTACGCACGA AGAAGGTCGG CGCGAGCGCG
CGCGTCTGGT GGCGGAACGA CACAGGTGAC GCCGCTCTAC CTGACGCTAA AATCGCTGAC
GCCTCGACGA ACGGAGCCAG TACCGATGCG GGAGCGGGTG CGGTCGAGAA TCCCCCGCTC
GTGGACGTGC TCGAGTCCTC GCCGACCGGC GTGGCGGCGT TCGCGGACGA CGGCACGTGC
GCGACCGCCA ACGAGCAGTT CCGGACGCTG TTCGGGCTCG ACGGCGACGT CTCGCTCGAG
GCTCTCGAAG CGGTGCCGTT GACCGACCAC GACGGTGACG TGATCGGCGC CGCCGACCGA
CCGGTTCGTC GAGTTCGTAG GACGAACCGT CCGGTCGTCG ACGAGCACGT ACGCGTCGAC
ACCGACGACG GGCATCGATG GGTGTCGATG ACTGTACAGG AGTCCGACGA CGGAGTCGTC
GTCACGGCGT CGGACGTGAC GGGCGTCGTC GAACGGTCGC GTCGACTGCG ACGGGAGCGT
GACGCTGTCG CGGCCGAGCT ATCGGAGTTC GCGTCCCACG CGGTCGACAG CCGACTGAAG
CTGGACGAAG ACGGGACGGT GCTGTCCGTC GACGACCGTG CCGCGGCACT CCTCGAACCG
GACGTGGTAG AACTCGTCGG TGCGTCGACG AGCGAGGCGT TCGATGCACT CAGGGGCGCA
AGCGCGATCG TCGACGCGGC GCTCGAGTCG GAGTCGAAGC AGACGGGCGA CTGTCGGCAC
GGCGACCGCG ACGCGTGGTT CGAGGTCGAG GCCGTCCCGA CGACCTCGGG CGCGTCCGTG
TTGCTCCGGG AGGTCACCGA ACAGGTGGAG CACGAACGCG AACTCCAGCG GTACGTCGGC
GTGGTCGACG CACTCGGCGA ACCGGTGTAC GAGCTCGATA GCGAAGGCCG GTTCGCGTTC
GTCAACGACG CCATCACGGA GCTCTCGGGG TACTCGCGCG AGGAACTGCT CGGCGAGCAC
GTGTCGCTCG TGATACCGGA CGATGCCGTG GACCGCATCG AGCCGCAGAT CAGCGAACTC
TTGACGGAGG ACGCCCCGGA CCGAGTGCGC TCGGAGTACC ACGTGACCAC GAAGCGCGGG
CACGCAGTGC CCGTCGAGAA CCGGTTGACG GTGCTCACGG ACGAAGCGGG GAACGTCCGC
GGGAACGCCG GCTTCGTCAG CGACATCACT GAACGCAAGG AACGAGAGCG GGAACTGGAG
CGCTACGAGC GCATCGTCGA GACGGTCGAG GACGGCATCT ACGTGCTCGA CCAGGAGGAC
CGTTTCGTCG TCGTGAACGA CGCGTTCGCG TCGATGGCTG GCGTCGACGG CGAGGATATC
ATCGGCCAGA AGGCGTCCAT CGTCTTCGAC GAGACGTTCG CGGAGCGAGT GAACGACAGG
AATGCGGCGC TGTCAGCGGA CACGCTCGAG AGCGCGAAGT TCGAAGAGAC GTTCGCGCCG
GTCGACGGCG ACCCGCTCGT GGTGGAGACG CGGTTCACGA CGTTCGCGTC CAAGGACGGG
AACACCGGAC GCGTCGGTGT GGTGCGGGAC GTCGGGGAGC GCGTCGAGCG GGAACGACGC
ATCGAACGCC AGCGGGCGCG TCTCGAGGCG CTCAACGAGG TGAACGCGGT GGTTCGCGAC
GTCGCGACGG GAGCCATCGA CGGGTCAACT CGCGAGGAGA TCGAGACGAT GGTCTGCAAG
CGTCTCGCCG CGTCGAACGC CTACGAGTTC GCGTGGATCG GGGAGATGAC TGGCGTCGAC
GGGTCGCTCG CCGTCCGAAC CGCGGCGGGT CTCGACGACC CGGACCGCGC TTCGCTGTCG
GGGATGCTCG AGGGAGTCCG TGGTCGCGGT GCGGTATCTC GTGCGGTCCG CGACCGGACG
GCACAGACGG TCCAGGACGC GTCGTTGTTG CCGACGTCGG ACCCATGGCG GGCGCTCGCC
GCCAGGTTCG GGTTCCGGTC GGCGATGGCG ATCCCGATAA CGCACGACGG CCGGATGTTC
GGCGTCCTGA ACGTGCATAC CGACCGGGAG TCCGCGTTCG CTGACGAGGA ACGACGCCTC
GTGGAGCACA TCGGTGAGGT CGTCGGGCAC GCGATCGCGG CCGTGGAGCG CAAGCGTGCG
CTCGCGAGCG AAGCCGTCCT GGAACTGGAC TATCGGGTGC CGAGGGCGTT CTCGTCGCTC
GACGTCTCCG AGTCGCTGTC GGGCACGCTC ACGTTCGACG AGACGGTGTC GACGAAGGGC
GATGAGGTGC TCGTGTACGG AACGGCGTCG CCGACGGCGA TGGAGTCGCT GGCGTCCCTC
GTCGACGAGG TACCGTACTG GGAGTCGGTG TCGGTCGTCG ACACCGACGA GGCCGGTAAC
TCGAAGTTCG AGTTGCACGC GAAGGAACCA CCCGTGTTCT CGATGGTGAC GGCACGCGGC
GGGTACGTGG ACGAGGTCGT GACCGTGGAC GGCAACTCCA GGTTCGTGTT GCACGTGCCG
CCCGGACAGG ACGTCCGTGC GATCACTGAG GGGGTGTCGG CGGCGTATCC GTCGGCGGAA
CTCGTCGCTC AGCGGCAGAT ATCTCCATCG CAGCCGTCGA TGGCGAGACT CCAGGACCGT
ATCGCGGAGG ATCTCACGGA CCGTCAGCGT GCCGCTCTGT ATGCGGCGTA CCACTCGGGG
TTCTTCGAGT GGCCGCGAGC GGCGACCGGC GAGGACGTCG CAGAGTCGCT CGGGGTGGCA
CCGCCAACGT TCAACCAACA CATCAGGAAG GCGGAGCGGA AGGTGTTCGA GGCGCTACTG
GGGGAGGGGG GCGAGGAGTC CTCGGGTTGG ACTGACACCG AGTAG
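
The gene sequence is printed in space-separated blocks of ten bases, so the blocks have to be joined before any computation. As a minimal sketch, the following joins the blocks and computes a GC percentage of the kind reported in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tool. Only the first printed line of the gene is used for illustration, so the printed percentage describes that fragment and is not expected to equal the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First printed line of the gene sequence above (a fragment, not the whole gene).
first_line = "ATGGGGGACT TGACGGCTGC ACTCCGCGAG ACGCTCGACA CGTTCGCGCC CGACGGGACG"
fragment = clean_sequence(first_line)

print(len(fragment))                    # 60
print(round(gc_percent(fragment), 1))   # GC percentage of this fragment only
```

Passing the full 2985-base sequence through the same two helpers would give the whole-gene figure for comparison with the reported 67%.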
 
Protein sequence
MGDLTAALRE TLDTFAPDGT PLTTSEVAEA LDLGRRSTYD RLDRLVDADE LRTKKVGASA 
RVWWRNDTGD AALPDAKIAD ASTNGASTDA GAGAVENPPL VDVLESSPTG VAAFADDGTC
ATANEQFRTL FGLDGDVSLE ALEAVPLTDH DGDVIGAADR PVRRVRRTNR PVVDEHVRVD
TDDGHRWVSM TVQESDDGVV VTASDVTGVV ERSRRLRRER DAVAAELSEF ASHAVDSRLK
LDEDGTVLSV DDRAAALLEP DVVELVGAST SEAFDALRGA SAIVDAALES ESKQTGDCRH
GDRDAWFEVE AVPTTSGASV LLREVTEQVE HERELQRYVG VVDALGEPVY ELDSEGRFAF
VNDAITELSG YSREELLGEH VSLVIPDDAV DRIEPQISEL LTEDAPDRVR SEYHVTTKRG
HAVPVENRLT VLTDEAGNVR GNAGFVSDIT ERKERERELE RYERIVETVE DGIYVLDQED
RFVVVNDAFA SMAGVDGEDI IGQKASIVFD ETFAERVNDR NAALSADTLE SAKFEETFAP
VDGDPLVVET RFTTFASKDG NTGRVGVVRD VGERVERERR IERQRARLEA LNEVNAVVRD
VATGAIDGST REEIETMVCK RLAASNAYEF AWIGEMTGVD GSLAVRTAAG LDDPDRASLS
GMLEGVRGRG AVSRAVRDRT AQTVQDASLL PTSDPWRALA ARFGFRSAMA IPITHDGRMF
GVLNVHTDRE SAFADEERRL VEHIGEVVGH AIAAVERKRA LASEAVLELD YRVPRAFSSL
DVSESLSGTL TFDETVSTKG DEVLVYGTAS PTAMESLASL VDEVPYWESV SVVDTDEAGN
SKFELHAKEP PVFSMVTARG GYVDEVVTVD GNSRFVLHVP PGQDVRAITE GVSAAYPSAE
LVAQRQISPS QPSMARLQDR IAEDLTDRQR AALYAAYHSG FFEWPRAATG EDVAESLGVA
PPTFNQHIRK AERKVFEALL GEGGEESSGW TDTE
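
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below illustrates that relationship on the first and last printed lines of the gene. It relies on table 11 assigning the same amino acids to codons as the standard genetic code (the two differ in which codons may act as starts, and this gene starts with ATG). The `translate` helper is hypothetical, stops at the first stop codon, and makes no attempt to handle ambiguous bases.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon.

    Whitespace is ignored, so the space-grouped blocks shown on this page
    can be passed in directly. Codons with non-ACGT letters raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last printed lines of the gene sequence; both begin in frame.
gene_first = "ATGGGGGACT TGACGGCTGC ACTCCGCGAG ACGCTCGACA CGTTCGCGCC CGACGGGACG"
gene_last = "GGGGAGGGGG GCGAGGAGTC CTCGGGTTGG ACTGACACCG AGTAG"

print(translate(gene_first))  # MGDLTAALRETLDTFAPDGT
print(translate(gene_last))   # GEGGEESSGWTDTE
```

The two outputs match the first twenty and last fourteen residues of the protein sequence above, and the final TAG codon is the stop that is left out of the 994-residue count.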