Gene Hlac_1305 details


Gene Information

Locus tag: Hlac_1305
Symbol:
ID: 7399400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1316216
End bp: 1318012
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 73%
IMG OID: 643708369
Product: Bacterio-opsin activator HTH domain protein
Protein accession: YP_002565967
Protein GI: 222479730
COG category: [R] General function prediction only
COG ID: [COG3413] Predicted DNA binding protein
TIGRFAM ID:
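
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1316216, 1318012)
print(length_bp)                  # 1797
print(protein_length(length_bp))  # 598
```

Both results match the Gene Length and Protein Length fields of this record.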


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGAGG GACTTGACGC CGAGGAGTAT GAGGCGCTCG TGACCGGGGC GGAGACGTAC 
CGCGCCGCGC TGGTCGTCCG TCTCGGCGGC GAGGCGGGGC TCCGAACTGG CGAGATTACG
TGCGTCACCC CGGGACACCT CCGTGAGACG GAAGGCGACG CGGATCTCTC CCTCCTCGCG
GTTCCATCGG ACGACACGGA AGGGGAAACC AAAGCGGGCG ACGATCCCGA GACGGAGGGC
GCACCACGGG ACCACAGCGG GATCGACCGC GAAACGGCGA TTCCGGCGTC ACTGGCAGCC
GAGCTGCGGC GGTACGCAGA GAGCGCCGAC CTCCGCGAGT CGGAGCCATT CGTCGACGTG
TCGCCCCGCC GCGTCCAGAT GATCGTGAGC GAGACCGCAG AGCACGCGGC CGCGCGGACC
GACGGCCTCG TCGACCCCGA CGTGACCCCG CGAGATCTCC GACGGACCTT CGCGCGACGA
CTCCTCGTCG ACCGCGGCGT CGACCCCCAC GCGGTTCGCG AGGCCGGCGG CTGGGAGACG
ATGGCGACGC TCGACGGTTA CCTCGGGGCG CTCGACGGGG ACGCAATCGC CGAGGCGATC
GCCGGCGATC GGGCCGGGTC CTCGGACGGA CCGGCGGCAG AATCTGCGCC CACCACCCTT
GGCGGGTTCG AGGCGCTCGC CGACGGAGAC GACCGGAAGA CGCCCCTCGC GACGGTTCCC
GGCGGCGTAG TTGAGGCCGA TCGCTGGGCC GAGGCGTGGG TCGCCCGCGG GATGGGGGAC
CGAGACCGCG TCGAGATCGC GGGCGCGGCC GGGGCCGACC GAGAGACCCT CGTCGATCGC
GGTGCGACCG CCGACGGTCC GTGTCGTGAC GCGGTCGAGG CGGGAGAGCC GGTCGCGACC
GAGGGATCGC CCGCGACCGC GGGTCGACCG GCGATCGCGG TTCCCGTTCG GTACCGCGAC
GTGACGCACG GCGCGCTGTG CGTCGTTGCC GGCGGAGAGC CGCCAGTCTC TCCCGTCTCG
CCGGCCGAGC GCCGGGAGAT CGCGGCGCTC GGCCGGTGCC TCGGGTGGGC CGTGACTGCG
GGGCGCTGGC GCGACCTGCT CCACTCCGAC GCGGTGACCG AGGTGGAGTT CCACACCGGC
GACGAGGGTG CGTTCCTCTC CCGCGCGAGC GCGGCGCTCG GCTGTCGGAT CGATCTGGCC
TCGACGGTGG CGGTCGACGA CGACGCCTCT CGCTTCTACC TCTCCGTCGA AGGGGCGCGC
CCGCAGGCGC TGGCCGACGC AGTCGCGGGT GCGTCCGGCG TCTCGGATCT CCGCGTGATC
GAGACCCGAG AAGACGGTTG CGACGTGTCC GTGCGCGTGG AGGGCGGGTC GGCGGTCCGA
GCGCTCACCG AGCACGGCGC GACCGTTCGC GACGCGACCG CGGAGGACGG GCGGGTCAGG
GTCGTCGCAG ACCTTCCGGA AGGCGCCGAT GTCCGCCCCG TCGCCGACGG GTTCCGGGCT
ACCTTCGCCG ACGCGCGACT CGCGAGCAAG GAGTCTGTCG CGCGGCCGGC CCGTAGCGAG
GACTCGTTGC GAGACGGAGT CGCGGAACGG TTCACCGACC GCCAGTGGGC CGCGCTGTCG
GCTGCGTACC ACGGCGGATA TTTCGACTGG CCGCGCGGGA GCACCGCCGA GGAGGTCGCC
GACGCCATGG ACGTCTCCTC GCCGACGTTT CACAATCACC TCCGGAAGGC CCAACGCAGA
CTGCTCGACG AGCTCTTCGA GGACGGCCGG CGGGCCCGTC GCCTCGATCA GGGGTGA
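
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before any calculation such as GC content. The following is a minimal sketch of that cleanup and of a GC-fraction calculation; the helper names are hypothetical, and the demonstration uses only the first line of the sequence, so its value is not the whole-gene GC content reported in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used as a small demonstration.
first_line = "ATGGTCGAGG GACTTGACGC CGAGGAGTAT GAGGCGCTCG TGACCGGGGC GGAGACGTAC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # 65%
```

Passing the full gene sequence to the same two functions gives the whole-gene value for comparison with the GC content field.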
 
Protein sequence
MVEGLDAEEY EALVTGAETY RAALVVRLGG EAGLRTGEIT CVTPGHLRET EGDADLSLLA 
VPSDDTEGET KAGDDPETEG APRDHSGIDR ETAIPASLAA ELRRYAESAD LRESEPFVDV
SPRRVQMIVS ETAEHAAART DGLVDPDVTP RDLRRTFARR LLVDRGVDPH AVREAGGWET
MATLDGYLGA LDGDAIAEAI AGDRAGSSDG PAAESAPTTL GGFEALADGD DRKTPLATVP
GGVVEADRWA EAWVARGMGD RDRVEIAGAA GADRETLVDR GATADGPCRD AVEAGEPVAT
EGSPATAGRP AIAVPVRYRD VTHGALCVVA GGEPPVSPVS PAERREIAAL GRCLGWAVTA
GRWRDLLHSD AVTEVEFHTG DEGAFLSRAS AALGCRIDLA STVAVDDDAS RFYLSVEGAR
PQALADAVAG ASGVSDLRVI ETREDGCDVS VRVEGGSAVR ALTEHGATVR DATAEDGRVR
VVADLPEGAD VRPVADGFRA TFADARLASK ESVARPARSE DSLRDGVAER FTDRQWAALS
AAYHGGYFDW PRGSTAEEVA DAMDVSSPTF HNHLRKAQRR LLDELFEDGR RARRLDQG
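
The protein sequence is the translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, assuming the amino-acid assignments of table 11 are the same as those of the standard genetic code (table 11 differs in which start codons it permits, which is not modelled here). The names `CODON_TABLE` and `translate` are illustrative, and a codon containing characters other than A, C, G, or T would raise a `KeyError`.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGGTCGAGG GACTTGACGC C"))  # MVEGLDA
print(translate("GATCAGGGGT GA"))            # DQG
```

The two fragments reproduce the first seven and last three residues of the protein sequence, and the final TGA codon accounts for the three bases of the 1797 bp gene that do not encode a residue.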