Gene Hlac_1654 details

Gene Information

Locus tag: Hlac_1654
Symbol:
ID: 7399605
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1675338
End bp: 1676816
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 69%
IMG OID: 643708722
Product: hypothetical protein
Protein accession: YP_002566309
Protein GI: 222480072
COG category: [S] Function unknown
COG ID: [COG1641] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00299] conserved hypothetical protein TIGR00299
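
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal Python sketch of the arithmetic follows. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1675338
end_bp = 1676816

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1479 492
```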


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.10098
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAACGC TCGTCTTCGA CGGTCGGACC GGTGCCGCCG GCGACATGAT TTGCGCCGCG 
CTGATTGCGG CCGGCGCCGA CCCCGACGCC CTTGCGCCTG TGACCGATCG GCTCCCGGTC
CGGTACGAGA TCGGCGAGAC GACCAAAAAC GGGATCCGGG CGACGACCGT CGACGTGCTT
CTCGACGGCG ACGACGACGA TCACGATCAC ACTCACGGTG ACGGTCACAG CCACGGCCAC
AGCCACAACC ACGGCGACGG TGACGGCCAC GACCATGCTC ATGAGCACGG TCATTCCCAT
GAGCATGATC ACTCTCACGA TCACTCTCAC GATCACTCTC ACGATCACAC CCACGATCAC
ACCCACGATC ACGCCGAGTC CGAACACGCC GAGGGTGCCG GCGTCCACCG CACCTACCCC
GAAGTCGTCG CGCTCGTCGA GTCGATGGAT CTCCTCGAAT CCGTCGAGTC GCTCGCGCTC
GACGCTTTCG AGCGGCTCGG TCGCGCGGAG GCGTCGGTCC ACGGCACCGA ACTCGACGAG
ACCCACTTCC ACGAGGTCGG CGCGGACGAT GCCATCGCCG ACGTGGTCGG CGCGGCGCTC
CTCTTAGACG ATCTCGACCC CGAGCGGGTC GTAACGACCC CAGTCGCGAC GGGCGGCGGC
GAGGTCGAGA TGAGCCACGG CGTCTACCCG GTCCCCGCGC CGGCGACAAC CGAGGTCGCG
GCTGGCGCCG ATTTCTCGGT AGTCGGCGGG CCGATCGACC GCGAACTCCT CACGCCCACG
GGCGCCGCGA TCCTCGCCGC GGTCGCCGAG GGCGCCGACG CGATCCCCGA CCTCGACGTC
GACGCCACTG GCTACGGGGC GGGCGACGCG ACCTTCGAGA ACCACCCGAA CGTGCTCCGG
GTGCTGGTCG GCGAGGGACG CGATCTGGAG GTAGGCGGGC GCGATCACGG CGACGCTCCC
CACCACACAG ACGACCACCA AACCGACGAC CACCACACCC ACGGCCTCGT CCACGACGAC
ATCGCCGTCC TCGAAACAAA CCTCGACGAC GCCGATCCCG AGGTACTCGG CGGGCTCCAA
GAGACACTCT CGCGCGCCGG CGCCCGCGAC GTGACGATCG TTCCGACGAC GATGAAGAAA
TCGCGGCCGG GCCACCTCGT GAAAGTGATC TGCAAGCCCG AAGACGCCGA GGCGATCGCG
GAGCGGCTCG CCCGCGAGAC GGGGACTCTC GGCGTCCGCC ACTCCGGCGC GAGCCACCGG
TGGATCGCCG AGCGCGACTT CGAGACGGTA ACGCTCTCGA TCGACGGCGG CGACCACGAG
GTGACGGTGA AGGTCGCTTC GACCGCCGAC GGTGACGTCT ACGACGTGAG CGCCGAATAC
GACGACGCTG CCGAGGTCGC CGAGACGACT GGGCTCCCGA TCCGGGACGT GCTCCGTCAG
GCGGAGCGAG AGGTTCGCGA TCGGCTCGAC GAGGAGTAG
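
The gene sequence above is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation. The Python sketch below shows one way to do that and to compute a GC percentage of the kind reported in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demonstration uses only the first line of the sequence, so its result is not the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGCGAACGC TCGTCTTCGA CGGTCGGACC GGTGCCGCCG GCGACATGAT TTGCGCCGCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence to the same two functions would give the whole-gene value to compare against the reported GC content.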
 
Protein sequence
MRTLVFDGRT GAAGDMICAA LIAAGADPDA LAPVTDRLPV RYEIGETTKN GIRATTVDVL 
LDGDDDDHDH THGDGHSHGH SHNHGDGDGH DHAHEHGHSH EHDHSHDHSH DHSHDHTHDH
THDHAESEHA EGAGVHRTYP EVVALVESMD LLESVESLAL DAFERLGRAE ASVHGTELDE
THFHEVGADD AIADVVGAAL LLDDLDPERV VTTPVATGGG EVEMSHGVYP VPAPATTEVA
AGADFSVVGG PIDRELLTPT GAAILAAVAE GADAIPDLDV DATGYGAGDA TFENHPNVLR
VLVGEGRDLE VGGRDHGDAP HHTDDHQTDD HHTHGLVHDD IAVLETNLDD ADPEVLGGLQ
ETLSRAGARD VTIVPTTMKK SRPGHLVKVI CKPEDAEAIA ERLARETGTL GVRHSGASHR
WIAERDFETV TLSIDGGDHE VTVKVASTAD GDVYDVSAEY DDAAEVAETT GLPIRDVLRQ
AEREVRDRLD EE
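
The protein sequence can be cross-checked against the gene sequence by translating codon by codon. The Python sketch below is a simplified illustration, not the database's own pipeline. It builds the codon table in the usual TCAG ordering, whose amino acid assignments are shared by NCBI translation tables 1 and 11. It does not handle the alternative start codons that table 11 permits, which is not a limitation here because this gene starts with ATG. The function name `translate` is hypothetical.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGCGAACGC TCGTCTTCGA CGGTCGGACC"))  # MRTLVFDGRT
print(translate("GAGGAGTAG"))  # EE
```

The two outputs match the first ten and the last two residues of the protein sequence listed above.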