Gene Hlac_1643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1643 
Symbol 
ID7399593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1664074 
End bp1665918 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content53% 
IMG OID643708710 
Producthypothetical protein 
Protein accessionYP_002566298 
Protein GI222480061 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.784972 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.125813 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACAG AATCATCTAC ACCGGCTGAA CTGACGGAAG AACTGACCAA AGAGCGTGTT 
CGATCCCACC GTGACGACGT GATCCAGCAG GGATTCTCTA CCTCGGAGAT TGCGCTCATT
GAGGCACTTC CTGCCTCGGG CAAGAGTTAC GGAGTTCTCC AATGGGGGGC AGAAACGGAT
AATCAAATGA CGATTTTGGC TCCCCACCAC GACCTCTTGA ACGAGTATGA GAACTGGTGT
GCCGAACTGA ATCTCTCAGT AAAGCGCCTA CCTTCGTTCC ACCGAGACTG TGAGAGTGTA
TCACTTGACG ACGATGGAGA ACCTGCGGAT GAGCGCACAA AGGAACTTCT CGGATTGTAT
CGGCAGGGGA TCAGGGGAGA GAGGATTCAC CAGCAGGCAA GCAAGTTGGT CAGTAGTAAT
CTGGCTTGCC AGCACGATGG GGAATGTCCC TATATCCAGA AGCTCAATAT CGATACAGAT
GCTTACGATG TTCTCCTTGG CCACTATCTC CATGCCTACC AGACCGACTG GACTGATGAA
CGGTACGTAG CGGTCGACGA GTTCCCGGGT GACGCGTTCG TACAGGAGTT CACAGGACAC
GTCCCACCAG CTGTCACGGC ATATCTCCAA CAGGAAGACC GCCTACCGTT TCACGACTAC
GCTGAATTGC TTGAACGGCA GTCAGAGTTT CAGGACGAAG TCGAAGCGTG GAAAGAGGAC
GTTTGGTCTG ACTATGACGC TGCCCACGTT CTTCGAAATT CGAACTCGTC AGCACACGCT
CTCGCCCCGT TGATGACACG GGCGAATCTT GAGAAGGAGC GTCTGGATAA CCGATGGCAA
TTTGCTGACC TCGGGCGTGG AAAAGTAGCT GCTCGAAATA CCGATCACCA ATGGTCGTTC
CTCTTGCCTC CGAACTTCGA AGGAGCGGAG AGCGTTGTTG TCCTCGATGG AACGCCAGTC
ATCGAACTGT GGGAACTCGT AATGGGAGCA GATATTGAGC GGATTCCTCT TCTCGATGAC
GACCAGAAGC AGCTATATCT TGAGAGTGTC CTCGGACTTA ATCTCGTCCA GACCACAGAC
AACTGGAACG CCTATCAGGG TGGAGAGGGA GTGTCCCCCA CCGTCGATAT ACCCGTTGTG
GAAAAGATCG CAGAAGTTGA AGGCAGAAAC CCCGGGGTTA TCACCTCTAA GAAGGGCCTC
AATCAGTACG AAAATCATGG GCTGAACTCA TTGGTCACGC AGACAGAGAA CTATGGCGGT
CTGAAGGGTA TCAACACCCT CGGGACGACA AGGGTCGGCG TTATTCTCGG AAATCCCCAT
CCGGGCGACG ACGTAATCGA GAAGTGGGGC GCGCTGGCCA ATATTTCCGT AGAGCGACAG
GAAGGAACTG AAGGCAAAAA TACCGACTAT GGGCCCTTCG GGAATCGTGC GATGGAAGCA
GAGATCCAGA ACAAGGTTCT CCAAGCAGCT ATGCGATTCG GACGTACCGA GGAACACGGA
GAAAAGGGGG CAACTGTGTA CGTCCACACG TCGGCACTTC CGGAGTGGGT AGAGGCCAAG
AAGCGTTTCG CTACCGTTGA CTCGTGGATT ACCCATAAGA ACGGAATGAA GCAGGTGATT
GAGACAATCC GAAACTTCGA TGACTGGAAG GCTTTGGAGT GGAAGGTCGG AAAGGTAGCT
GAGTGTGTGA CCATCTCCAA GAACTCCACC CGAAAACACC TCAAGACACT TGCCGAACAG
GGGTATCTCG ATAAACGGAC TGCGGGCCGT GGTGGTGCGT TTCATTTCTC GAATGTTCGT
TTGGAGGAAG CGCAGAAGTA CGGACACGTG GAATTTGCTG AGTAG
 
Protein sequence
MSTESSTPAE LTEELTKERV RSHRDDVIQQ GFSTSEIALI EALPASGKSY GVLQWGAETD 
NQMTILAPHH DLLNEYENWC AELNLSVKRL PSFHRDCESV SLDDDGEPAD ERTKELLGLY
RQGIRGERIH QQASKLVSSN LACQHDGECP YIQKLNIDTD AYDVLLGHYL HAYQTDWTDE
RYVAVDEFPG DAFVQEFTGH VPPAVTAYLQ QEDRLPFHDY AELLERQSEF QDEVEAWKED
VWSDYDAAHV LRNSNSSAHA LAPLMTRANL EKERLDNRWQ FADLGRGKVA ARNTDHQWSF
LLPPNFEGAE SVVVLDGTPV IELWELVMGA DIERIPLLDD DQKQLYLESV LGLNLVQTTD
NWNAYQGGEG VSPTVDIPVV EKIAEVEGRN PGVITSKKGL NQYENHGLNS LVTQTENYGG
LKGINTLGTT RVGVILGNPH PGDDVIEKWG ALANISVERQ EGTEGKNTDY GPFGNRAMEA
EIQNKVLQAA MRFGRTEEHG EKGATVYVHT SALPEWVEAK KRFATVDSWI THKNGMKQVI
ETIRNFDDWK ALEWKVGKVA ECVTISKNST RKHLKTLAEQ GYLDKRTAGR GGAFHFSNVR
LEEAQKYGHV EFAE