Gene Hlac_2265 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2265 
Symbol 
ID7399975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2252949 
End bp2254157 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content71% 
IMG OID643709338 
ProductHEAT domain containing protein 
Protein accessionYP_002566911 
Protein GI222480674 
COG category[C] Energy production and conversion 
COG ID[COG1413] FOG: HEAT repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0591737 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTGT ACCAGCACGC GCGAGATGGG AACGCCGAGC GGCTCAGAGA CGCCATGGGC 
AGCGACAGCG CCGCGGTCCG GAAGCGTGCG GCCGAGTTCC TCGGCGAGGT CGCCGACCAC
GGCGACCAGC CGTCTATCGA CGTACTCCTG CGCGCCGCGA CCGGGGACGA GGACGCACAG
GTCCGCGGGG GCGCTGTCGA TGCGCTCGAC GAGATCGGTG AGGCGGCGCT CGAACAGCTC
CTCTCGGAGC TCACCGGCAC CAAAGGGTCC GAGGCGGAGT GGGTCACCGC CCGGAAGTTC
GCCCGCGCGC TGCAGGCCGA CCGCCCGGAG CTACGGATGG CCGCCGCCAA CGCGCTCGGC
CGACTCGACG ACGGGAGCGG TCTCCAGCAC CTCGTCGAGG CGCTCGACGA CGAGGACGCC
CGCGTCAGGC TCCGGGCGTG TCAGGCCTGC GGGACCTTCG CCGATCCGCG TGCAATCCCC
GGGCTGACCG AGCGGCTCGA CGACGAGCCG CGAGTCCGGC GGGCCGCCGC CAACGCCCTC
GGAAACATCG GTACCGATCG GGCGCTTTCT CCCCTCCTGG ACCTGCTCGA CGACGCCGAC
GAGTCACTCC GGCGCATCGC CGCCGGCGCG CTCGGGAAGG CGAACAACCC CGAGCCGGTC
GAGCCGCTGG CGCGCGCGCT CGGCGACGAG AGCGCGGTAG TGCGCAACGC CGCGGTGTAC
TCGATCATCG AGCTGCTCTC GAACGTACCG ACTCAGCAGA GCCACGCGGT CCGCGATCGA
GTCGTCTCCG AGCTGAAGCA GGCCGACGAC GAGACGGTCG TCGAGCCGCT GGTAGAGATC
CTCACAGACG GCCAACAGAG CCGCCAGCGA CGGAACGCGG CGTGGATCCT CGGCCGCGTC
GCAGAACGGG AGTCGACGGT CGCAGTCGAG GCGCTCGCGG ACGCGCTCGC GGACGACGAC
GCGCAGACCG CGCAGTTCGC GGCCACCAGC CTCAAGAGCC TCGGCGGGCC GATCGTCGAG
GACCGCCTCC TCGACCGGCT CGGAACCGAA CACCCCGAGG ACGCCCGGGC GAAGGCGGTG
TTCGTGCTCG GTCAGGTGGG TGGACAGGAG ACGCTCAACC GGCTCGAAGA GTTCACTGAC
GACGAGAGCC CTGCGGTCCG GAAGCGGGTG TTCTCGGCAG TCTCGAAGCT CCGGGCCGGG
GGTCCGTAA
 
Protein sequence
MSLYQHARDG NAERLRDAMG SDSAAVRKRA AEFLGEVADH GDQPSIDVLL RAATGDEDAQ 
VRGGAVDALD EIGEAALEQL LSELTGTKGS EAEWVTARKF ARALQADRPE LRMAAANALG
RLDDGSGLQH LVEALDDEDA RVRLRACQAC GTFADPRAIP GLTERLDDEP RVRRAAANAL
GNIGTDRALS PLLDLLDDAD ESLRRIAAGA LGKANNPEPV EPLARALGDE SAVVRNAAVY
SIIELLSNVP TQQSHAVRDR VVSELKQADD ETVVEPLVEI LTDGQQSRQR RNAAWILGRV
AERESTVAVE ALADALADDD AQTAQFAATS LKSLGGPIVE DRLLDRLGTE HPEDARAKAV
FVLGQVGGQE TLNRLEEFTD DESPAVRKRV FSAVSKLRAG GP