Gene Hlac_1997 details


Gene Information

Locus tag: Hlac_1997
Symbol:
ID: 7402016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1991696
End bp: 1992733
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 69%
IMG OID: 643709068
Product: putative deoxyhypusine synthase
Protein accession: YP_002566645
Protein GI: 222480408
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1899] Deoxyhypusine synthase
TIGRFAM ID: [TIGR00321] deoxyhypusine synthase
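
The gene and protein lengths follow from the start and end coordinates. The short Python sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1991696
end_bp = 1992733

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints: 1038 345
```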


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.428923
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.191458
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACA GCGACGACGG GGGCGACCCG CCGCACGAGG AGTTCCACGA GGACCCGGTC 
GGACACACCC GCGCGACGGC CGGGATGACC GTCGGGGAGC TGGTCGAGGG GTACGGCGAC
GCGGGGATCG GCGCAGCGTC GGTCAACGAG GCGGGCGACG TGCTCGCAGA GATGTTCGCG
AACGACGACT GCACCGTGTT CCTCTCGCTG GCGGGCGCGA TGGTGCCCGC GGGGATGCGC
CGGATCGTCT CCGATCTCAT CCGAGACGGC TACGTCGACG CGCTGGTGAC GACGGGCGCG
AACCTCACCC ACGACGCCAT CGAGGCCATC GGCGGGAAAC ACCACCACGG TCGGACCCAC
GACCCCGAGA AGAGTCTCCG CGAGCACGAC GAGGGGCTCC GCGACGAGGG CGTCGACCGC
ATCTACAACG TCTACCTCCC GCAGGAGCAT TTCGCGGCCT TCGAGGGTCA CCTGCGCGAG
GAGGTGTTCC CGGCGCTCGA AGCCGATCCG GACGACGACG GAAACGGCGC CGTCGGCATC
GCAGATCTCA CACGCGAGCT CGGACGCGCC AACGCCGCGG TTAACGAACG CGACGACGTG
GCCGAGGACG CCGGCGTCGC CGCCGCGGCC TACGAGTGCG ATGTGCCCGT CTACTGTCCC
GCCGTGCAGG ATTCCGTGCT CGGGTTACAG GCGTGGATGT ACGCCCAGAC TGCCGACTTC
ACGCTCGACG CCTTAGACGA CATGACGGAA CTGACCGACC TCGCGTTCGA CGCCGACGAC
GCCGGCTGCC TGCTTGTCGG CGGCGGCGTC CCGAAGAACT TCACGCTCCA GACGATGCTC
GTCACGCCCC GCGCCTACGA CTACGCCGTT CAGATCACGA TGGACCCGGA GGCGACCGGC
GGGCTCTCCG GTGCCACCTT AGAGGAGGCT CGGTCGTGGG GGAAACTGGA GAAGGACGCG
CGCAACGCCT CCGTCTACGG CGACGCGACC GTTATGCTGC CGATGCTTAT TGCTGCCGCC
CGCGAGCGCG TGGAGTAG
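
The record reports a GC content of 69%. A figure like this can be recomputed from the gene sequence as the share of G and C bases. The following is a minimal sketch, not the database's own method: `gc_content` is a hypothetical helper, and the excerpt is only the first line of the sequence above, so its result will differ from the whole-gene value.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks used to group bases in blocks of 10.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above, used here as a short excerpt.
excerpt = "ATGAGCGACA GCGACGACGG GGGCGACCCG CCGCACGAGG AGTTCCACGA GGACCCGGTC"
print(f"{gc_content(excerpt):.1f}%")
```

Passing the full 1038 bp sequence to the same function gives the whole-gene value for comparison with the reported figure.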
 
Protein sequence
MSDSDDGGDP PHEEFHEDPV GHTRATAGMT VGELVEGYGD AGIGAASVNE AGDVLAEMFA 
NDDCTVFLSL AGAMVPAGMR RIVSDLIRDG YVDALVTTGA NLTHDAIEAI GGKHHHGRTH
DPEKSLREHD EGLRDEGVDR IYNVYLPQEH FAAFEGHLRE EVFPALEADP DDDGNGAVGI
ADLTRELGRA NAAVNERDDV AEDAGVAAAA YECDVPVYCP AVQDSVLGLQ AWMYAQTADF
TLDALDDMTE LTDLAFDADD AGCLLVGGGV PKNFTLQTML VTPRAYDYAV QITMDPEATG
GLSGATLEEA RSWGKLEKDA RNASVYGDAT VMLPMLIAAA RERVE
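
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates that mapping in plain Python. The names `CODON_TABLE` and `translate` are hypothetical. The amino acid assignments are those of NCBI table 11, which match the standard code, and the alternative start codons that table 11 permits are not handled because this gene starts with ATG.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of 10.
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCGACA GCGACGACGG GGGCGACCCG"))  # prints: MSDSDDGGDP
```

The printed residues match the first ten residues of the protein sequence, and the final codons of the gene (`GTGGAGTAG`) translate to `VE` followed by a stop.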