Gene Hlac_1503 details


Gene Information

Locus tag: Hlac_1503
Symbol:
ID: 7400331
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1510589
End bp: 1515367
Gene Length: 4779 bp
Protein Length: 1592 aa
Translation table: 11
GC content: 71%
IMG OID: 643708565
Product: glycoside hydrolase 15-related
Protein accession: YP_002566161
Protein GI: 222479924
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3387] Glucoamylase and related glycosyl hydrolases
TIGRFAM ID: [TIGR01535] glucan 1,4-alpha-glucosidase
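
The start, end, gene length, and protein length fields above are related by simple arithmetic, and a quick consistency check might look like the sketch below. It assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1510589, 1515367)
print(length)                     # 4779
print(protein_length_aa(length))  # 1592
```

Both values agree with the Gene Length and Protein Length fields of this record.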


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.547945
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACTGC GAACCGCTCT CACGGAACAC GAACGTCGGC GCGGCGAGCG CTACCCGGCG 
GAGCGCCCGA CGACCGCCGG CGCGTTCACG GGCGACGACG GTCGGCTGGT CCACGTCGGA
CCGAACGGGA CCGTCCACGA CTGTTCGTAC TCCCTGTCTG GGGTCGGTGG CGCCGACCGA
CTCCGCATGG GGATCACGGC CGGGCGGGGG GTCCGCTGGC TCGACGACCT GACCACGACC
CGCCAGCACT ACGACGGCGA TACGCCGCTC GTCGAAACGG AGTATGACGC CGGCCGCTAC
ACCGTCCACC AGTTCGACCT GGTCGTGAGC GACACCCACC TCACCCACGT CGAGCTGCGC
GGTGCCCCGC CAGCGGACGC GGAACTCGTC GCGGCCTGCG CGTTCTCGCC GGACATGGTC
GAGGGCCGCG TCGGTAACCT CGTCCACGAG GAGGCCGGCC CCCAGTCCGG AAGCGTCGTC
GAGGTGTACC ATCGGACCGA ACACGACTTT CTCACGGCCT CTACGGGACT GTCGGCCGCC
CACGGTCAGC GGCTCCGCAC TGTCTCCGAA CTGCTCGGCG AGAGCGGCGA GGGGTTCCCG
CACCGCGGCG AGATCGACCA ACGCGAGGAT TCTCGGCTCA CCCCCGACGT GGTCGTCCGC
GCGCCGTTCG AGCGCGACGG CCGGACCGAG CGCGTCACGC TTGCGAGCCG CGCCGTCCTC
GACAGGCGGG AGACCCGAGA GACAGTCGAT GACGCCAAGG ACGCGATTTC GGACGGTCCG
GACCGTCAGC GCCGCATTGA AGAGCTGTCG CGGATCGCGA CCGCGTACCC CGACGCCGAC
GACCTCCGCG AGGCCGCCGA GGGTCGTGGT CCGACCGTCC CCGACGACGT GCCGCGGCGG
TCGGTGGTCG CGGGTGACCT CCGCGCGCTC GACCTACTCA CTGCCGAGTC CGGTGCGCGG
ATCGCCGCCC CGGAGTTCGA CCCCTTCTAC TCCACGTCCG GCGGCTACGG CTACACGTGG
TTCCGCGACG AGGCGGAGAC GTCGCTGGCC CTGCTCGGCG CGAGCGACGA GCTCGGCTTG
GACGCCGACG AGGAGCTGCT CGCGACGGCC TCCTTCTTCT GTCGCACGCA GGACGCGGAC
GGCTCGTGGC CCCACCGGGT GTGGGCCGAC TCCGGCAAGG TCGCGCCCGG CTGGGCGAAC
GCCCGAATCG AGGGTGCGAA CGCCACCGCC GGCCCGAACG ATCAGCTGGA TCAGCCCGCG
TCCGTCGTCG CCTTCCTCGC CCGACTCCGC CGAACCACGG ACCTCCCCCC TGAGTGGCGC
GATCGTGTCG ACGACACAAT CGCGGACGCT ATCGATTTCC TCCGCGAGAC GACCGAGCCG
GACGGGCTCC CCCGCCGGTG TCAGAACTGC TGGGAGAACG CGCTCGGCCG GTTTACTCAC
ACCGGCGCGA CCTACCTCCG CGCGTTCGCT GCCGTGGCGC GCGCGCCGCT CCCCGATGAT
GTGCGGGCCG ACGCCGCCGA CGCGGCCGAC GCCGCCCTTG CGGGCCTAAA CGGTCGGTGG
AACCCGGACA CGGAGCGATT CCCCCAGCGG GCGAGCGCTG AGAGCCGCGA TGACCGCTCG
GACGCGAGCA CGTTCGCGCT TGCGAGCGCC GCGACCGAGT ACGCCGCGTT GCGCGACGAG
CGCAACGAGA TCGGTGCCGA CGGCGGTTCG ACTCCCGAGA TCGACACCAC ACCGGCTGCG
GCCGACGTCG ACTTCGACGC GTTCCTCGAC CGTGTGACGA CGCACGTGTT GACGACGATC
GGCGAGCTAC GCCGCGAGAC GGCCGACGTG GAGGGGCTCG TTCGGTTCTG GGGCGACGAC
TGGCGCACCG CGGAGCAGGG CGCCGCGAAG GTGTGGTCGA TCGCGACGCT GTGGGGAGCG
ACGGCCGCCG CCGAACTGGG CGGGCTCATC CGCGAGCGAG ACGGGGACGC AGCCGACCTC
TTCAGCGCCG CGCGCGACCT CTACGCGCTG TGTGAGCCCG ACGGGCCGTT CGTCAACGAC
GCCGGGCTCA TCGCCGAACA GGCGTTCGAC GACGGCGACC TCGACGGTGC GACCCCGATC
GCGTGGTCGC ACGCCCTCCG GATCGACGCG ACGGTGACCC TCGCGAGACA CGGGGCGCTC
CCGGTCCCGC ACGACGAGCC GCGGGGGCCG GACGAGGCAC CCCACTGGAC GACCGGCCGG
AAGTTCGGCG TCGGGACGCC GGCCGACCAC AAGGCCGCCG ACCCGGTCCC GGTCTGGTTC
ACGCTGACCG AGGGTGCGCT CACCGAGGCG CGGTTCCCCC GGATCGACGT GATGAACGTC
CGGACGTTCG ACTTCCTCGT CGCCGACCCG GAGACGGGGT ACACGGTCCG GACCTTCGAC
GAGACGAGCC ACGTGACGGC GACGGAGACG GTCGAGCGGA CCACGGAGCC CACCGTCGAC
GACGCCCTCG CGTACACGCA GACGATCTGC GAGACTGGCG ATGGCCACGG CCACAGCTGG
ACGCTCACCG TCGAGTACGC GGTCGATACC GACGGCGACG CGATCCTCGC CGACGTGGCG
TTCGAGGGGA GCCGCGAGTA CGACGTGTAC GCCTTAGTCG ACACGACGAT CACGAACGTC
GGATCGAACG ACCGCGGCGA CCGCGTCGAT GACTCGGACG GCTACCACCT CCTCGCGCGC
AACAACGACG CCGCCGAGCG CAGCACGGGG AAGCTCGTCG ACGACGACGG CGACTCCTTC
GAGGTCGCGC TCGCGATTGA CAGCGACGAC GGTTTCGAGT GGGCGAGCGT GCTCGCGGCG
GGCAGCGACA AGGCGGGTGC ACTGTTCGCC GACGGCGACC GGGGCGAGGG GACCGAGACG
GCCACCGGCA ACGTCGTACT CGCCGGACTC GTGGGGAGCG GTACCGAGGT GTCGGACGCG
GTCGCGCTCG GGTTCGCCGA GAACGCCGAC ACCGCCGCCG CCCTCGGAGA GGCTCGCGGC
GCCATCTCCC GCGGCTTCGC GGACATCTCC GAGTCGTACG TCGAGACGTG GCGCGAGTGG
CTCGCCGACC GGAAGTTCCC CGCTTCCGTG ACCGGCGACG CAGACCTAGA GACGCAGTAC
CGGTTCGCGC TGATGACGCT TGCGGCCGTC GAGGACAAAC GGCACGACGG CGCCGGGATC
GCCAGCCCGT CGGTACCGTG GGGCGAGACC GAGTACGCCG CCGAAGACCG AGGCTACGGC
TACAACTTCG TCTGGTCGCG CGACCTCTAT CAGGTGTTCA CCGCGCTGAT CGAGGTTGGC
GAGGTCGAAC GCGGTGCCGA CGCGCTCGCG TACCTGTACA ACACCCAGCA GGACGACGAC
GGGTTCCTCC CGCAGAACAC CTACATCGAC GGTCGTACTC GGTGGGGCGG CGAGCAGATG
GACAACATCG CGTTCCCGTC GGTGATGGCG TGGCAACTGT ACGAACACGG GATCTCGCCC
GGCGACACCG ACTACACCTA CGATCAGGTC CGTCGCTCGC TCGGCTATAT CGCCGCCAAC
GGCCCGGAGA CGGCTCAGGA GCGCTGGGAG GAGGAGGCCG GCTTTTCGCC GTCGAGCATC
GCCGCCGAGA TCGCGGGGCT CTGTTGTGGC GCCGCGCTTG CGGTCGCCGA GGCCGATCGG
ATCGAGGCCG GAAACGCCGG TTCCGGATCC GGACCGGACG CTGACTCCGA CTCACTCCGT
GCCGACGCCC TCGCGTGGCT CGCGCTCGCG GACGACTGGA CGACCCGCGT CGAGGAATGG
TGCGCGACGG CCACAGGTAC CGAGCGGCAC GCCGAGACGC CGTACTACCT CCGGGTCACC
GCCGACGGCG ACCCCGAGTC CGGGCGCCCC CGGACGATCG CCAACGACGG CCCCACCTAC
GACGAGCGGG AGATCGTTGA CGGCGGCTTC CTCGAACTGG TCCGACTCGG CGTGAAGCCC
GCCGACGACG ACGTGGTCCG CAACTCCGTG TCGGTCGTCG ACGATTCGAT CCGCGTCGAC
ACCCCGTACG GTCCCGCGTG GTACCGGTAC GTCGGCGACG CGTACGGCGA ACTGGCGCGC
AGCGATCCGG GCGGCCCGTG GGCCGGCACC GGTGACGGCC GCGGTCGGCT GTGGCCGATC
TTCACCGGCG AGCGCGGCGA GTACGAACTA CGCGCTCGCG CCGACGGCCC AGACGCCTTT
GGCGGTACCG ACGAGGACGC CCTCGAACCC GACGCTCTCT TGGAGACGAT GGCCGGGTTC
GGCAACTCCG GACGGATGCT CCCCGAGCAG GTGTGGGACC GTGAGCATCC GACCAACTAC
GGCTGGGAGT TCGGCGAGGG AACCGGGGGG GCCACCCCGC TCGCGTGGTC GATGGCGGGG
TTCATCCGGC TCGCTCACGG GGTCGAGGCC GGCGAACCCG TCGAGACGCC GACGGTCGTC
CGCGACCGCT ACGTCGAGCG CGACCGCTCG GCGACGCCCG ACCTCGACGC GACCGCGGAG
TACGTCAACA ACCATCTCGT CGTCGCCGGC GAGACCACCG CCGACGTGGT CGCGGTGTAC
ACGCCGGACG AATCTGCGCT CGTGACGGTC GAGGACGGGG AGTACGAGTT CCGCCTCGAC
GCGGCGCTCG ATGCGAAGAC GGTCGTCGTC GCCGCCGCGA ACACCGACGG CGGCGCGGAC
GATACGGCGG CCGACGCGGG CGATGAGGCG ACCGATACCG ACGCGATCCT CGACGAGTTC
TCAGGTGCCG GCACGGCTGT TAAGCGGCTT CGGGTGTAG
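
The gene sequence above is laid out in space-separated blocks of ten bases. A minimal sketch for stripping that layout and computing a GC percentage comparable to the GC content field follows. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed value describes that line rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T bases in seq."""
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("no A/C/G/T bases found")
    return sum(base in "GC" for base in counted) / len(counted)


# First line of the gene sequence, copied from the record.
first_line = "ATGCGACTGC GAACCGCTCT CACGGAACAC GAACGTCGGC GCGGCGAGCG CTACCCGGCG"
seq = clean_sequence(first_line)
print(len(seq))                                 # number of bases on that line
print(f"{100 * gc_fraction(seq):.0f}% GC")      # GC percentage of that line only
```

Passing the full gene sequence text to `clean_sequence` instead of `first_line` would give a value to compare against the GC content field.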
 
Protein sequence
MRLRTALTEH ERRRGERYPA ERPTTAGAFT GDDGRLVHVG PNGTVHDCSY SLSGVGGADR 
LRMGITAGRG VRWLDDLTTT RQHYDGDTPL VETEYDAGRY TVHQFDLVVS DTHLTHVELR
GAPPADAELV AACAFSPDMV EGRVGNLVHE EAGPQSGSVV EVYHRTEHDF LTASTGLSAA
HGQRLRTVSE LLGESGEGFP HRGEIDQRED SRLTPDVVVR APFERDGRTE RVTLASRAVL
DRRETRETVD DAKDAISDGP DRQRRIEELS RIATAYPDAD DLREAAEGRG PTVPDDVPRR
SVVAGDLRAL DLLTAESGAR IAAPEFDPFY STSGGYGYTW FRDEAETSLA LLGASDELGL
DADEELLATA SFFCRTQDAD GSWPHRVWAD SGKVAPGWAN ARIEGANATA GPNDQLDQPA
SVVAFLARLR RTTDLPPEWR DRVDDTIADA IDFLRETTEP DGLPRRCQNC WENALGRFTH
TGATYLRAFA AVARAPLPDD VRADAADAAD AALAGLNGRW NPDTERFPQR ASAESRDDRS
DASTFALASA ATEYAALRDE RNEIGADGGS TPEIDTTPAA ADVDFDAFLD RVTTHVLTTI
GELRRETADV EGLVRFWGDD WRTAEQGAAK VWSIATLWGA TAAAELGGLI RERDGDAADL
FSAARDLYAL CEPDGPFVND AGLIAEQAFD DGDLDGATPI AWSHALRIDA TVTLARHGAL
PVPHDEPRGP DEAPHWTTGR KFGVGTPADH KAADPVPVWF TLTEGALTEA RFPRIDVMNV
RTFDFLVADP ETGYTVRTFD ETSHVTATET VERTTEPTVD DALAYTQTIC ETGDGHGHSW
TLTVEYAVDT DGDAILADVA FEGSREYDVY ALVDTTITNV GSNDRGDRVD DSDGYHLLAR
NNDAAERSTG KLVDDDGDSF EVALAIDSDD GFEWASVLAA GSDKAGALFA DGDRGEGTET
ATGNVVLAGL VGSGTEVSDA VALGFAENAD TAAALGEARG AISRGFADIS ESYVETWREW
LADRKFPASV TGDADLETQY RFALMTLAAV EDKRHDGAGI ASPSVPWGET EYAAEDRGYG
YNFVWSRDLY QVFTALIEVG EVERGADALA YLYNTQQDDD GFLPQNTYID GRTRWGGEQM
DNIAFPSVMA WQLYEHGISP GDTDYTYDQV RRSLGYIAAN GPETAQERWE EEAGFSPSSI
AAEIAGLCCG AALAVAEADR IEAGNAGSGS GPDADSDSLR ADALAWLALA DDWTTRVEEW
CATATGTERH AETPYYLRVT ADGDPESGRP RTIANDGPTY DEREIVDGGF LELVRLGVKP
ADDDVVRNSV SVVDDSIRVD TPYGPAWYRY VGDAYGELAR SDPGGPWAGT GDGRGRLWPI
FTGERGEYEL RARADGPDAF GGTDEDALEP DALLETMAGF GNSGRMLPEQ VWDREHPTNY
GWEFGEGTGG ATPLAWSMAG FIRLAHGVEA GEPVETPTVV RDRYVERDRS ATPDLDATAE
YVNNHLVVAG ETTADVVAVY TPDESALVTV EDGEYEFRLD AALDAKTVVV AAANTDGGAD
DTAADAGDEA TDTDAILDEF SGAGTAVKRL RV