Gene Information |
Locus tag | Hlac_1503 |
Symbol | |
ID | 7400331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 1510589 |
End bp | 1515367 |
Gene Length | 4779 bp |
Protein Length | 1592 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643708565 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_002566161 |
Protein GI | 222479924 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | [TIGR01535] glucan 1,4-alpha-glucosidase |
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.547945 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACTGC GAACCGCTCT CACGGAACAC GAACGTCGGC GCGGCGAGCG CTACCCGGCG GAGCGCCCGA CGACCGCCGG CGCGTTCACG GGCGACGACG GTCGGCTGGT CCACGTCGGA CCGAACGGGA CCGTCCACGA CTGTTCGTAC TCCCTGTCTG GGGTCGGTGG CGCCGACCGA CTCCGCATGG GGATCACGGC CGGGCGGGGG GTCCGCTGGC TCGACGACCT GACCACGACC CGCCAGCACT ACGACGGCGA TACGCCGCTC GTCGAAACGG AGTATGACGC CGGCCGCTAC ACCGTCCACC AGTTCGACCT GGTCGTGAGC GACACCCACC TCACCCACGT CGAGCTGCGC GGTGCCCCGC CAGCGGACGC GGAACTCGTC GCGGCCTGCG CGTTCTCGCC GGACATGGTC GAGGGCCGCG TCGGTAACCT CGTCCACGAG GAGGCCGGCC CCCAGTCCGG AAGCGTCGTC GAGGTGTACC ATCGGACCGA ACACGACTTT CTCACGGCCT CTACGGGACT GTCGGCCGCC CACGGTCAGC GGCTCCGCAC TGTCTCCGAA CTGCTCGGCG AGAGCGGCGA GGGGTTCCCG CACCGCGGCG AGATCGACCA ACGCGAGGAT TCTCGGCTCA CCCCCGACGT GGTCGTCCGC GCGCCGTTCG AGCGCGACGG CCGGACCGAG CGCGTCACGC TTGCGAGCCG CGCCGTCCTC GACAGGCGGG AGACCCGAGA GACAGTCGAT GACGCCAAGG ACGCGATTTC GGACGGTCCG GACCGTCAGC GCCGCATTGA AGAGCTGTCG CGGATCGCGA CCGCGTACCC CGACGCCGAC GACCTCCGCG AGGCCGCCGA GGGTCGTGGT CCGACCGTCC CCGACGACGT GCCGCGGCGG TCGGTGGTCG CGGGTGACCT CCGCGCGCTC GACCTACTCA CTGCCGAGTC CGGTGCGCGG ATCGCCGCCC CGGAGTTCGA CCCCTTCTAC TCCACGTCCG GCGGCTACGG CTACACGTGG TTCCGCGACG AGGCGGAGAC GTCGCTGGCC CTGCTCGGCG CGAGCGACGA GCTCGGCTTG GACGCCGACG AGGAGCTGCT CGCGACGGCC TCCTTCTTCT GTCGCACGCA GGACGCGGAC GGCTCGTGGC CCCACCGGGT GTGGGCCGAC TCCGGCAAGG TCGCGCCCGG CTGGGCGAAC GCCCGAATCG AGGGTGCGAA CGCCACCGCC GGCCCGAACG ATCAGCTGGA TCAGCCCGCG TCCGTCGTCG CCTTCCTCGC CCGACTCCGC CGAACCACGG ACCTCCCCCC TGAGTGGCGC GATCGTGTCG ACGACACAAT CGCGGACGCT ATCGATTTCC TCCGCGAGAC GACCGAGCCG GACGGGCTCC CCCGCCGGTG TCAGAACTGC TGGGAGAACG CGCTCGGCCG GTTTACTCAC ACCGGCGCGA CCTACCTCCG CGCGTTCGCT GCCGTGGCGC GCGCGCCGCT CCCCGATGAT GTGCGGGCCG ACGCCGCCGA CGCGGCCGAC GCCGCCCTTG CGGGCCTAAA CGGTCGGTGG AACCCGGACA CGGAGCGATT CCCCCAGCGG GCGAGCGCTG AGAGCCGCGA TGACCGCTCG GACGCGAGCA CGTTCGCGCT TGCGAGCGCC GCGACCGAGT ACGCCGCGTT GCGCGACGAG CGCAACGAGA TCGGTGCCGA CGGCGGTTCG ACTCCCGAGA TCGACACCAC ACCGGCTGCG GCCGACGTCG ACTTCGACGC GTTCCTCGAC CGTGTGACGA CGCACGTGTT GACGACGATC 
GGCGAGCTAC GCCGCGAGAC GGCCGACGTG GAGGGGCTCG TTCGGTTCTG GGGCGACGAC TGGCGCACCG CGGAGCAGGG CGCCGCGAAG GTGTGGTCGA TCGCGACGCT GTGGGGAGCG ACGGCCGCCG CCGAACTGGG CGGGCTCATC CGCGAGCGAG ACGGGGACGC AGCCGACCTC TTCAGCGCCG CGCGCGACCT CTACGCGCTG TGTGAGCCCG ACGGGCCGTT CGTCAACGAC GCCGGGCTCA TCGCCGAACA GGCGTTCGAC GACGGCGACC TCGACGGTGC GACCCCGATC GCGTGGTCGC ACGCCCTCCG GATCGACGCG ACGGTGACCC TCGCGAGACA CGGGGCGCTC CCGGTCCCGC ACGACGAGCC GCGGGGGCCG GACGAGGCAC CCCACTGGAC GACCGGCCGG AAGTTCGGCG TCGGGACGCC GGCCGACCAC AAGGCCGCCG ACCCGGTCCC GGTCTGGTTC ACGCTGACCG AGGGTGCGCT CACCGAGGCG CGGTTCCCCC GGATCGACGT GATGAACGTC CGGACGTTCG ACTTCCTCGT CGCCGACCCG GAGACGGGGT ACACGGTCCG GACCTTCGAC GAGACGAGCC ACGTGACGGC GACGGAGACG GTCGAGCGGA CCACGGAGCC CACCGTCGAC GACGCCCTCG CGTACACGCA GACGATCTGC GAGACTGGCG ATGGCCACGG CCACAGCTGG ACGCTCACCG TCGAGTACGC GGTCGATACC GACGGCGACG CGATCCTCGC CGACGTGGCG TTCGAGGGGA GCCGCGAGTA CGACGTGTAC GCCTTAGTCG ACACGACGAT CACGAACGTC GGATCGAACG ACCGCGGCGA CCGCGTCGAT GACTCGGACG GCTACCACCT CCTCGCGCGC AACAACGACG CCGCCGAGCG CAGCACGGGG AAGCTCGTCG ACGACGACGG CGACTCCTTC GAGGTCGCGC TCGCGATTGA CAGCGACGAC GGTTTCGAGT GGGCGAGCGT GCTCGCGGCG GGCAGCGACA AGGCGGGTGC ACTGTTCGCC GACGGCGACC GGGGCGAGGG GACCGAGACG GCCACCGGCA ACGTCGTACT CGCCGGACTC GTGGGGAGCG GTACCGAGGT GTCGGACGCG GTCGCGCTCG GGTTCGCCGA GAACGCCGAC ACCGCCGCCG CCCTCGGAGA GGCTCGCGGC GCCATCTCCC GCGGCTTCGC GGACATCTCC GAGTCGTACG TCGAGACGTG GCGCGAGTGG CTCGCCGACC GGAAGTTCCC CGCTTCCGTG ACCGGCGACG CAGACCTAGA GACGCAGTAC CGGTTCGCGC TGATGACGCT TGCGGCCGTC GAGGACAAAC GGCACGACGG CGCCGGGATC GCCAGCCCGT CGGTACCGTG GGGCGAGACC GAGTACGCCG CCGAAGACCG AGGCTACGGC TACAACTTCG TCTGGTCGCG CGACCTCTAT CAGGTGTTCA CCGCGCTGAT CGAGGTTGGC GAGGTCGAAC GCGGTGCCGA CGCGCTCGCG TACCTGTACA ACACCCAGCA GGACGACGAC GGGTTCCTCC CGCAGAACAC CTACATCGAC GGTCGTACTC GGTGGGGCGG CGAGCAGATG GACAACATCG CGTTCCCGTC GGTGATGGCG TGGCAACTGT ACGAACACGG GATCTCGCCC GGCGACACCG ACTACACCTA CGATCAGGTC CGTCGCTCGC TCGGCTATAT CGCCGCCAAC GGCCCGGAGA CGGCTCAGGA GCGCTGGGAG GAGGAGGCCG GCTTTTCGCC GTCGAGCATC GCCGCCGAGA 
TCGCGGGGCT CTGTTGTGGC GCCGCGCTTG CGGTCGCCGA GGCCGATCGG ATCGAGGCCG GAAACGCCGG TTCCGGATCC GGACCGGACG CTGACTCCGA CTCACTCCGT GCCGACGCCC TCGCGTGGCT CGCGCTCGCG GACGACTGGA CGACCCGCGT CGAGGAATGG TGCGCGACGG CCACAGGTAC CGAGCGGCAC GCCGAGACGC CGTACTACCT CCGGGTCACC GCCGACGGCG ACCCCGAGTC CGGGCGCCCC CGGACGATCG CCAACGACGG CCCCACCTAC GACGAGCGGG AGATCGTTGA CGGCGGCTTC CTCGAACTGG TCCGACTCGG CGTGAAGCCC GCCGACGACG ACGTGGTCCG CAACTCCGTG TCGGTCGTCG ACGATTCGAT CCGCGTCGAC ACCCCGTACG GTCCCGCGTG GTACCGGTAC GTCGGCGACG CGTACGGCGA ACTGGCGCGC AGCGATCCGG GCGGCCCGTG GGCCGGCACC GGTGACGGCC GCGGTCGGCT GTGGCCGATC TTCACCGGCG AGCGCGGCGA GTACGAACTA CGCGCTCGCG CCGACGGCCC AGACGCCTTT GGCGGTACCG ACGAGGACGC CCTCGAACCC GACGCTCTCT TGGAGACGAT GGCCGGGTTC GGCAACTCCG GACGGATGCT CCCCGAGCAG GTGTGGGACC GTGAGCATCC GACCAACTAC GGCTGGGAGT TCGGCGAGGG AACCGGGGGG GCCACCCCGC TCGCGTGGTC GATGGCGGGG TTCATCCGGC TCGCTCACGG GGTCGAGGCC GGCGAACCCG TCGAGACGCC GACGGTCGTC CGCGACCGCT ACGTCGAGCG CGACCGCTCG GCGACGCCCG ACCTCGACGC GACCGCGGAG TACGTCAACA ACCATCTCGT CGTCGCCGGC GAGACCACCG CCGACGTGGT CGCGGTGTAC ACGCCGGACG AATCTGCGCT CGTGACGGTC GAGGACGGGG AGTACGAGTT CCGCCTCGAC GCGGCGCTCG ATGCGAAGAC GGTCGTCGTC GCCGCCGCGA ACACCGACGG CGGCGCGGAC GATACGGCGG CCGACGCGGG CGATGAGGCG ACCGATACCG ACGCGATCCT CGACGAGTTC TCAGGTGCCG GCACGGCTGT TAAGCGGCTT CGGGTGTAG
|
Protein sequence | MRLRTALTEH ERRRGERYPA ERPTTAGAFT GDDGRLVHVG PNGTVHDCSY SLSGVGGADR LRMGITAGRG VRWLDDLTTT RQHYDGDTPL VETEYDAGRY TVHQFDLVVS DTHLTHVELR GAPPADAELV AACAFSPDMV EGRVGNLVHE EAGPQSGSVV EVYHRTEHDF LTASTGLSAA HGQRLRTVSE LLGESGEGFP HRGEIDQRED SRLTPDVVVR APFERDGRTE RVTLASRAVL DRRETRETVD DAKDAISDGP DRQRRIEELS RIATAYPDAD DLREAAEGRG PTVPDDVPRR SVVAGDLRAL DLLTAESGAR IAAPEFDPFY STSGGYGYTW FRDEAETSLA LLGASDELGL DADEELLATA SFFCRTQDAD GSWPHRVWAD SGKVAPGWAN ARIEGANATA GPNDQLDQPA SVVAFLARLR RTTDLPPEWR DRVDDTIADA IDFLRETTEP DGLPRRCQNC WENALGRFTH TGATYLRAFA AVARAPLPDD VRADAADAAD AALAGLNGRW NPDTERFPQR ASAESRDDRS DASTFALASA ATEYAALRDE RNEIGADGGS TPEIDTTPAA ADVDFDAFLD RVTTHVLTTI GELRRETADV EGLVRFWGDD WRTAEQGAAK VWSIATLWGA TAAAELGGLI RERDGDAADL FSAARDLYAL CEPDGPFVND AGLIAEQAFD DGDLDGATPI AWSHALRIDA TVTLARHGAL PVPHDEPRGP DEAPHWTTGR KFGVGTPADH KAADPVPVWF TLTEGALTEA RFPRIDVMNV RTFDFLVADP ETGYTVRTFD ETSHVTATET VERTTEPTVD DALAYTQTIC ETGDGHGHSW TLTVEYAVDT DGDAILADVA FEGSREYDVY ALVDTTITNV GSNDRGDRVD DSDGYHLLAR NNDAAERSTG KLVDDDGDSF EVALAIDSDD GFEWASVLAA GSDKAGALFA DGDRGEGTET ATGNVVLAGL VGSGTEVSDA VALGFAENAD TAAALGEARG AISRGFADIS ESYVETWREW LADRKFPASV TGDADLETQY RFALMTLAAV EDKRHDGAGI ASPSVPWGET EYAAEDRGYG YNFVWSRDLY QVFTALIEVG EVERGADALA YLYNTQQDDD GFLPQNTYID GRTRWGGEQM DNIAFPSVMA WQLYEHGISP GDTDYTYDQV RRSLGYIAAN GPETAQERWE EEAGFSPSSI AAEIAGLCCG AALAVAEADR IEAGNAGSGS GPDADSDSLR ADALAWLALA DDWTTRVEEW CATATGTERH AETPYYLRVT ADGDPESGRP RTIANDGPTY DEREIVDGGF LELVRLGVKP ADDDVVRNSV SVVDDSIRVD TPYGPAWYRY VGDAYGELAR SDPGGPWAGT GDGRGRLWPI FTGERGEYEL RARADGPDAF GGTDEDALEP DALLETMAGF GNSGRMLPEQ VWDREHPTNY GWEFGEGTGG ATPLAWSMAG FIRLAHGVEA GEPVETPTVV RDRYVERDRS ATPDLDATAE YVNNHLVVAG ETTADVVAVY TPDESALVTV EDGEYEFRLD AALDAKTVVV AAANTDGGAD DTAADAGDEA TDTDAILDEF SGAGTAVKRL RV
|
| |
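The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the helper names `span_length` and `expected_protein_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not translated (the gene sequence above ends in TAG, and the record lists the gene as not spliced).

```python
# Values copied from the Gene Information table of this record.
START_BP = 1510589
END_BP = 1515367
GENE_LENGTH_BP = 4779
PROTEIN_LENGTH_AA = 1592


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# True when the coordinates reproduce the listed gene length.
coordinates_match = span_length(START_BP, END_BP) == GENE_LENGTH_BP
# True when the listed gene length reproduces the listed protein length.
protein_matches = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under those assumptions both checks hold for this entry: 1515367 - 1510589 + 1 gives 4779 bp, and 4779 / 3 - 1 gives 1592 aa.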