Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3909 |
Symbol | |
ID | 8744537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 159861 |
End bp | 164663 |
Gene Length | 4803 bp |
Protein Length | 1600 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646514491 |
Product | glycoside hydrolase family 2 sugar binding protein |
Protein accession | YP_003405438 |
Protein GI | 284167160 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACT CACAGCAACG GGAGACAGAG ACCGAACAGA CACCTCGCGG TCGGCTCTCA CCGACGCGAC GGAAGTTCCT GCAGGCGACC GGGATGGCCG CGCTGGCGAC CGGGGTCAGC GCCCAGTCCG ACGCCGCGGC ACCGTCCGAG GGACCGGGCG GCCCCGGTCG CATCTCGAAC CTCGCGGCGT ACCTGGAAGA CCCCCAGACG GTCGCGGAGA ACGTCGAGCC GACTCACGTC ACCACCGCGG TTCCCTACGG GTCGGTGCGA GCGGCTTGCC AGGCGGACGA ACCGTTCACG GAACTCGAGT CGCGGTTCGC CGAGTCCGAG TACGTCCGGC TGCTCAACGG CGAGTGGCGG TTCGCGTTCC ACGAGAGCCC CTCCGAACTC CCCGACTCGT ACGATGACCT CGCGGCCGAC GACTGGGGAT CGATCGACGT CCCCGGCGTC TGGCAGACCC AGGGGCACGG CCAGCGGATC TACGCGAACA ACTCGATCAC GTGGCAACAC TATGATCCGG GACAGGAGGG CGACCTCACG CCGGGCGAGG ACGGCACGGT CGACGTCCCC GGCGTCGATG ACGACGGGAT CGACCCCGTC GGGACCTATC GGCGGACGTT CTCGGTACCG AGAGACTGGG ATGGCCGGCA GACGTTCCTG CACTTCGAGG CCGTCAAGCA GGCCTTCTTC GTCTGGATCG ACGACGAGTA CGTCGGCTTC CGAACGGGGT CGATGACGGC CGCCGAGTTC GACGTCACCG ACCACGTCGA AGCCGGCGGC GACTACGAGG TCACGGTGCA GGTGTACCGC TGGAGCGACA GCGAAGCCCT GGAGACGATC GACATGTTCC GGTACGCCGG CATCTTCAGG AACGTCTACC TGTTCGCGAC GCCGACGGTC CACCTCAGGG ACTTCTACGC GCGGACCGAC CTCGACGACG ACTACGAGGA CGGCCGCCTC CGGATCGACG CCGAGATCGC GAACTACGGC GACGCTCCGG CGGGGAAGTA CACCGTCCGC GCGCACCTCT ACGAGACCGA TCGGACGAAG CCGGGTCGCG GTCCACCCCG TGGGAAGGGG AAAGAGAACG CGGAGGCGAA GGGCAATAGG AAAAAGAACG GCCGCGGCCC GGGCGACAAC CCGCCGCGCG GTCGCAAGGT GACGACAGTC GAGGCGACCG CGACCGTCGA CGAGGACGGC GCCGTCGTCA CTCTCGAGGC CGACGTCGAC GACCCCGCGA AGTGGTCGGC CGAGCACCCG AACCTCTACC AGCTCGCGCT GGAACTCGTC GACGCTCGTG GCGAGACGGC CGAGGCGATG CTCGAGAAGG TCGGCTTCCG CACGTACGAG ACGACTCGCG GGCAGCAGGG CGCCGAGGTC CTGGTCAACG GCGAGGCGGT CGACATCCAG GGCGTGAACC GCCACGAGAC CGACCCCGAC TTCGGACGGA CGGTACCGAT CGATAGGGTC CGGGAGGACC TCGAGACGAT GCAGCGGTTC AACGTCAACG CGATCCGGAC CTCCCACTAC CCGAACGACC CGTCGTTCTA CCGGCTGGCC GACGAGTACG GGATCTACGT GCAAGACGAG GTCGGCGTCG AGACCCACTG GTGGGAGGGG CTGCTCGCGC ACACCGACGC GTACCACGAC CAGGCCGTCG AGCAGTTCCG GCGCATGGTG CTCCGGGACC GCAACCACGC GTCGATCTTC AGCTGGTCGA CCGGTAACGA GGCCGGCACC GGCGCCGAGC ACCTGGAGAT GGCGTCGCTG GCGGCCGACT CCGACGAGTA CATCCCCGAC GACACCTCCG AGGTCAGCGG CGTCGAGAAC GTCGAGTCCT TCGACGGCGA GGCCGAAGGG TTCGCGCCCG ACCGCATCCT GTACCACCAA CCCAACGGCG GCGGCTGGAA CGTCGAGTAC AGCGACATGC TCGGGCCGCG GTACCCCGAC GTCGGGACCC TGCTGTCGGT CGCCGACGGC TCCTACATCG GCGACGGGCT CCGGCCCGTC GCGATGGGCG AGTACAATCA CGCCATGGGC AACAGCCTGG GGCTGGTCCA CGAGATGTGG AACGAGCACA TCAAGCCGCC CGCGCGGGAG GCCACGGACC GCAGCGACGC AGACAATCCG GGCGTGCTCG TCGGCACGCC AGAGGTCGTC CCCGGACCGG ACGTCGAACC GGGCGCCCCC GACGGCGCGG TCGTCCTCGG CGAGAGCGAC GCGATCGAGA TCCGGCGCGA CGAGAGCCTC GACGTCGACC CCGGATTCAG CGTCGGCGCC ACCGTCTCGA ACGTCGACGC GGACGCCGAC GCGTCGCTGG TCGACGAGGG GCGCTACGCG CTCGAACTGT CCGACGGCGA ACTCGTACTC TCGATCGGAT CGGACTCGGT CTCGGCGTCG CTCCCCGCGG GCGGGGTCGA CGACTGGACG ACCGTCGTCG GCGTCGCGGA CGCCGACGAA CTCCGGCTCT ACGTCGACGG CGAGCGGGTC GCCGCGACGA GCCACTCGGT CTCTGACCTC CCGTCGAGCG ACGGGCCCGT CCGGATCGGT ATCGACGGCG ACGCCGAAAT CACCGTCGAC GGTATCGGTA TCTACGACCG CGCCGTGAGC GACGACGAGG CGAGCGGCGC CGACGGCACG CTCTCCGACG GCGCCGTCGT CGCCTACGAC TTCGCGGATC TGATCCGCGA CCAGAGCCTC GTCGGCGGGT TCGTCTGGGA CTGGGTCAAC CAGGACCTCA ACGACGTCAC CGAGGACGGC CAGGAGTTCC AGTTCTACCA CAGCGACGGC CCCGACGGGG CGTTCTGCCT CAACGGCCTG GTCTGGTCCG ACCGCGATCC CCAGCCCGAG ATGTGGCAGC TCAAGCACAG CCACCAGCCG GTCGGCGTCG CCGACGCCGC CGTCGAGGAC GGCGAGGTCT ACGTCTCGAA CCGCTACAAC TTCACCGGCC TCGACGCGCT GGAGGGGTCG TGGGAGCTGA CCGCCGACGA CGAGACCGTC AAATCCGGCG AGCTCGACCT CGACATCGAA CCCGGCGAGA CGCGCCGCGT CGACGTCCCG GTCGGGGCGC CGTCCGATCC CGAACCGGGC GTCGAGTATC GGCTGAGCGT CTCGTTCGCG CTCGCAGAGG CGACCGACTA CGCCGACGCC GGCCACGAGG TCGCCTTCGA GCAGCTCGAG ATCCCGGTCG ACGCGCCAGA ACCGGAGCCG GAGAGCCTCG ACGCCATGGC ACCGCTCTCG GTCGAGGAGG GCGACGACAT CGTCATCTCC GGCGACGGCT TCGAGTACGT CGTCTCCGGC GACGCGGGGA CGCTCTCTTC GATGCGGTAC GGCGGGAGCG AACTCGTAGA GGACGGCCCG CTCTTTAACG CGTGGCGCGC GCCGATCATG AACGAACACC AGCAGTGGGG CTCGGCGCCC GCGTTCTCGT GGTACGAGGC CGGCCTCAAC GACCTGACCC ACTCCGTCGA CTCCGTCGAG ACGAACGCGG TCGATGATTC GCTGGTGCAG CTCACGGTCG ACGGCTTCGC GGCGGGAACC GAAGTACCGC CGCCGCTGCT GACACCGGAT GCCTCTGGTG AAGGGAACGA CGGCGAGGTC GTCGGCGATC CGGACGTCGT CTCGGGCCAG AGCGGTCAGG CCGTCGCGCT CGACGGCGAG TCTCAGCACG TCAACGCGGG CAACGACGCC AGCGTCGACT TCGGTGAGCC GGGATTCACG ATCCAGGTGC GGTTCAAGGG CGTGCCCGCG AACGGCGAAC ACAACCCGTT CGTCAGCAAG GGCGACCACC AGTACGCGCT GAAGATCTCC AGCAGCGACG AGTTCCAGTT CTTCATCTAT CAGGACACCT GGATCACGCA CAACGCGCCG ATACCGTCGG GTCTGGCGGA AGACGAGTGG CACACCCTGA CGGGGGTAGC GACCGACGAC GAACTGCAGC TGTACCTAGA CGGGGAGACG CTGGGGAGTA GCGGCCACAG CGCGACCAGC GTCAATCAGT CGGCGTTCCC GGTTCATGTC GGTCACAACG CGGAGAACAC GAACCGGTAC ACGGAGACGG CGATCGACGA GGTGCGGATG TACGACCGGG CGCTGTCGCC GAGTGAAATC TCGTCCGGCT TCGACGAACC GCCGGAGAGC GCGGTGTTGT GGTACGAACT GGACACGTTC GAGGAGGGCG AGTCGACGGT GATGGGCTTC GAGACGCAGT ATCGCTACCG GATCTACGGT AGCGGAGACG TGCGGATGAA CGTCGAAGCC GTTCCGAACG AGCCACTGCG CAACACGGTG AGCGGCTGGC TCCCGAAGGT CGGCGTCCAG CTGGACCTGC CCGAGCGCTT CGACGCGTTC GAGTGGTACG GCCGCGGCGA GTTAGAGACC TACCCCGACC GCAAGTGGAG CGTCCCGGTC GGGCGCTACG CCGGGTCGGT CGACGAGCAG TACGTGCCGT ACCTGCCGCC GACCGACAAC GGCAACAAGG CCCAGACGCG CTGGGCGACG CTCTCTGACG GCGAAGTCTC GCTGCTCGGG ATGCCCGGCG ACGAGGACGC CAACGTGAGT CTCGAGCAGT GGTCGAACCT CGACGAGGCC GAGCACCAGT ACGAACTGGA GGAGCGCGGC TCGATCGGGT TCAATCTCGA CCACCGCGTG ACCGGCGTCG GCGGGACGCC GACCGATCCG ATCGACCGCT ACCAGGTCGA GGTCGAGCCG ACCGCATTCA GCGTGGTGCT CCGGCCGTTC GATCCCGACG ACGCGGATCC GATGGAACTG GCGAACCGAC GACTGCCAGA CGCCGACGAG TAG
|
Protein sequence | MSDSQQRETE TEQTPRGRLS PTRRKFLQAT GMAALATGVS AQSDAAAPSE GPGGPGRISN LAAYLEDPQT VAENVEPTHV TTAVPYGSVR AACQADEPFT ELESRFAESE YVRLLNGEWR FAFHESPSEL PDSYDDLAAD DWGSIDVPGV WQTQGHGQRI YANNSITWQH YDPGQEGDLT PGEDGTVDVP GVDDDGIDPV GTYRRTFSVP RDWDGRQTFL HFEAVKQAFF VWIDDEYVGF RTGSMTAAEF DVTDHVEAGG DYEVTVQVYR WSDSEALETI DMFRYAGIFR NVYLFATPTV HLRDFYARTD LDDDYEDGRL RIDAEIANYG DAPAGKYTVR AHLYETDRTK PGRGPPRGKG KENAEAKGNR KKNGRGPGDN PPRGRKVTTV EATATVDEDG AVVTLEADVD DPAKWSAEHP NLYQLALELV DARGETAEAM LEKVGFRTYE TTRGQQGAEV LVNGEAVDIQ GVNRHETDPD FGRTVPIDRV REDLETMQRF NVNAIRTSHY PNDPSFYRLA DEYGIYVQDE VGVETHWWEG LLAHTDAYHD QAVEQFRRMV LRDRNHASIF SWSTGNEAGT GAEHLEMASL AADSDEYIPD DTSEVSGVEN VESFDGEAEG FAPDRILYHQ PNGGGWNVEY SDMLGPRYPD VGTLLSVADG SYIGDGLRPV AMGEYNHAMG NSLGLVHEMW NEHIKPPARE ATDRSDADNP GVLVGTPEVV PGPDVEPGAP DGAVVLGESD AIEIRRDESL DVDPGFSVGA TVSNVDADAD ASLVDEGRYA LELSDGELVL SIGSDSVSAS LPAGGVDDWT TVVGVADADE LRLYVDGERV AATSHSVSDL PSSDGPVRIG IDGDAEITVD GIGIYDRAVS DDEASGADGT LSDGAVVAYD FADLIRDQSL VGGFVWDWVN QDLNDVTEDG QEFQFYHSDG PDGAFCLNGL VWSDRDPQPE MWQLKHSHQP VGVADAAVED GEVYVSNRYN FTGLDALEGS WELTADDETV KSGELDLDIE PGETRRVDVP VGAPSDPEPG VEYRLSVSFA LAEATDYADA GHEVAFEQLE IPVDAPEPEP ESLDAMAPLS VEEGDDIVIS GDGFEYVVSG DAGTLSSMRY GGSELVEDGP LFNAWRAPIM NEHQQWGSAP AFSWYEAGLN DLTHSVDSVE TNAVDDSLVQ LTVDGFAAGT EVPPPLLTPD ASGEGNDGEV VGDPDVVSGQ SGQAVALDGE SQHVNAGNDA SVDFGEPGFT IQVRFKGVPA NGEHNPFVSK GDHQYALKIS SSDEFQFFIY QDTWITHNAP IPSGLAEDEW HTLTGVATDD ELQLYLDGET LGSSGHSATS VNQSAFPVHV GHNAENTNRY TETAIDEVRM YDRALSPSEI SSGFDEPPES AVLWYELDTF EEGESTVMGF ETQYRYRIYG SGDVRMNVEA VPNEPLRNTV SGWLPKVGVQ LDLPERFDAF EWYGRGELET YPDRKWSVPV GRYAGSVDEQ YVPYLPPTDN GNKAQTRWAT LSDGEVSLLG MPGDEDANVS LEQWSNLDEA EHQYELEERG SIGFNLDHRV TGVGGTPTDP IDRYQVEVEP TAFSVVLRPF DPDDADPMEL ANRRLPDADE
|
| |