Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3903 |
Symbol | |
ID | 8744531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 148514 |
End bp | 151705 |
Gene Length | 3192 bp |
Protein Length | 1063 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646514487 |
Product | Beta-galactosidase |
Protein accession | YP_003405434 |
Protein GI | 284167156 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.120366 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCGAG ACTGGGCCGA CCCGGAGACG GTCGGTCGAA ATCGGATCGA TCCGCACGCG TACGGCCTTC CGTACGCCGA GACGGACACC GCGACCGCGG GAAACCGAGC GGCCTCGCCC TGGATCGCGT CGCTGAACGG CGAGTGGCGG TTCCGGTTGG CGGAGACGCC GACCGCCGCG CCCGACGGGT TCCACGAGCC GGACGCCGAC GTCGGCGACT GGGACCGTAT CGAGGTGCCC CAGCACTGGC AGACCGCCGG CTACGGCGAT CCCCACTACA CGAACGTGGT CTACCCGTTC CCGCTCGATC CGCCCCACGT CCCGACCGAG AACCCGACCG CGTCGTACCG CCGGACGTTC CACGTCCCCG ACGACTGGGA CGAGCGCCAG ATCCGACTCC GATTCGGCGG CGTCGACTCC GCGTTCCACC TCTGGATCAA CGGCGAGGAA GTCGGGTACA GCGAGGGGAG CCGGCTCCCG TCGGCGTTCG ACGTCACCGA CTACGTCTCG CCGGGCGAGA ACACGGTCGC CGTCCGCGTC TACAAGTGGT CGACCGGGAG CTACCTCGAG GACCAGGACA TGTGGTGGCT CAGCGGGATC TTCCGGGACG TCGCCCTCTC GGCTCACCCG ACGGTACAGG TCGCGGACGT GGACGTCCGG ACCGACCTCG ACGAGCGATA TGAGGACGCC GTCCTGCAGG CGTCCGTCGA CGTACGCAAC GTCGGCGACG ACGCCGGGAC GGCCCGAATC GAACCGACGC TGCGCGATGC GGACGGAACG CCGGTCTCGA CGACGCTCGA GGCGCGGTCC GTCGCGCTCG AGGCCGGCGA GGCGACGACC CTCGAGTTCG AGACGACCGT CGAGGAGCCC CGCAAGTGGA CCGCGGAGAC GCCCAACTGC TACGATTTCG CGCTCGGTAT CTCCGACGGA CGGGGCGACG ACGAGACGGT CCTCGCGCAG ACGGTCGGCT TCCGTGAAAT CGAGATCGTC GACGGACAGT TGCTGGTCAA CGGCCGACCG GTGACGATCC GCGGCGTCAA CCGCCACGAC TTCCACCCCG ACCGCGGCCG CGCCGTCCCG CTCGAGGCGA TGCGGGAGGA CGTCGAGCTG ATGAAGCGGC ACAACATCAA CGCGGTTCGT ACCGCCCACT ACCCGAACGA TCCGCGGTTC TACGAGCTCT GTAACGAGTA CGGGCTCTAC GTGCTCGACG AGACCGACCT CGAGTGCCAC GGGATGGTCC ACGCGGAGAC GACCGAGCAC GTAAGCGACG ATCCCGACTG GGAAGCCGCG TACGTCGACC GGATGGTTCG GATGGTCGAG CGCGACAAGA ACCACCCCAG CGTCATCTGC TGGTCGCTGG GCAACGAGTC GGACCTCGGG GCCCACCACG AACGGATGGC CGCGGCGACG CGCGAGCGCG ATCCGACGCG GCCGATCCAC TACGAACCCG ACACGGAGCA GACGGTCTCC GATATCATCG GGCCGATGTA CCCGCCCTTC GAGCAACTCG AGGAATGGGC CGAGGCGGAC CTCGAGCATC CGGTCGTGCT CTGCGAGTAC GCCCACGCGA TGGGGAACGG ACCGGGGAAC CTCCGGGAGT TCTGGGACCT CTTTTACGAG CACGAGGGGA TGCAGGGCGG CTTCGTCTGG GACTGGATCG ATCAGGGACT CCGGCGGACG GCCGACGACG GGACGGAGTG GTTCGCCTAC GGCGGCGACT TCGGCGACGA ACCGAACGAC GCGAACTTCA ACATCAACGG GCTCGTCTTC CCCGATCGGA AGCCCTCGCC CGGACTCACC GAGTACAAGA AGGTCATCGA GCCGGTCGTC CTGCGCGAGG ACGATCTCGA GCGCGGGGAG CTCACCGTCG AGAACCGGTA CGATTTCCGG TCGCTCGAGC ACCTCCGCGC CTCCTGGCGC CTGCTATCCG ACGGCCGCGT CGTCGAGAGC GGACGGCTGC CGCTGCCCTC GATCGCCGCC GGCGAGTCCG CGACGGTCAC GGTTCCCGTC GACGTGGACG GACTCGAGAC AGACGGACTC GATGCGGACG CCGAACACGT CCTCACTGTC GACGTCTCGC TTGCCCGCGA GACGGCGTGG GCGCCGCAGG GACACACGGT CGCGACCGGG CAGTTCGAGC TTCCGGAAAG CGGATCCGGG ACCGGCTCCG CTTCGCAGCC GTCGACCGGC GTCGCCGCGC CGCTGACGTG TGCGGGAGAC GGGGAGGAGA TCCGCGTTTC GAACGAGCAG TTCGAACTGG TCTTCGATCG CACGTTCGGC GTCATCGACT CGCTCGCGTA CCGGAATCGG TCGCTGTTGG AGGACGGTCC GTCGGTCGGA ATCTGGCGCG CGCCGACGGA CAACGACGGG GGGCTCCCGC TCTCGCGGAC GCTCCTCTCG CAATTCACCG AACGCTACGA GAACGAGGAA CTCGTTCAGG CGGGGGACCT CGCGACCGTC GGGTTCGAAC AGCTCTGGCG GGAGCACGGG CTCGATCGGC TGCAGTTCCG CGTCGACGAC GTCACGTGTG TTCGGGGCGA GCGAGACGCC GATCCCGTTA CGATCACCGT CGACGGCCGC CTCGCGCCGC CGATATACGA CCACGGGTTC GCAGTCGAGC AGACGTACAT GATCGAGCGC ACCGGTGCGA TAACCGTCGA CACCGCGATC AAGCCCGAAG GAGACCTGTC GCTGCTGCCC TCGCTCCCTC GAGTCGGGCT CGATCTCACG CTCGAAGACG ACCTCGATCG GGTCACGTGG TACGGACGCG GGCCAGGCGA GTCGTACGTC GACAGCAAGG AGGCCGCCCT GCTCGGCCGG TACAGTCGCT CGGTCGCCGA TCTGCAGACG CCCTACGTCG CCCCCCAGGA GAGCGGGAAC CGAACGGACA CCCGCTGGGT GACGTTCACC GACCAGCGCG GGACCGGCCT CTTCGTCACC GGCGAAACGC CGTTCGATTT CAGCGCACAC CCCTTCAGTA CCGCCGATCT CGACGCTGCC GGGCACACGC ACGAGCTTCC GGATCGAGAC GGCGTCTGGG TTTCGCTCGA CGACGGCCAC TGTGGGCTCG GGACCGGAAG CTGCGGACCG CCGACGCTCG AGGAGTACCG ACTCGAGCCA GAGCCGATCT CGTTCCGTAT GGAACTACAC CCGTTCGCTG CAGACGAGCT TCCGGCGACC GATCGGTACT GA
|
Protein sequence | MTRDWADPET VGRNRIDPHA YGLPYAETDT ATAGNRAASP WIASLNGEWR FRLAETPTAA PDGFHEPDAD VGDWDRIEVP QHWQTAGYGD PHYTNVVYPF PLDPPHVPTE NPTASYRRTF HVPDDWDERQ IRLRFGGVDS AFHLWINGEE VGYSEGSRLP SAFDVTDYVS PGENTVAVRV YKWSTGSYLE DQDMWWLSGI FRDVALSAHP TVQVADVDVR TDLDERYEDA VLQASVDVRN VGDDAGTARI EPTLRDADGT PVSTTLEARS VALEAGEATT LEFETTVEEP RKWTAETPNC YDFALGISDG RGDDETVLAQ TVGFREIEIV DGQLLVNGRP VTIRGVNRHD FHPDRGRAVP LEAMREDVEL MKRHNINAVR TAHYPNDPRF YELCNEYGLY VLDETDLECH GMVHAETTEH VSDDPDWEAA YVDRMVRMVE RDKNHPSVIC WSLGNESDLG AHHERMAAAT RERDPTRPIH YEPDTEQTVS DIIGPMYPPF EQLEEWAEAD LEHPVVLCEY AHAMGNGPGN LREFWDLFYE HEGMQGGFVW DWIDQGLRRT ADDGTEWFAY GGDFGDEPND ANFNINGLVF PDRKPSPGLT EYKKVIEPVV LREDDLERGE LTVENRYDFR SLEHLRASWR LLSDGRVVES GRLPLPSIAA GESATVTVPV DVDGLETDGL DADAEHVLTV DVSLARETAW APQGHTVATG QFELPESGSG TGSASQPSTG VAAPLTCAGD GEEIRVSNEQ FELVFDRTFG VIDSLAYRNR SLLEDGPSVG IWRAPTDNDG GLPLSRTLLS QFTERYENEE LVQAGDLATV GFEQLWREHG LDRLQFRVDD VTCVRGERDA DPVTITVDGR LAPPIYDHGF AVEQTYMIER TGAITVDTAI KPEGDLSLLP SLPRVGLDLT LEDDLDRVTW YGRGPGESYV DSKEAALLGR YSRSVADLQT PYVAPQESGN RTDTRWVTFT DQRGTGLFVT GETPFDFSAH PFSTADLDAA GHTHELPDRD GVWVSLDDGH CGLGTGSCGP PTLEEYRLEP EPISFRMELH PFAADELPAT DRY
|
| |