Gene Htur_3909 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3909 
Symbol 
ID8744537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp159861 
End bp164663 
Gene Length4803 bp 
Protein Length1600 aa 
Translation table11 
GC content68% 
IMG OID646514491 
Productglycoside hydrolase family 2 sugar binding protein 
Protein accessionYP_003405438 
Protein GI284167160 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACT CACAGCAACG GGAGACAGAG ACCGAACAGA CACCTCGCGG TCGGCTCTCA 
CCGACGCGAC GGAAGTTCCT GCAGGCGACC GGGATGGCCG CGCTGGCGAC CGGGGTCAGC
GCCCAGTCCG ACGCCGCGGC ACCGTCCGAG GGACCGGGCG GCCCCGGTCG CATCTCGAAC
CTCGCGGCGT ACCTGGAAGA CCCCCAGACG GTCGCGGAGA ACGTCGAGCC GACTCACGTC
ACCACCGCGG TTCCCTACGG GTCGGTGCGA GCGGCTTGCC AGGCGGACGA ACCGTTCACG
GAACTCGAGT CGCGGTTCGC CGAGTCCGAG TACGTCCGGC TGCTCAACGG CGAGTGGCGG
TTCGCGTTCC ACGAGAGCCC CTCCGAACTC CCCGACTCGT ACGATGACCT CGCGGCCGAC
GACTGGGGAT CGATCGACGT CCCCGGCGTC TGGCAGACCC AGGGGCACGG CCAGCGGATC
TACGCGAACA ACTCGATCAC GTGGCAACAC TATGATCCGG GACAGGAGGG CGACCTCACG
CCGGGCGAGG ACGGCACGGT CGACGTCCCC GGCGTCGATG ACGACGGGAT CGACCCCGTC
GGGACCTATC GGCGGACGTT CTCGGTACCG AGAGACTGGG ATGGCCGGCA GACGTTCCTG
CACTTCGAGG CCGTCAAGCA GGCCTTCTTC GTCTGGATCG ACGACGAGTA CGTCGGCTTC
CGAACGGGGT CGATGACGGC CGCCGAGTTC GACGTCACCG ACCACGTCGA AGCCGGCGGC
GACTACGAGG TCACGGTGCA GGTGTACCGC TGGAGCGACA GCGAAGCCCT GGAGACGATC
GACATGTTCC GGTACGCCGG CATCTTCAGG AACGTCTACC TGTTCGCGAC GCCGACGGTC
CACCTCAGGG ACTTCTACGC GCGGACCGAC CTCGACGACG ACTACGAGGA CGGCCGCCTC
CGGATCGACG CCGAGATCGC GAACTACGGC GACGCTCCGG CGGGGAAGTA CACCGTCCGC
GCGCACCTCT ACGAGACCGA TCGGACGAAG CCGGGTCGCG GTCCACCCCG TGGGAAGGGG
AAAGAGAACG CGGAGGCGAA GGGCAATAGG AAAAAGAACG GCCGCGGCCC GGGCGACAAC
CCGCCGCGCG GTCGCAAGGT GACGACAGTC GAGGCGACCG CGACCGTCGA CGAGGACGGC
GCCGTCGTCA CTCTCGAGGC CGACGTCGAC GACCCCGCGA AGTGGTCGGC CGAGCACCCG
AACCTCTACC AGCTCGCGCT GGAACTCGTC GACGCTCGTG GCGAGACGGC CGAGGCGATG
CTCGAGAAGG TCGGCTTCCG CACGTACGAG ACGACTCGCG GGCAGCAGGG CGCCGAGGTC
CTGGTCAACG GCGAGGCGGT CGACATCCAG GGCGTGAACC GCCACGAGAC CGACCCCGAC
TTCGGACGGA CGGTACCGAT CGATAGGGTC CGGGAGGACC TCGAGACGAT GCAGCGGTTC
AACGTCAACG CGATCCGGAC CTCCCACTAC CCGAACGACC CGTCGTTCTA CCGGCTGGCC
GACGAGTACG GGATCTACGT GCAAGACGAG GTCGGCGTCG AGACCCACTG GTGGGAGGGG
CTGCTCGCGC ACACCGACGC GTACCACGAC CAGGCCGTCG AGCAGTTCCG GCGCATGGTG
CTCCGGGACC GCAACCACGC GTCGATCTTC AGCTGGTCGA CCGGTAACGA GGCCGGCACC
GGCGCCGAGC ACCTGGAGAT GGCGTCGCTG GCGGCCGACT CCGACGAGTA CATCCCCGAC
GACACCTCCG AGGTCAGCGG CGTCGAGAAC GTCGAGTCCT TCGACGGCGA GGCCGAAGGG
TTCGCGCCCG ACCGCATCCT GTACCACCAA CCCAACGGCG GCGGCTGGAA CGTCGAGTAC
AGCGACATGC TCGGGCCGCG GTACCCCGAC GTCGGGACCC TGCTGTCGGT CGCCGACGGC
TCCTACATCG GCGACGGGCT CCGGCCCGTC GCGATGGGCG AGTACAATCA CGCCATGGGC
AACAGCCTGG GGCTGGTCCA CGAGATGTGG AACGAGCACA TCAAGCCGCC CGCGCGGGAG
GCCACGGACC GCAGCGACGC AGACAATCCG GGCGTGCTCG TCGGCACGCC AGAGGTCGTC
CCCGGACCGG ACGTCGAACC GGGCGCCCCC GACGGCGCGG TCGTCCTCGG CGAGAGCGAC
GCGATCGAGA TCCGGCGCGA CGAGAGCCTC GACGTCGACC CCGGATTCAG CGTCGGCGCC
ACCGTCTCGA ACGTCGACGC GGACGCCGAC GCGTCGCTGG TCGACGAGGG GCGCTACGCG
CTCGAACTGT CCGACGGCGA ACTCGTACTC TCGATCGGAT CGGACTCGGT CTCGGCGTCG
CTCCCCGCGG GCGGGGTCGA CGACTGGACG ACCGTCGTCG GCGTCGCGGA CGCCGACGAA
CTCCGGCTCT ACGTCGACGG CGAGCGGGTC GCCGCGACGA GCCACTCGGT CTCTGACCTC
CCGTCGAGCG ACGGGCCCGT CCGGATCGGT ATCGACGGCG ACGCCGAAAT CACCGTCGAC
GGTATCGGTA TCTACGACCG CGCCGTGAGC GACGACGAGG CGAGCGGCGC CGACGGCACG
CTCTCCGACG GCGCCGTCGT CGCCTACGAC TTCGCGGATC TGATCCGCGA CCAGAGCCTC
GTCGGCGGGT TCGTCTGGGA CTGGGTCAAC CAGGACCTCA ACGACGTCAC CGAGGACGGC
CAGGAGTTCC AGTTCTACCA CAGCGACGGC CCCGACGGGG CGTTCTGCCT CAACGGCCTG
GTCTGGTCCG ACCGCGATCC CCAGCCCGAG ATGTGGCAGC TCAAGCACAG CCACCAGCCG
GTCGGCGTCG CCGACGCCGC CGTCGAGGAC GGCGAGGTCT ACGTCTCGAA CCGCTACAAC
TTCACCGGCC TCGACGCGCT GGAGGGGTCG TGGGAGCTGA CCGCCGACGA CGAGACCGTC
AAATCCGGCG AGCTCGACCT CGACATCGAA CCCGGCGAGA CGCGCCGCGT CGACGTCCCG
GTCGGGGCGC CGTCCGATCC CGAACCGGGC GTCGAGTATC GGCTGAGCGT CTCGTTCGCG
CTCGCAGAGG CGACCGACTA CGCCGACGCC GGCCACGAGG TCGCCTTCGA GCAGCTCGAG
ATCCCGGTCG ACGCGCCAGA ACCGGAGCCG GAGAGCCTCG ACGCCATGGC ACCGCTCTCG
GTCGAGGAGG GCGACGACAT CGTCATCTCC GGCGACGGCT TCGAGTACGT CGTCTCCGGC
GACGCGGGGA CGCTCTCTTC GATGCGGTAC GGCGGGAGCG AACTCGTAGA GGACGGCCCG
CTCTTTAACG CGTGGCGCGC GCCGATCATG AACGAACACC AGCAGTGGGG CTCGGCGCCC
GCGTTCTCGT GGTACGAGGC CGGCCTCAAC GACCTGACCC ACTCCGTCGA CTCCGTCGAG
ACGAACGCGG TCGATGATTC GCTGGTGCAG CTCACGGTCG ACGGCTTCGC GGCGGGAACC
GAAGTACCGC CGCCGCTGCT GACACCGGAT GCCTCTGGTG AAGGGAACGA CGGCGAGGTC
GTCGGCGATC CGGACGTCGT CTCGGGCCAG AGCGGTCAGG CCGTCGCGCT CGACGGCGAG
TCTCAGCACG TCAACGCGGG CAACGACGCC AGCGTCGACT TCGGTGAGCC GGGATTCACG
ATCCAGGTGC GGTTCAAGGG CGTGCCCGCG AACGGCGAAC ACAACCCGTT CGTCAGCAAG
GGCGACCACC AGTACGCGCT GAAGATCTCC AGCAGCGACG AGTTCCAGTT CTTCATCTAT
CAGGACACCT GGATCACGCA CAACGCGCCG ATACCGTCGG GTCTGGCGGA AGACGAGTGG
CACACCCTGA CGGGGGTAGC GACCGACGAC GAACTGCAGC TGTACCTAGA CGGGGAGACG
CTGGGGAGTA GCGGCCACAG CGCGACCAGC GTCAATCAGT CGGCGTTCCC GGTTCATGTC
GGTCACAACG CGGAGAACAC GAACCGGTAC ACGGAGACGG CGATCGACGA GGTGCGGATG
TACGACCGGG CGCTGTCGCC GAGTGAAATC TCGTCCGGCT TCGACGAACC GCCGGAGAGC
GCGGTGTTGT GGTACGAACT GGACACGTTC GAGGAGGGCG AGTCGACGGT GATGGGCTTC
GAGACGCAGT ATCGCTACCG GATCTACGGT AGCGGAGACG TGCGGATGAA CGTCGAAGCC
GTTCCGAACG AGCCACTGCG CAACACGGTG AGCGGCTGGC TCCCGAAGGT CGGCGTCCAG
CTGGACCTGC CCGAGCGCTT CGACGCGTTC GAGTGGTACG GCCGCGGCGA GTTAGAGACC
TACCCCGACC GCAAGTGGAG CGTCCCGGTC GGGCGCTACG CCGGGTCGGT CGACGAGCAG
TACGTGCCGT ACCTGCCGCC GACCGACAAC GGCAACAAGG CCCAGACGCG CTGGGCGACG
CTCTCTGACG GCGAAGTCTC GCTGCTCGGG ATGCCCGGCG ACGAGGACGC CAACGTGAGT
CTCGAGCAGT GGTCGAACCT CGACGAGGCC GAGCACCAGT ACGAACTGGA GGAGCGCGGC
TCGATCGGGT TCAATCTCGA CCACCGCGTG ACCGGCGTCG GCGGGACGCC GACCGATCCG
ATCGACCGCT ACCAGGTCGA GGTCGAGCCG ACCGCATTCA GCGTGGTGCT CCGGCCGTTC
GATCCCGACG ACGCGGATCC GATGGAACTG GCGAACCGAC GACTGCCAGA CGCCGACGAG
TAG
 
Protein sequence
MSDSQQRETE TEQTPRGRLS PTRRKFLQAT GMAALATGVS AQSDAAAPSE GPGGPGRISN 
LAAYLEDPQT VAENVEPTHV TTAVPYGSVR AACQADEPFT ELESRFAESE YVRLLNGEWR
FAFHESPSEL PDSYDDLAAD DWGSIDVPGV WQTQGHGQRI YANNSITWQH YDPGQEGDLT
PGEDGTVDVP GVDDDGIDPV GTYRRTFSVP RDWDGRQTFL HFEAVKQAFF VWIDDEYVGF
RTGSMTAAEF DVTDHVEAGG DYEVTVQVYR WSDSEALETI DMFRYAGIFR NVYLFATPTV
HLRDFYARTD LDDDYEDGRL RIDAEIANYG DAPAGKYTVR AHLYETDRTK PGRGPPRGKG
KENAEAKGNR KKNGRGPGDN PPRGRKVTTV EATATVDEDG AVVTLEADVD DPAKWSAEHP
NLYQLALELV DARGETAEAM LEKVGFRTYE TTRGQQGAEV LVNGEAVDIQ GVNRHETDPD
FGRTVPIDRV REDLETMQRF NVNAIRTSHY PNDPSFYRLA DEYGIYVQDE VGVETHWWEG
LLAHTDAYHD QAVEQFRRMV LRDRNHASIF SWSTGNEAGT GAEHLEMASL AADSDEYIPD
DTSEVSGVEN VESFDGEAEG FAPDRILYHQ PNGGGWNVEY SDMLGPRYPD VGTLLSVADG
SYIGDGLRPV AMGEYNHAMG NSLGLVHEMW NEHIKPPARE ATDRSDADNP GVLVGTPEVV
PGPDVEPGAP DGAVVLGESD AIEIRRDESL DVDPGFSVGA TVSNVDADAD ASLVDEGRYA
LELSDGELVL SIGSDSVSAS LPAGGVDDWT TVVGVADADE LRLYVDGERV AATSHSVSDL
PSSDGPVRIG IDGDAEITVD GIGIYDRAVS DDEASGADGT LSDGAVVAYD FADLIRDQSL
VGGFVWDWVN QDLNDVTEDG QEFQFYHSDG PDGAFCLNGL VWSDRDPQPE MWQLKHSHQP
VGVADAAVED GEVYVSNRYN FTGLDALEGS WELTADDETV KSGELDLDIE PGETRRVDVP
VGAPSDPEPG VEYRLSVSFA LAEATDYADA GHEVAFEQLE IPVDAPEPEP ESLDAMAPLS
VEEGDDIVIS GDGFEYVVSG DAGTLSSMRY GGSELVEDGP LFNAWRAPIM NEHQQWGSAP
AFSWYEAGLN DLTHSVDSVE TNAVDDSLVQ LTVDGFAAGT EVPPPLLTPD ASGEGNDGEV
VGDPDVVSGQ SGQAVALDGE SQHVNAGNDA SVDFGEPGFT IQVRFKGVPA NGEHNPFVSK
GDHQYALKIS SSDEFQFFIY QDTWITHNAP IPSGLAEDEW HTLTGVATDD ELQLYLDGET
LGSSGHSATS VNQSAFPVHV GHNAENTNRY TETAIDEVRM YDRALSPSEI SSGFDEPPES
AVLWYELDTF EEGESTVMGF ETQYRYRIYG SGDVRMNVEA VPNEPLRNTV SGWLPKVGVQ
LDLPERFDAF EWYGRGELET YPDRKWSVPV GRYAGSVDEQ YVPYLPPTDN GNKAQTRWAT
LSDGEVSLLG MPGDEDANVS LEQWSNLDEA EHQYELEERG SIGFNLDHRV TGVGGTPTDP
IDRYQVEVEP TAFSVVLRPF DPDDADPMEL ANRRLPDADE