Gene Htur_3903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3903 
Symbol 
ID8744531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp148514 
End bp151705 
Gene Length3192 bp 
Protein Length1063 aa 
Translation table11 
GC content68% 
IMG OID646514487 
ProductBeta-galactosidase 
Protein accessionYP_003405434 
Protein GI284167156 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.120366 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCGAG ACTGGGCCGA CCCGGAGACG GTCGGTCGAA ATCGGATCGA TCCGCACGCG 
TACGGCCTTC CGTACGCCGA GACGGACACC GCGACCGCGG GAAACCGAGC GGCCTCGCCC
TGGATCGCGT CGCTGAACGG CGAGTGGCGG TTCCGGTTGG CGGAGACGCC GACCGCCGCG
CCCGACGGGT TCCACGAGCC GGACGCCGAC GTCGGCGACT GGGACCGTAT CGAGGTGCCC
CAGCACTGGC AGACCGCCGG CTACGGCGAT CCCCACTACA CGAACGTGGT CTACCCGTTC
CCGCTCGATC CGCCCCACGT CCCGACCGAG AACCCGACCG CGTCGTACCG CCGGACGTTC
CACGTCCCCG ACGACTGGGA CGAGCGCCAG ATCCGACTCC GATTCGGCGG CGTCGACTCC
GCGTTCCACC TCTGGATCAA CGGCGAGGAA GTCGGGTACA GCGAGGGGAG CCGGCTCCCG
TCGGCGTTCG ACGTCACCGA CTACGTCTCG CCGGGCGAGA ACACGGTCGC CGTCCGCGTC
TACAAGTGGT CGACCGGGAG CTACCTCGAG GACCAGGACA TGTGGTGGCT CAGCGGGATC
TTCCGGGACG TCGCCCTCTC GGCTCACCCG ACGGTACAGG TCGCGGACGT GGACGTCCGG
ACCGACCTCG ACGAGCGATA TGAGGACGCC GTCCTGCAGG CGTCCGTCGA CGTACGCAAC
GTCGGCGACG ACGCCGGGAC GGCCCGAATC GAACCGACGC TGCGCGATGC GGACGGAACG
CCGGTCTCGA CGACGCTCGA GGCGCGGTCC GTCGCGCTCG AGGCCGGCGA GGCGACGACC
CTCGAGTTCG AGACGACCGT CGAGGAGCCC CGCAAGTGGA CCGCGGAGAC GCCCAACTGC
TACGATTTCG CGCTCGGTAT CTCCGACGGA CGGGGCGACG ACGAGACGGT CCTCGCGCAG
ACGGTCGGCT TCCGTGAAAT CGAGATCGTC GACGGACAGT TGCTGGTCAA CGGCCGACCG
GTGACGATCC GCGGCGTCAA CCGCCACGAC TTCCACCCCG ACCGCGGCCG CGCCGTCCCG
CTCGAGGCGA TGCGGGAGGA CGTCGAGCTG ATGAAGCGGC ACAACATCAA CGCGGTTCGT
ACCGCCCACT ACCCGAACGA TCCGCGGTTC TACGAGCTCT GTAACGAGTA CGGGCTCTAC
GTGCTCGACG AGACCGACCT CGAGTGCCAC GGGATGGTCC ACGCGGAGAC GACCGAGCAC
GTAAGCGACG ATCCCGACTG GGAAGCCGCG TACGTCGACC GGATGGTTCG GATGGTCGAG
CGCGACAAGA ACCACCCCAG CGTCATCTGC TGGTCGCTGG GCAACGAGTC GGACCTCGGG
GCCCACCACG AACGGATGGC CGCGGCGACG CGCGAGCGCG ATCCGACGCG GCCGATCCAC
TACGAACCCG ACACGGAGCA GACGGTCTCC GATATCATCG GGCCGATGTA CCCGCCCTTC
GAGCAACTCG AGGAATGGGC CGAGGCGGAC CTCGAGCATC CGGTCGTGCT CTGCGAGTAC
GCCCACGCGA TGGGGAACGG ACCGGGGAAC CTCCGGGAGT TCTGGGACCT CTTTTACGAG
CACGAGGGGA TGCAGGGCGG CTTCGTCTGG GACTGGATCG ATCAGGGACT CCGGCGGACG
GCCGACGACG GGACGGAGTG GTTCGCCTAC GGCGGCGACT TCGGCGACGA ACCGAACGAC
GCGAACTTCA ACATCAACGG GCTCGTCTTC CCCGATCGGA AGCCCTCGCC CGGACTCACC
GAGTACAAGA AGGTCATCGA GCCGGTCGTC CTGCGCGAGG ACGATCTCGA GCGCGGGGAG
CTCACCGTCG AGAACCGGTA CGATTTCCGG TCGCTCGAGC ACCTCCGCGC CTCCTGGCGC
CTGCTATCCG ACGGCCGCGT CGTCGAGAGC GGACGGCTGC CGCTGCCCTC GATCGCCGCC
GGCGAGTCCG CGACGGTCAC GGTTCCCGTC GACGTGGACG GACTCGAGAC AGACGGACTC
GATGCGGACG CCGAACACGT CCTCACTGTC GACGTCTCGC TTGCCCGCGA GACGGCGTGG
GCGCCGCAGG GACACACGGT CGCGACCGGG CAGTTCGAGC TTCCGGAAAG CGGATCCGGG
ACCGGCTCCG CTTCGCAGCC GTCGACCGGC GTCGCCGCGC CGCTGACGTG TGCGGGAGAC
GGGGAGGAGA TCCGCGTTTC GAACGAGCAG TTCGAACTGG TCTTCGATCG CACGTTCGGC
GTCATCGACT CGCTCGCGTA CCGGAATCGG TCGCTGTTGG AGGACGGTCC GTCGGTCGGA
ATCTGGCGCG CGCCGACGGA CAACGACGGG GGGCTCCCGC TCTCGCGGAC GCTCCTCTCG
CAATTCACCG AACGCTACGA GAACGAGGAA CTCGTTCAGG CGGGGGACCT CGCGACCGTC
GGGTTCGAAC AGCTCTGGCG GGAGCACGGG CTCGATCGGC TGCAGTTCCG CGTCGACGAC
GTCACGTGTG TTCGGGGCGA GCGAGACGCC GATCCCGTTA CGATCACCGT CGACGGCCGC
CTCGCGCCGC CGATATACGA CCACGGGTTC GCAGTCGAGC AGACGTACAT GATCGAGCGC
ACCGGTGCGA TAACCGTCGA CACCGCGATC AAGCCCGAAG GAGACCTGTC GCTGCTGCCC
TCGCTCCCTC GAGTCGGGCT CGATCTCACG CTCGAAGACG ACCTCGATCG GGTCACGTGG
TACGGACGCG GGCCAGGCGA GTCGTACGTC GACAGCAAGG AGGCCGCCCT GCTCGGCCGG
TACAGTCGCT CGGTCGCCGA TCTGCAGACG CCCTACGTCG CCCCCCAGGA GAGCGGGAAC
CGAACGGACA CCCGCTGGGT GACGTTCACC GACCAGCGCG GGACCGGCCT CTTCGTCACC
GGCGAAACGC CGTTCGATTT CAGCGCACAC CCCTTCAGTA CCGCCGATCT CGACGCTGCC
GGGCACACGC ACGAGCTTCC GGATCGAGAC GGCGTCTGGG TTTCGCTCGA CGACGGCCAC
TGTGGGCTCG GGACCGGAAG CTGCGGACCG CCGACGCTCG AGGAGTACCG ACTCGAGCCA
GAGCCGATCT CGTTCCGTAT GGAACTACAC CCGTTCGCTG CAGACGAGCT TCCGGCGACC
GATCGGTACT GA
 
Protein sequence
MTRDWADPET VGRNRIDPHA YGLPYAETDT ATAGNRAASP WIASLNGEWR FRLAETPTAA 
PDGFHEPDAD VGDWDRIEVP QHWQTAGYGD PHYTNVVYPF PLDPPHVPTE NPTASYRRTF
HVPDDWDERQ IRLRFGGVDS AFHLWINGEE VGYSEGSRLP SAFDVTDYVS PGENTVAVRV
YKWSTGSYLE DQDMWWLSGI FRDVALSAHP TVQVADVDVR TDLDERYEDA VLQASVDVRN
VGDDAGTARI EPTLRDADGT PVSTTLEARS VALEAGEATT LEFETTVEEP RKWTAETPNC
YDFALGISDG RGDDETVLAQ TVGFREIEIV DGQLLVNGRP VTIRGVNRHD FHPDRGRAVP
LEAMREDVEL MKRHNINAVR TAHYPNDPRF YELCNEYGLY VLDETDLECH GMVHAETTEH
VSDDPDWEAA YVDRMVRMVE RDKNHPSVIC WSLGNESDLG AHHERMAAAT RERDPTRPIH
YEPDTEQTVS DIIGPMYPPF EQLEEWAEAD LEHPVVLCEY AHAMGNGPGN LREFWDLFYE
HEGMQGGFVW DWIDQGLRRT ADDGTEWFAY GGDFGDEPND ANFNINGLVF PDRKPSPGLT
EYKKVIEPVV LREDDLERGE LTVENRYDFR SLEHLRASWR LLSDGRVVES GRLPLPSIAA
GESATVTVPV DVDGLETDGL DADAEHVLTV DVSLARETAW APQGHTVATG QFELPESGSG
TGSASQPSTG VAAPLTCAGD GEEIRVSNEQ FELVFDRTFG VIDSLAYRNR SLLEDGPSVG
IWRAPTDNDG GLPLSRTLLS QFTERYENEE LVQAGDLATV GFEQLWREHG LDRLQFRVDD
VTCVRGERDA DPVTITVDGR LAPPIYDHGF AVEQTYMIER TGAITVDTAI KPEGDLSLLP
SLPRVGLDLT LEDDLDRVTW YGRGPGESYV DSKEAALLGR YSRSVADLQT PYVAPQESGN
RTDTRWVTFT DQRGTGLFVT GETPFDFSAH PFSTADLDAA GHTHELPDRD GVWVSLDDGH
CGLGTGSCGP PTLEEYRLEP EPISFRMELH PFAADELPAT DRY