Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3122 |
Symbol | |
ID | 8743742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 3206745 |
End bp | 3209663 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646513706 |
Product | Beta-galactosidase |
Protein accession | YP_003404660 |
Protein GI | 284166381 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAGAG ACAGCCATGA GGAAGATCGG AGAACGGTGC CTATCAGCAG ACGGACCCTC CTCTCGAGCG TCCCTGGAGC CGCGATCGGC GCTGCGCTCC CGATCGGGAC CGCAAGTGCA CGAGATGACG GCGACGACGA GTCACTGGGT CGGCTCACGC CCCGGCCCGC GACGACGGGC GACGGTGACG TACGATCGCT GAACGGAACG TGGGAGTTCG CGCTCTCGAC AACCATCGAC GACCAAGGCG CCGAGTGGCG CGAGGCGGAG GTGCCAGGCC AGTGGGGATA CAGCGAGTCC GCCATCCCAG AGGGACCAGC GGAGTGGTAT CCCCCGGAGG GGCAACTGGG GTGGTATCGC CGAGAGTTCG AGGCCCCCAA GAGCGACCGT GAACGACTTC TCCTCCGGTT CGACGCGGTG TACAGCGAAG CGAGGGTGTA CCTCAACGGA GCCGAGATCG GCCATCACGT CGGCGGCTAC ACCCCCTTCG AGATCGACGT CACCGACCAC GTCGAGGCGG GGACGAACGT GCTCTCGGTC GGCGTCGCCC AGGCGTCGCC GGCGGACGAT ATGGCCTGGC AGAACGTCAC CGGTGGGATC ACGCGGGACG TGACGCTCGT CTCGGTTCCC GAGATTCACA TCGCCGATTA CGACGTTCGT ACCCACCTAC AGGGGTCGTC AGCGACAGTC GACGTCGAGA CGAGAATCGA AAACGCGAGT GGAGCGGACG CCGATGCGAC GCTGGAGGTG ACGCTTTCGG ACCCGAACGG TGAGACGGTC GCGACCACAG AACGCTCGCT TTCCTCGCGG GAAGGTGGCT CTTGTGACCC CTCGACGACC CTCGAGGTGG CCGACCCGAA CACGTGGAAT CCGGAGGAGC CACAGCTCTA TACCCTCGAG ATCGACCTCA ACGCCGGGGA TTCGACCGAG CGGGTGACCC AGCGCCTCGG GATTCGAGAG ATCGAAGTCG TGGGGGACGA ACTCCGGCTC AACGGCGAGG CAGTGACGCT GCGGGGTGTC AACTGGGAGG AGATTCACAT CCCCGATCAC GGTCACGCGG TCCCGCCGGA ACTCACTCGT GAGGACGCTC GCCGACTAAA GGAGGCGAAC GTCAACTACG TGCGGACGGC CCATCATCCG ACCTCGGAGG CGTTTCTCGA CGCCTGCGAC GAGCTCGGGC TCGTCGTTGA AGTCGAAGCG CCCCACACGT TCGTCGGCCG CGGCCGGGGA GACCCCTATC CGGAGATAGT CGTTTCACAG ACCGTAGAGA TGGTCGAGCG TGACAAGAAC CGGACGTCGG TCTGTCTCTG GTCGATCGCG AACGAGTCGG AGTGGTACGA TGCCTTCGAG ACCGCCGGCC GGCTAACCAA GGCAATCGAT CCGACGCGGC CGACGATTTT CAACTACGAC AACTACGATC CCGACGACCC CTGGCACGAC GTCTACGACG TCCGTTCGCA ACACTACCCC GCGTTTCGCG CGGACTCGAC GATCGAAGAG TACCTCGGTC TCGACGATCC GATCCTGTTC GATGAGTACG CTCACACCTA CTGTTACAAC GGCCGGGAGT TGGTGACCGA TCCCGGACTC CGCGACCAAT GGGGGATTCC CTTCGAGCGG ATCTGGGAGC GCTGTCGGGC CGGCGACTCG GTCGCCGGCG GCGCGATCTG GGCCGGCGGT GACCACCTGG AGCGGTGGAG AGGGTACCTC TGGGGGCTAC TCGACCGCCA CCGCCGGCCA CGACCCGAAT ACTGGCACGT CAAGAAAATC TACTCGCCGG TTCGAGTCGT CGAGACGGAG TGGCTCGGCA ACGGCAACGT TCTCCGATTG ACTATCGAGA ACCGTCACGA GTTCGTCGAC CTCGCCGACC GCTCGATCGA GTTCGAGGGG GCCCGAAACT CCGGAAACCG CCCGATCGAG GCGGCGCCGG GCGAGCGCGT GACAGTGACG GTCCCGGTGG CCGAGGACCG GCTGGAACTC AGGGTCACCC ATCCACACGG CCACACGATC GAGCAGGCCG TTTTCACCGT TGACTCGCCG GGGTGCGAAA GCTACCCGGT ACCGACGGGG ACGCCACTCG AAATCGACGA GGAATCGGTG CGGACGACCG AGGGTCCACC GATGTCGGTC GACCGGAATA CCGGGCGCGT TGAGGTCCAA CCAGAGGACG GGCAGGGAAC GCCGGTCGTG GTCGGCGGAC CGGAGTTGGT CCTGACACCG ACCGAGGCGG AAGCGAGTCA GGAGGGCTCG GGGGCGATCG ACCACCGTCC CGACGGGCGA ACGGTTACCG ACGTGAGGGT CGTCGAGGAC GGTGCGGCCG TCGCCATCGA CGTCGAGTAC GCGGTCGCGA CCGGAACGTT CGTCCTTCGA CCCGTCGACG GCGGCGTCGA AGTCGAGTAC GAGTTCGAGA TCGAGGAGGC ACTCGACGTC CGCGAGGTCG GCGTTGCACT CCCGCTCACA CGCGACCTGA CGACGCTGTC GTGGCGTCGA GAGGGCCAGT GGAGCACCTA CCCCGACGAC CACATCGGGC GGACTGAAGG GACGGCCGTG GCGTTCCCCG AGGGAACCCG GCCGGACCAC GAGGAGATCC GCCTCGGTAG CGACCGGCCC TGGAAGGACG ACGCGACGAG CCACGGCTCC AATGACTTTC GGGGAACGAA GCGAAACGTC TACACCGCGG CCCTGGTGAA CGACCATGGT GCCGGCGTTC AGCTCCGCTC TGAGGGCGAC CACCACGTCC GTGCCCAGGT TCGTTCGGAG TCGGTCGATC TGCTCGCGCT CGAGCGCTCG CTGTCGGGGA CCAACCCCTT CGGGTGGATG AACCGCCAGC CGGTCCTGAA TGAGGACCCC ACGATCGAGG CGGATGAGAT CGTCCGCGGA AACGCAGCGT TCGAGATCAG AGGGTCTCCA TCGGTGTAA
|
Protein sequence | MDRDSHEEDR RTVPISRRTL LSSVPGAAIG AALPIGTASA RDDGDDESLG RLTPRPATTG DGDVRSLNGT WEFALSTTID DQGAEWREAE VPGQWGYSES AIPEGPAEWY PPEGQLGWYR REFEAPKSDR ERLLLRFDAV YSEARVYLNG AEIGHHVGGY TPFEIDVTDH VEAGTNVLSV GVAQASPADD MAWQNVTGGI TRDVTLVSVP EIHIADYDVR THLQGSSATV DVETRIENAS GADADATLEV TLSDPNGETV ATTERSLSSR EGGSCDPSTT LEVADPNTWN PEEPQLYTLE IDLNAGDSTE RVTQRLGIRE IEVVGDELRL NGEAVTLRGV NWEEIHIPDH GHAVPPELTR EDARRLKEAN VNYVRTAHHP TSEAFLDACD ELGLVVEVEA PHTFVGRGRG DPYPEIVVSQ TVEMVERDKN RTSVCLWSIA NESEWYDAFE TAGRLTKAID PTRPTIFNYD NYDPDDPWHD VYDVRSQHYP AFRADSTIEE YLGLDDPILF DEYAHTYCYN GRELVTDPGL RDQWGIPFER IWERCRAGDS VAGGAIWAGG DHLERWRGYL WGLLDRHRRP RPEYWHVKKI YSPVRVVETE WLGNGNVLRL TIENRHEFVD LADRSIEFEG ARNSGNRPIE AAPGERVTVT VPVAEDRLEL RVTHPHGHTI EQAVFTVDSP GCESYPVPTG TPLEIDEESV RTTEGPPMSV DRNTGRVEVQ PEDGQGTPVV VGGPELVLTP TEAEASQEGS GAIDHRPDGR TVTDVRVVED GAAVAIDVEY AVATGTFVLR PVDGGVEVEY EFEIEEALDV REVGVALPLT RDLTTLSWRR EGQWSTYPDD HIGRTEGTAV AFPEGTRPDH EEIRLGSDRP WKDDATSHGS NDFRGTKRNV YTAALVNDHG AGVQLRSEGD HHVRAQVRSE SVDLLALERS LSGTNPFGWM NRQPVLNEDP TIEADEIVRG NAAFEIRGSP SV
|
| |