Gene Htur_3122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3122 
Symbol 
ID8743742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp3206745 
End bp3209663 
Gene Length2919 bp 
Protein Length972 aa 
Translation table11 
GC content66% 
IMG OID646513706 
ProductBeta-galactosidase 
Protein accessionYP_003404660 
Protein GI284166381 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAGAG ACAGCCATGA GGAAGATCGG AGAACGGTGC CTATCAGCAG ACGGACCCTC 
CTCTCGAGCG TCCCTGGAGC CGCGATCGGC GCTGCGCTCC CGATCGGGAC CGCAAGTGCA
CGAGATGACG GCGACGACGA GTCACTGGGT CGGCTCACGC CCCGGCCCGC GACGACGGGC
GACGGTGACG TACGATCGCT GAACGGAACG TGGGAGTTCG CGCTCTCGAC AACCATCGAC
GACCAAGGCG CCGAGTGGCG CGAGGCGGAG GTGCCAGGCC AGTGGGGATA CAGCGAGTCC
GCCATCCCAG AGGGACCAGC GGAGTGGTAT CCCCCGGAGG GGCAACTGGG GTGGTATCGC
CGAGAGTTCG AGGCCCCCAA GAGCGACCGT GAACGACTTC TCCTCCGGTT CGACGCGGTG
TACAGCGAAG CGAGGGTGTA CCTCAACGGA GCCGAGATCG GCCATCACGT CGGCGGCTAC
ACCCCCTTCG AGATCGACGT CACCGACCAC GTCGAGGCGG GGACGAACGT GCTCTCGGTC
GGCGTCGCCC AGGCGTCGCC GGCGGACGAT ATGGCCTGGC AGAACGTCAC CGGTGGGATC
ACGCGGGACG TGACGCTCGT CTCGGTTCCC GAGATTCACA TCGCCGATTA CGACGTTCGT
ACCCACCTAC AGGGGTCGTC AGCGACAGTC GACGTCGAGA CGAGAATCGA AAACGCGAGT
GGAGCGGACG CCGATGCGAC GCTGGAGGTG ACGCTTTCGG ACCCGAACGG TGAGACGGTC
GCGACCACAG AACGCTCGCT TTCCTCGCGG GAAGGTGGCT CTTGTGACCC CTCGACGACC
CTCGAGGTGG CCGACCCGAA CACGTGGAAT CCGGAGGAGC CACAGCTCTA TACCCTCGAG
ATCGACCTCA ACGCCGGGGA TTCGACCGAG CGGGTGACCC AGCGCCTCGG GATTCGAGAG
ATCGAAGTCG TGGGGGACGA ACTCCGGCTC AACGGCGAGG CAGTGACGCT GCGGGGTGTC
AACTGGGAGG AGATTCACAT CCCCGATCAC GGTCACGCGG TCCCGCCGGA ACTCACTCGT
GAGGACGCTC GCCGACTAAA GGAGGCGAAC GTCAACTACG TGCGGACGGC CCATCATCCG
ACCTCGGAGG CGTTTCTCGA CGCCTGCGAC GAGCTCGGGC TCGTCGTTGA AGTCGAAGCG
CCCCACACGT TCGTCGGCCG CGGCCGGGGA GACCCCTATC CGGAGATAGT CGTTTCACAG
ACCGTAGAGA TGGTCGAGCG TGACAAGAAC CGGACGTCGG TCTGTCTCTG GTCGATCGCG
AACGAGTCGG AGTGGTACGA TGCCTTCGAG ACCGCCGGCC GGCTAACCAA GGCAATCGAT
CCGACGCGGC CGACGATTTT CAACTACGAC AACTACGATC CCGACGACCC CTGGCACGAC
GTCTACGACG TCCGTTCGCA ACACTACCCC GCGTTTCGCG CGGACTCGAC GATCGAAGAG
TACCTCGGTC TCGACGATCC GATCCTGTTC GATGAGTACG CTCACACCTA CTGTTACAAC
GGCCGGGAGT TGGTGACCGA TCCCGGACTC CGCGACCAAT GGGGGATTCC CTTCGAGCGG
ATCTGGGAGC GCTGTCGGGC CGGCGACTCG GTCGCCGGCG GCGCGATCTG GGCCGGCGGT
GACCACCTGG AGCGGTGGAG AGGGTACCTC TGGGGGCTAC TCGACCGCCA CCGCCGGCCA
CGACCCGAAT ACTGGCACGT CAAGAAAATC TACTCGCCGG TTCGAGTCGT CGAGACGGAG
TGGCTCGGCA ACGGCAACGT TCTCCGATTG ACTATCGAGA ACCGTCACGA GTTCGTCGAC
CTCGCCGACC GCTCGATCGA GTTCGAGGGG GCCCGAAACT CCGGAAACCG CCCGATCGAG
GCGGCGCCGG GCGAGCGCGT GACAGTGACG GTCCCGGTGG CCGAGGACCG GCTGGAACTC
AGGGTCACCC ATCCACACGG CCACACGATC GAGCAGGCCG TTTTCACCGT TGACTCGCCG
GGGTGCGAAA GCTACCCGGT ACCGACGGGG ACGCCACTCG AAATCGACGA GGAATCGGTG
CGGACGACCG AGGGTCCACC GATGTCGGTC GACCGGAATA CCGGGCGCGT TGAGGTCCAA
CCAGAGGACG GGCAGGGAAC GCCGGTCGTG GTCGGCGGAC CGGAGTTGGT CCTGACACCG
ACCGAGGCGG AAGCGAGTCA GGAGGGCTCG GGGGCGATCG ACCACCGTCC CGACGGGCGA
ACGGTTACCG ACGTGAGGGT CGTCGAGGAC GGTGCGGCCG TCGCCATCGA CGTCGAGTAC
GCGGTCGCGA CCGGAACGTT CGTCCTTCGA CCCGTCGACG GCGGCGTCGA AGTCGAGTAC
GAGTTCGAGA TCGAGGAGGC ACTCGACGTC CGCGAGGTCG GCGTTGCACT CCCGCTCACA
CGCGACCTGA CGACGCTGTC GTGGCGTCGA GAGGGCCAGT GGAGCACCTA CCCCGACGAC
CACATCGGGC GGACTGAAGG GACGGCCGTG GCGTTCCCCG AGGGAACCCG GCCGGACCAC
GAGGAGATCC GCCTCGGTAG CGACCGGCCC TGGAAGGACG ACGCGACGAG CCACGGCTCC
AATGACTTTC GGGGAACGAA GCGAAACGTC TACACCGCGG CCCTGGTGAA CGACCATGGT
GCCGGCGTTC AGCTCCGCTC TGAGGGCGAC CACCACGTCC GTGCCCAGGT TCGTTCGGAG
TCGGTCGATC TGCTCGCGCT CGAGCGCTCG CTGTCGGGGA CCAACCCCTT CGGGTGGATG
AACCGCCAGC CGGTCCTGAA TGAGGACCCC ACGATCGAGG CGGATGAGAT CGTCCGCGGA
AACGCAGCGT TCGAGATCAG AGGGTCTCCA TCGGTGTAA
 
Protein sequence
MDRDSHEEDR RTVPISRRTL LSSVPGAAIG AALPIGTASA RDDGDDESLG RLTPRPATTG 
DGDVRSLNGT WEFALSTTID DQGAEWREAE VPGQWGYSES AIPEGPAEWY PPEGQLGWYR
REFEAPKSDR ERLLLRFDAV YSEARVYLNG AEIGHHVGGY TPFEIDVTDH VEAGTNVLSV
GVAQASPADD MAWQNVTGGI TRDVTLVSVP EIHIADYDVR THLQGSSATV DVETRIENAS
GADADATLEV TLSDPNGETV ATTERSLSSR EGGSCDPSTT LEVADPNTWN PEEPQLYTLE
IDLNAGDSTE RVTQRLGIRE IEVVGDELRL NGEAVTLRGV NWEEIHIPDH GHAVPPELTR
EDARRLKEAN VNYVRTAHHP TSEAFLDACD ELGLVVEVEA PHTFVGRGRG DPYPEIVVSQ
TVEMVERDKN RTSVCLWSIA NESEWYDAFE TAGRLTKAID PTRPTIFNYD NYDPDDPWHD
VYDVRSQHYP AFRADSTIEE YLGLDDPILF DEYAHTYCYN GRELVTDPGL RDQWGIPFER
IWERCRAGDS VAGGAIWAGG DHLERWRGYL WGLLDRHRRP RPEYWHVKKI YSPVRVVETE
WLGNGNVLRL TIENRHEFVD LADRSIEFEG ARNSGNRPIE AAPGERVTVT VPVAEDRLEL
RVTHPHGHTI EQAVFTVDSP GCESYPVPTG TPLEIDEESV RTTEGPPMSV DRNTGRVEVQ
PEDGQGTPVV VGGPELVLTP TEAEASQEGS GAIDHRPDGR TVTDVRVVED GAAVAIDVEY
AVATGTFVLR PVDGGVEVEY EFEIEEALDV REVGVALPLT RDLTTLSWRR EGQWSTYPDD
HIGRTEGTAV AFPEGTRPDH EEIRLGSDRP WKDDATSHGS NDFRGTKRNV YTAALVNDHG
AGVQLRSEGD HHVRAQVRSE SVDLLALERS LSGTNPFGWM NRQPVLNEDP TIEADEIVRG
NAAFEIRGSP SV