Gene Htur_4208 details


Gene Information

Locus tag: Htur_4208
Symbol:
ID: 8744836
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013744
Strand:
Start bp: 477587
End bp: 479350
Gene Length: 1764 bp
Protein Length: 587 aa
Translation table: 11
GC content: 61%
IMG OID: 646514755
Product: Beta-galactosidase
Protein accession: YP_003405702
Protein GI: 284167424
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
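
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (consistent with the values in this record) and that the CDS is complete and ends in a stop codon. The function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a complete, unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(477587, 479350)
print(length_bp, protein_length(length_bp))  # 1764 587
```

Both results agree with the Gene Length and Protein Length fields of this record.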


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATTTGT TATCGTACCC ACGGGAGTCC GTCTCGCTCG ACGGAACGTG GCAAGCGATC 
CCCGACCAGT ATGAGATGTA CGACGGCTAC TTCGAAGATT TCGTCGATGA CGACGACGAC
GATAACCCTG CCGGATTTTC ACCCAAATCG ATCTACGAAC TCGGCGCCTC CGAGCAAGAG
GGGATGCCGG TCGATTTCAA CGTCCACGAC GGCTATTCCG TCGATGTCCC GGCGAGCTGG
GGAGAAGAGA TCACCGAGTT CCGCCATTAC GAGGGCTGGG TCTGGTTCGC GAAGACGTTC
GACTGGGACG CCGATACGGC CGGCGATAGC TCACACCTCA AATTCGGTGC AGTCAATTAC
AGAGCGGAAG TCTGGCTCAA CGGCGAACGA CTGGGCGAAC ACGAAGGGGG GTTCACGCCG
TTCAGTTTCG ACGTCACCGA CGAGCTGGTC GACGGAGAGA ACCTGCTAAT CGTCAAAGTC
GACAACAAGC GCTACGACGA CGGGATCCCC AACGCCAGCA CCGACTGGTT CAACTTCGGC
GGGATCAACC GGTCGGTCGA GGTCGTCTCG GTGCCGGAGA CATACGTTCG CAACTACAAG
CTCGAGACGG AGCTGTCCGA AGACAGCGTC GACCTCCAGC TCGACGCGTG GGTCGAAAAC
GCCGTCGACG ATACCGAAGT GACGGCCTCG TTCCCCAAAC TGGACGTATC GATAGAGCTA
ACCGCTGACG ATGACGGGGT TTTCACCGGG GAAGCAACGC TCTCTCGAGA TGACGTCACC
CTGTGGAGCC CCTCGGATCC GCAGCTGTAC ACCGTTCGAG TCGCAGCCGA CGACGATACG
ATCGAGGACG AGGTCGGGCT TCGCGAAGTC GACGTCGTCG ACGGCGATCT GCTACTCAAC
GGCGAGGAGA TCTGGCTCAG GGGGATCGCG CTGCACGAGG AGTCCGCCGG AAAGGGGCGT
GCGCTCAACC TCGAGGACGT CGAAGAGCGG TTCGAGTGGA TCACGGAGCT CGGCTGTAAC
TACGCCCGGC TCGCGCACTA CCCGCACACC GAAGCGATGG CGCGGAAAGC CGACGAGGAG
GGGCTCATCC TCTGGGAAGA GATCCCGGCC TACTGGCACA TCAACTTCGG TGACGAGGAG
ATCCAGGAGC TGTACCGTCA GCAGCTCCGA GAGCTGATCC AGCGCGACTG GAACCGGGCG
TCGGTCGCCC TCTGGTCGAT CGCCAACGAA ACCGACCACA AGGACGATAC CCGAAACGAA
GTGCTCCCGG AGATGGCCGA CTACGTCCGC GAACTAGACG ACACCCGGCT CGTCACCGCC
GCGTGCTTCG TCGACGAAAC CGATGATGGA ATCGTTCTCA AGGATCCGCT GCAAGAGCAC
CTCGACGTGG TCGGGATCAA CCAGTACTAC GGCTGGTACT ACGGCGACGC CGACGACATG
GAGCAGTTCC AGGAGAACCC CGATGGGACG CCGGTCCTGA TCTCCGAGAC CGGTGGAGGT
GCGAAGTGGG GCCACCACGG TGACGAGGAC GAGCGCTGGA CCGAGGAGTT CCAGGCCGCG
ATCTATCGCG GACAAACGGA TGCGATCGAC GGAAACGATC AGATCGCCGG GATGGCTCCG
TGGATCCTCT TCGACTTCCG GGCTCCGATG CGGCAGAACG ACCACCAGCG CGGCTACAAT
CGCAAGGGTC TCGTTGATCA ACACGGCCGC AAGAAGCAGG CGTTCCACGT ACTCCGGGGG
TTCTATCAGG AAAAACGGTC CTAA
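
The gene sequence above is laid out in blocks of 10 bases, so it has to be stripped of whitespace before its length or GC content can be checked against the Gene Length and GC content fields. The following is a minimal sketch, assuming the text is pasted in as a plain string; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGCATTTGT TATCGTACCC ACGGGAGTCC GTCTCGCTCG ACGGAACGTG GCAAGCGATC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Running the same two functions on the full sequence should give a length and GC fraction that can be compared with the values listed under Gene Information.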
 
Protein sequence
MHLLSYPRES VSLDGTWQAI PDQYEMYDGY FEDFVDDDDD DNPAGFSPKS IYELGASEQE 
GMPVDFNVHD GYSVDVPASW GEEITEFRHY EGWVWFAKTF DWDADTAGDS SHLKFGAVNY
RAEVWLNGER LGEHEGGFTP FSFDVTDELV DGENLLIVKV DNKRYDDGIP NASTDWFNFG
GINRSVEVVS VPETYVRNYK LETELSEDSV DLQLDAWVEN AVDDTEVTAS FPKLDVSIEL
TADDDGVFTG EATLSRDDVT LWSPSDPQLY TVRVAADDDT IEDEVGLREV DVVDGDLLLN
GEEIWLRGIA LHEESAGKGR ALNLEDVEER FEWITELGCN YARLAHYPHT EAMARKADEE
GLILWEEIPA YWHINFGDEE IQELYRQQLR ELIQRDWNRA SVALWSIANE TDHKDDTRNE
VLPEMADYVR ELDDTRLVTA ACFVDETDDG IVLKDPLQEH LDVVGINQYY GWYYGDADDM
EQFQENPDGT PVLISETGGG AKWGHHGDED ERWTEEFQAA IYRGQTDAID GNDQIAGMAP
WILFDFRAPM RQNDHQRGYN RKGLVDQHGR KKQAFHVLRG FYQEKRS
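
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code; table 11 differs in its permitted start codons, and this sketch does not handle alternative starts (not an issue here, since the gene begins with ATG). The `translate` helper is illustrative, and it raises `KeyError` on ambiguous bases such as N.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCATTTGT TATCGTACCC ACGGGAGTCC"))  # MHLLSYPRES
```

The output matches the first ten residues of the protein sequence listed above.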