Gene Information |
Locus tag | Htur_4208 |
Symbol | |
ID | 8744836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 477587 |
End bp | 479350 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 646514755 |
Product | Beta-galactosidase |
Protein accession | YP_003405702 |
Protein GI | 284167424 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
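The coordinate, gene-length, and protein-length fields in the table above are arithmetically linked. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Htur_4208.
length = gene_length_bp(477587, 479350)
print(length, protein_length_aa(length))  # prints: 1764 587
```

Under these assumptions the computed values agree with the listed gene length (1764 bp) and protein length (587 aa).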
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATTTGT TATCGTACCC ACGGGAGTCC GTCTCGCTCG ACGGAACGTG GCAAGCGATC CCCGACCAGT ATGAGATGTA CGACGGCTAC TTCGAAGATT TCGTCGATGA CGACGACGAC GATAACCCTG CCGGATTTTC ACCCAAATCG ATCTACGAAC TCGGCGCCTC CGAGCAAGAG GGGATGCCGG TCGATTTCAA CGTCCACGAC GGCTATTCCG TCGATGTCCC GGCGAGCTGG GGAGAAGAGA TCACCGAGTT CCGCCATTAC GAGGGCTGGG TCTGGTTCGC GAAGACGTTC GACTGGGACG CCGATACGGC CGGCGATAGC TCACACCTCA AATTCGGTGC AGTCAATTAC AGAGCGGAAG TCTGGCTCAA CGGCGAACGA CTGGGCGAAC ACGAAGGGGG GTTCACGCCG TTCAGTTTCG ACGTCACCGA CGAGCTGGTC GACGGAGAGA ACCTGCTAAT CGTCAAAGTC GACAACAAGC GCTACGACGA CGGGATCCCC AACGCCAGCA CCGACTGGTT CAACTTCGGC GGGATCAACC GGTCGGTCGA GGTCGTCTCG GTGCCGGAGA CATACGTTCG CAACTACAAG CTCGAGACGG AGCTGTCCGA AGACAGCGTC GACCTCCAGC TCGACGCGTG GGTCGAAAAC GCCGTCGACG ATACCGAAGT GACGGCCTCG TTCCCCAAAC TGGACGTATC GATAGAGCTA ACCGCTGACG ATGACGGGGT TTTCACCGGG GAAGCAACGC TCTCTCGAGA TGACGTCACC CTGTGGAGCC CCTCGGATCC GCAGCTGTAC ACCGTTCGAG TCGCAGCCGA CGACGATACG ATCGAGGACG AGGTCGGGCT TCGCGAAGTC GACGTCGTCG ACGGCGATCT GCTACTCAAC GGCGAGGAGA TCTGGCTCAG GGGGATCGCG CTGCACGAGG AGTCCGCCGG AAAGGGGCGT GCGCTCAACC TCGAGGACGT CGAAGAGCGG TTCGAGTGGA TCACGGAGCT CGGCTGTAAC TACGCCCGGC TCGCGCACTA CCCGCACACC GAAGCGATGG CGCGGAAAGC CGACGAGGAG GGGCTCATCC TCTGGGAAGA GATCCCGGCC TACTGGCACA TCAACTTCGG TGACGAGGAG ATCCAGGAGC TGTACCGTCA GCAGCTCCGA GAGCTGATCC AGCGCGACTG GAACCGGGCG TCGGTCGCCC TCTGGTCGAT CGCCAACGAA ACCGACCACA AGGACGATAC CCGAAACGAA GTGCTCCCGG AGATGGCCGA CTACGTCCGC GAACTAGACG ACACCCGGCT CGTCACCGCC GCGTGCTTCG TCGACGAAAC CGATGATGGA ATCGTTCTCA AGGATCCGCT GCAAGAGCAC CTCGACGTGG TCGGGATCAA CCAGTACTAC GGCTGGTACT ACGGCGACGC CGACGACATG GAGCAGTTCC AGGAGAACCC CGATGGGACG CCGGTCCTGA TCTCCGAGAC CGGTGGAGGT GCGAAGTGGG GCCACCACGG TGACGAGGAC GAGCGCTGGA CCGAGGAGTT CCAGGCCGCG ATCTATCGCG GACAAACGGA TGCGATCGAC GGAAACGATC AGATCGCCGG GATGGCTCCG TGGATCCTCT TCGACTTCCG GGCTCCGATG CGGCAGAACG ACCACCAGCG CGGCTACAAT CGCAAGGGTC TCGTTGATCA ACACGGCCGC AAGAAGCAGG CGTTCCACGT ACTCCGGGGG TTCTATCAGG AAAAACGGTC CTAA
|
Protein sequence | MHLLSYPRES VSLDGTWQAI PDQYEMYDGY FEDFVDDDDD DNPAGFSPKS IYELGASEQE GMPVDFNVHD GYSVDVPASW GEEITEFRHY EGWVWFAKTF DWDADTAGDS SHLKFGAVNY RAEVWLNGER LGEHEGGFTP FSFDVTDELV DGENLLIVKV DNKRYDDGIP NASTDWFNFG GINRSVEVVS VPETYVRNYK LETELSEDSV DLQLDAWVEN AVDDTEVTAS FPKLDVSIEL TADDDGVFTG EATLSRDDVT LWSPSDPQLY TVRVAADDDT IEDEVGLREV DVVDGDLLLN GEEIWLRGIA LHEESAGKGR ALNLEDVEER FEWITELGCN YARLAHYPHT EAMARKADEE GLILWEEIPA YWHINFGDEE IQELYRQQLR ELIQRDWNRA SVALWSIANE TDHKDDTRNE VLPEMADYVR ELDDTRLVTA ACFVDETDDG IVLKDPLQEH LDVVGINQYY GWYYGDADDM EQFQENPDGT PVLISETGGG AKWGHHGDED ERWTEEFQAA IYRGQTDAID GNDQIAGMAP WILFDFRAPM RQNDHQRGYN RKGLVDQHGR KKQAFHVLRG FYQEKRS
|
| |
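The gene sequence above is displayed in blocks of ten bases, already in coding orientation (it starts with ATG and ends with TAA even though the gene lies on the minus strand). The sketch below shows one way to strip the block spacing, estimate GC content, and translate the sequence. It assumes that translation table 11 assigns amino acids exactly as the standard genetic code does (the two differ only in permitted start codons). The helper names are hypothetical, and the snippet uses only the first 30 bases of the record as sample input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same assignments and differs only in its start codons).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
fragment = "ATGCATTTGT TATCGTACCC ACGGGAGTCC"
print(translate(fragment))  # prints: MHLLSYPRES
```

The translated fragment matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and protein sequence fields to be cross-checked in the same way.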