Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4492 |
Symbol | |
ID | 8745121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 87205 |
End bp | 89874 |
Gene Length | 2670 bp |
Protein Length | 889 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646515029 |
Product | Beta-galactosidase |
Protein accession | YP_003405976 |
Protein GI | 284172594 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00800189 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGCTA GTTACCGAGA ATCGAGATCA CTGACCGGAC CGTGGGAGTT CGTCACCGAC CCGGAGTCCG ACGGACGCGC CGATAACTGG TACGCTCCGG ACGCAACCTG GCCCGATAGC CGTTGCACCG TCGACGTCCC CCACTGCTGG CAAGAGCACG ACGAGTACCG CGAATACACC GGGACCGCGT GGTACCGCCG TCGAACCGTC CTCCCGCGAT CCGGAGACGA TCGCACGGTC CTCCGGTTCG GCGCCGTCGA CTACGAGGCG ACGGTGTGGG TCAACGGCGA GCGAGTCGGC GAGCACCGCG GCGGCTATCT GCCCTTCGAG GTAGATATCA CCGAGGCGGT GACCGACGGC GAGGACGCGA TCGCCGTCGC GGTAACCGAT CCCGAAGACA TAAGCGAGAT CCCTCACGGC AAGCAGGGCG CGCCGTGGTA CCAGCGTGTC AGCGGGATCT GGCAGAACGT GACGGTAGCG ACCGTCCCGA AGTGTCGCAT CACCGACCTT CGTGCGACGC CGAACCTCGA GGACGACACG GTTCGGATCG ATCTCGACGT AGCGCCCGGG GGCGCGACCG ATCTGACCGC CGCCGTAACG ATCGAACGAG ACGGAGCGCC GGTCGCGAGT TCGAGCGTCG ACGTAGACGC GGGTTCGGGA ACCGCAGTGG TAGCCATCGA CGACCCCGAC TACTGGACGC CGGAGACGCC GACGCTGTAC GACGTCGTCG TGGAACTGTC GCGCGACGGC GCGGTCGTCG ACCGGTACGA GGACTACTTC GGCATGCGGA GCGTCGAGTC CCGCGACGGT CGACTGTACC TGAACGGCGA CCCGCTCTCT GTGCGGGGGG CACTCGATCA GGGATACTAC CCGGAGACGC TGTACCGGCC GTTTGACGAG GAGCTGTTCG AAGCCGAGAT CCGCACGGCG AAGGAGCTCG GATTCAACCT GCTTCGCAAG CACATCAAGC CGGCCCACCC GGACTTCCTG GAACTCGCGG ACCGGCTCGG GATTCTCGTC TGGGAAGAGC CCGCGAATCC GACGGTTCAC ACCGAGCGCT CGAGGCGGGA GGTCCGCGAC CAGATTCGAG GTATGATCGA CCGGGACTAC AACCGCCCCA GCGTGATCGT CTGGAGCCTA TACAACGAGG AGTGGGGGAT CGGGAACCCA CAGGGGCTCG ACGACGAGAC GTCGCTGTGG GAGGACGAAG CGAAACAGCG GTACCTCGCG GATCTCTACG ACTCGGCTCG GGAGTGGGAT CCCACGCGGC TCATCTGCGA TAACTCCGGC TGGGCCCACG TGGCGACCGA CGTGAACGAC TACCACCGGT ACTTCGTCAG TCCGGACCGC GCGGCCGCGT GGGAGGACGA CCTCGAGTCG ATGACGTCGT CGCCGGCGGA CAACTACGGC GCGACGGAGA CCGATCCGGC GGACGCGCCC CGGATCGTCT CCGAGTTCGG GACGTGGGGG ATGTGCGATC TGCCCGCGAT CGAGGAGTAC TACGGCGGTG AACCGCCGTG GTTCGACTAC GAGTTCTTCG ACGACCCGAT CAAGCGTCCC GCGGGAGTGC ACGATCGGTT CGAGGAATCG GCCCTCTCGG ACGTCTTCGA CGGCTGGTCG GCGATGGCCG AGGCGTGGCA GCGTCGCGAG TTCCGCTCGA TTACGGACGT CATCGAACGA ATGCGCACCC GCGAGGACGT CGCCGGATAC GTACTCACCG AGCTCTCCGA CATCGAGTGG GAGTTCAACG GGGTGCTGGA CTACCGCCGC GAGGAGAAGG CGTTCCACGA CGATTTCGCT CGGATAAACG GCGATATCCT CGTCAGCGTC GAGCCGGACG CCCACGTCGT CGCGTCGGGC GGGACGCTGA CCGCTGACGT GGTCGTCAGC AACGACACCA CCGACGCCGT CGAGGGTCCG CTCGAGTGGT CGGTCTTCGG GGAGTCGGGC ACGGTCGACG TGGAACTCGA CGGATTCGGC GTCGAGCGCG TCGAAAACGC GATCACGGTC GACGTCCCGT CCGTCGACGG CGTCCGCGAC GACGAGTTGA CGGTCTCGGT GCCGAGCGCG CCAGCGAACG GGGAACCGAT CACCGTCGTT CCCGAACCCA GTGCGGACGG CGAGACGACG GTGTACACCG ACGCCGAGGG GCTCGCTGAC GGACTGTCCG AGACCGGCTT CGAGGTCGCC TCGTCCCTCG ACGAGACCGT CGACGTGGCG CTGGTGATCC GTCCCGACGC TGCCGTTCGA TCGTACGTCG AAGACGGCGG AACCGCCGTG CTTGTCCCCG ACGAGAGCGG GCACATGGCG GACGAGGAGT TCTTCGAGTA CCGCGGTTTA CCGGAAGAGG AGAGCTGGAA CCTCGTGGCG TCGCTGTTCT ACTGTGCCGA CGAGACGCTG GCCGAGTACG TGGATACCGT CCCCGGGTGG GAACTCGAAG GGCTCTACCC CTACGACGTG GTCGCCGACG TCTCGGCCGA GGACGTGCTG ATCGGCTACG TCGAAGGCTG GATCGCGAAC CGATCGGCGG CAGTCGCCGT CCGCGAGGTC GGCGACGGCC GCGTCGGTGC GTTTACCTTC CGGATCACGG ACGCGTATGG CACACAGCCT GTCGGGACGG CGGCCCTCGC CGCACTACTC GACGACCTCG GGTCCGAGCA ACGCGGCTGA
|
Protein sequence | MTASYRESRS LTGPWEFVTD PESDGRADNW YAPDATWPDS RCTVDVPHCW QEHDEYREYT GTAWYRRRTV LPRSGDDRTV LRFGAVDYEA TVWVNGERVG EHRGGYLPFE VDITEAVTDG EDAIAVAVTD PEDISEIPHG KQGAPWYQRV SGIWQNVTVA TVPKCRITDL RATPNLEDDT VRIDLDVAPG GATDLTAAVT IERDGAPVAS SSVDVDAGSG TAVVAIDDPD YWTPETPTLY DVVVELSRDG AVVDRYEDYF GMRSVESRDG RLYLNGDPLS VRGALDQGYY PETLYRPFDE ELFEAEIRTA KELGFNLLRK HIKPAHPDFL ELADRLGILV WEEPANPTVH TERSRREVRD QIRGMIDRDY NRPSVIVWSL YNEEWGIGNP QGLDDETSLW EDEAKQRYLA DLYDSAREWD PTRLICDNSG WAHVATDVND YHRYFVSPDR AAAWEDDLES MTSSPADNYG ATETDPADAP RIVSEFGTWG MCDLPAIEEY YGGEPPWFDY EFFDDPIKRP AGVHDRFEES ALSDVFDGWS AMAEAWQRRE FRSITDVIER MRTREDVAGY VLTELSDIEW EFNGVLDYRR EEKAFHDDFA RINGDILVSV EPDAHVVASG GTLTADVVVS NDTTDAVEGP LEWSVFGESG TVDVELDGFG VERVENAITV DVPSVDGVRD DELTVSVPSA PANGEPITVV PEPSADGETT VYTDAEGLAD GLSETGFEVA SSLDETVDVA LVIRPDAAVR SYVEDGGTAV LVPDESGHMA DEEFFEYRGL PEEESWNLVA SLFYCADETL AEYVDTVPGW ELEGLYPYDV VADVSAEDVL IGYVEGWIAN RSAAVAVREV GDGRVGAFTF RITDAYGTQP VGTAALAALL DDLGSEQRG
|
| |