Gene Information |
Locus tag | Htur_3892 |
Symbol | |
ID | 8744520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 128183 |
End bp | 130192 |
Gene Length | 2010 bp |
Protein Length | 669 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646514476 |
Product | Beta-galactosidase |
Protein accession | YP_003405423 |
Protein GI | 284167145 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.113241 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAATCG GAGTCTGTTA CTTCCCGGAG CACTGGCCGC GCGAACAGTG GGAGACGGAT GTTCGGCAGA TGGCCGACGC TGGAATCGAG TACGTCCGGA TGGCGGAGTT CTCGTGGCGG GTCCTCGAGC CCGAACGCGG CGCGTTCGAC TTCGAGTGGC TCGACGAGAT CGTCGAGCTG ATTGGCGAGT ACGGGATGCA GGCCGTCCTG TGTACGCCGA CGGCGGCACC GCCGCGGTGG CTCGTCGAGG AACGCCCCGA GATCCGCCAG CGAGACCGTG ACGGAACCGT CAGGGACGTC GGAAGCCGCC GGCACTACTG TTTCAACTCC GCGGCCTATC GCGAGGAGAC CGAACGCGTC GTCCGAGCGA TGGCGGAGCG CTACGCCGAC GATCCGCGCG TCGTCGGCTG GCAGACCGAC AACGAGTACG GCTGTCACGG CACGACACGG TGTTACTGCG ACGACTGCGC TGACGCGTTC CGGGACTGGG TCCGCGAGGA GTACGAGACC GTCGACGAAC TCAACGAAGC GTGGGGAACG ACGTTCTGGA GCCAACAGTA CGACGACTTC GAACAGGTCG ACCTCCCGCG GCCGACGCCC GCGCAGGACC ATCCGGCGAT GCTGCTCGAT TTCGCCCGAT TCTCGAGCGA CAGCGTCGTC AAGTACAATC GGCTGCAGGC GGACCTGCTT CGCGAGGCGA ACGACGAGTG GTTCGTCACG CACAACTTCA TGAACCTGTT CGAGTCGGTG GACACCTACG ACTTCGACGA GGACCTCGAT CTGATTTCCT GGGACTCGTA CCCGACAGGC CACGTCCAGC AGGCCGGCGG CGAGACGACG ACCGACGAGC TCCGCGCGGG GAACCCCGAT CTGCTCTCGT TCAACCACGA CCTCTACCGG AGCGTACTCG ACCGACCGTT CTGGGTGATG GAACAGCAAC CGGGCGACGT TAACTGGCCG CCCCACTCGA CCCAGCCCGC GGAGGGGGCG ATGCGCCTCT GGGCCCACCA CGCGACCGCC CACGGCGCCG ACGCCGTCGT CTACTTCCGC TGGCGGCGCT GCCTCGAGGG CCAGGAGCAG TACCACGCCG GCCTCCGCAA GCAGGACGGG TCGGCGGATC GGGGCTACGA CGACGCGACG CGGGCCGCCG AGGAACTGTT CGACCTCGAC CACGTCGACG CGCCCGTCGC CCTGCTTCAC GACTACGAGA ACGCGTGGGC GCTCGGCGAA CAGCCCCACG CGCCCGACTT CGACTACTGG CAGCTGTTGC AGTCGTTCTA CGCGTCGCTG CGAGCGCACG GCGTGCAGGT CGATATCGTC CATCCCGAGA GCGACCTCGA GTCCTACGAC GCGGTCGTCG CGCCGACGCT CCACCTGGCG ACCGAGTCGC TGGCCGACCA CCTGACCGCG TACGTCGAAT CCGGCGGCGA ACTGCTGCTC GGCCCGCGGA CGGGGGTCAA AGACGCGCAC AATAAGCTCC GTCCCGATCT CCAGCCCGGT CCGCTGTCGG AGCTCGTCGG CGCGAGCGTC GACCAACACG AGTCGCTCCC GACGCAGTTC GAGCCGACCG TCGCCGGAAC CGACGGGACG AACGCCGAGT ACGCGTTCCG AACGTGGGCC GAGTGGCTCG AGGCCGACGC GGCCGAGCCG CTACTCGAGT ACGCGGGCGA CGATATCGAA GGCGGACGGA CGGCGGCCGT CCGAAACGCG GTCGGTGAGG GAAGCGTCGT CTACTGCGGT GTCTGGCCCG AGACCGACCT CGCGAACGAC CTCGTCGGGT CGCTCCTCGA CCGCGCCGGC 
GTCCGCCGGA TGGACGTGCT CCCCGACGGC GTTCGCGTCG CCCGACGCGA CGGCCACACC TGGGTGCTGA ACTTCGGGAG CGACCCGATC GCGGTGACCC TCGAGGGGGA CGCGTCGTGG CGACTCGGCG GTCCGGAAAT CGGTCCGTTC GATCTCGCGA TCGCCGAGAC CAACGCGGTC GACGACCTCT CGGTACGGAT CCGAGACTAG
|
Protein sequence | MSIGVCYFPE HWPREQWETD VRQMADAGIE YVRMAEFSWR VLEPERGAFD FEWLDEIVEL IGEYGMQAVL CTPTAAPPRW LVEERPEIRQ RDRDGTVRDV GSRRHYCFNS AAYREETERV VRAMAERYAD DPRVVGWQTD NEYGCHGTTR CYCDDCADAF RDWVREEYET VDELNEAWGT TFWSQQYDDF EQVDLPRPTP AQDHPAMLLD FARFSSDSVV KYNRLQADLL REANDEWFVT HNFMNLFESV DTYDFDEDLD LISWDSYPTG HVQQAGGETT TDELRAGNPD LLSFNHDLYR SVLDRPFWVM EQQPGDVNWP PHSTQPAEGA MRLWAHHATA HGADAVVYFR WRRCLEGQEQ YHAGLRKQDG SADRGYDDAT RAAEELFDLD HVDAPVALLH DYENAWALGE QPHAPDFDYW QLLQSFYASL RAHGVQVDIV HPESDLESYD AVVAPTLHLA TESLADHLTA YVESGGELLL GPRTGVKDAH NKLRPDLQPG PLSELVGASV DQHESLPTQF EPTVAGTDGT NAEYAFRTWA EWLEADAAEP LLEYAGDDIE GGRTAAVRNA VGEGSVVYCG VWPETDLAND LVGSLLDRAG VRRMDVLPDG VRVARRDGHT WVLNFGSDPI AVTLEGDASW RLGGPEIGPF DLAIAETNAV DDLSVRIRD
|
| |
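The length fields in this record can be cross-checked against the coordinates with a little arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the final codon is a stop codon (the gene sequence above ends in TAG). The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 128183
END_BP = 130192


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G/C bases; spaces used for 10-base grouping are ignored."""
    bases = sequence.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # gene length in bp
print(protein_length(length_bp))  # protein length in aa
```

Passing the full gene sequence string to `gc_fraction` should give a value close to the GC content listed in the record, although the record rounds to a whole percent.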