Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3896 |
Symbol | |
ID | 8744524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 134317 |
End bp | 137172 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646514480 |
Product | glycoside hydrolase family 2 sugar binding protein |
Protein accession | YP_003405427 |
Protein GI | 284167149 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0623338 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGTCCTC CGGAGAAAGG GGTGCTCATG CAATCGATCT CGTTATCGGG ACCGTGGGCT CTGCGGCTCA ATCCCGACCG CGACCGACAC GTCGTTGATC CGCCTGACGA CGTTATCGAC GGGACGATGT ACCTTCCCGG ATCCACCGAC GAATACGGCT ACGGGACAAC CGTCGACGAT CGACCGCGGG ACCACCTCCA ACGGACGCAT CGATACGAGG GGTCGGCGTG GTACCAGCGC ACCGTCTCCG TCCCCGATTC CTGGGCGGGC AAACGCGTGA CACTAACGCT CGAGCGAACC AGACCGACCG AGGTCTGGGT CGACGGGGAG CGTATCGGCT CGCGAGTGTG CCTGAGCACC CCGCACGTAT ACGATCTCAC CGAGGCGATC GGCCCCGGCG AGCACGAAAT CGCCGTTCGA GTGGACAACG CCGACGACTC GATGGATCGT CCGGGAGTCA GGCGATCGCA CGCGAGCACT GAACACACGC AGACCAACTG GAACGGCATC ATCGGGGACC TCCGGCTGGA GGCGACACCG AACGTCTGGA TCGAGAGCGT CGACCCGTTC CCGGAACCGA GCGAGAACGC CGTCGACCTC GAGATAACCC TCGGGAACGC CACGGAGACC GATTTTCAGG GAACCGTCTC TGCCGCTGCC CGGAGTACGA CCACCGATGA GATCCACGTC GCGGACACCG TCGATCGCCC CGTCTCGGTC AGTGCCGGCG ATGGCTCGGC GCCGGGTCGA ACGACTCTCG AGTTCACATA CGATCTCGGA TCCGACGCGC TCACCTGGGA CGAGTTCTCG CCGGCGGTCT ACGAGTTGAC CGTCTCGCTC GAGACGGCAG GGGACGAGAA CGCCGCCGGC CACGAGTTCG AGACCGCGTT CGGCCTCCGC GAGTTCGAAG CCGACGGCAC GCAGTTCTCG ATCAACGGGA CGACGATCTT CCTGCGCGGC CGGACGGACT GCTGTGTCTT CCCCGACACC GCGTACCCGC CGACGACGGT CGCCGAATGG GTCGATCACA TGGAGACCGC CAGAGCGTAC GGCATCAACC ACTATCGGTT CCACAGCTGG TGCCCGCCGG AGGCGGCCTT CGAGGCGGCC GACCGCGTCG GGATGTACCT GCAACCGGAG TGTTCCCAGT GGAACAACGG CACCTCGCTC GCCGACGCCG ACGACTACGA GTACTACGAG CGGGAGGCCG AGCGCATTCT CGACGCCTAT GGACACCACC CCTCGTTCGT CTGCTTCACC CTCGGCAACG AAAATCGGGG CGACGAGGAG CGAATGACCG AACTCGTTCG CCACTGTCGA GCGCTCGACG ACCGACGGCT GTACGCCTAC GGGGCGAACA ATTTTCTTAC CTCACCGCAC CCGGGAGACG CCGACGACTT CTTTGTTACC GCCAACGTCC CCGAAGACGC CGAGGCCAGT CACTGGGACG TCGATCGAAC GCCCATCCGC GGGACCGGAC ACGTCAACGA TTCTCCGCCG TCGACGGCCG TCGACTACGA GGACGAACTC GAGCCCTACG ATCTCCCCGT CGTCGGTCAC GAGATCGGCC AGTACCAGGT CCATCCCGAC TACGACGAGA CGCGCAAGTA TCGCGGCGTC CTCCGGGCTC GCAACCTCGA ACGGTTCGAG CGCTCGCTCG CCGATCGCTA TCTCGAAGGC TCGGACGCGG ATTTTCGGGC CGCGTCCGGC GCGCTCGCAG TCACCTGTTA CCGCGAAGAG ATCGAGGCCG CGTTCCGAAC GGCGGGCTTC GGCGGCTTTC AATTGCTCGG CCTCGACGAC TTTCCGGGCC AGGGGACCGC GATGGTCGGG ATCCTCGACT CCTTCATGGA GTCGAAGGGA CTGATCGAGC CCCACGAGTG GCGTCGCTTC TGTAGCGCAC GCGTCCCGCT CCTCTCGTTC GACCGGTACA CCTTCACGAC CGACGACGCG TTCGAGGCCG AGTCGCTGCT CGCCAATTAC GGCCCCGGCC CGATCGCGGA CGCGACCGCG ACCTGGTCGA TCGCCGAACT CGACGGCACG GAAACCGCTT CCGGAGCCCT CGAGCGAGAC GTCCTCGAGC AGGGGGAGCT GACGTCCCTC GGGACGATCG ACGCCCCGCT CGAGGACGTC GACGCGCCTG CCCGGTTCGA GGTAACGCTC GCGGTCGAGG GAGCGGACCT CGAGGACGGC TCCGCCGTCG ATCTCCGGAC GACCTACCCG ATCTGGGTCT ATCCCGAGAC ACCCGAAATA GCCGCCAACA CAGACGATAT CGACGTTTCC TGGCGCTTCG ACGAGCGAAC TCGCACGCGA CTGGAGGACG GCGAGACGGT CCTCTTACTG CCGGAACCGA GCGCGCTCCG GTACGGTCTC GAGGGCTCGT TCCAGCCGGA CTTCTGGAAC TACGAGATCT TCAAGCAAAA CGGAAAGCCG GGGACCCTCG GAATGACGAC GGACCCCGAT CATCCGCTGT TCGACGCCTT CCCGACCGAG GACTACGCCG ACTGGCAGTG GTGGCCCCTC CTGCGTCACT CGCGGCCGAT CGTTCTCGAC GACGCGCCCG CCGACTTCGA ACCCACTGTA CAGATAATCG ATACGATCTA CCGGAACCAC AAACTCGGCG TCTACTTCGA AACCGCCGTC GGCGACGGAG CGCTCGCCGT CTGCACGCTG GATCTCTCCG GGGACGCTCC CGCCGTCCGG CAGTTTCGAC ACAGCCTCGA GTCCTACCTC GCGTCCGAGG CGTTCGATCC GGATTCCGCC CTCTCTACCG GCGTCCTCGA CAGCCTCCTC AACGCCGGAC GGGACGACGA ACGCGACTAC GGGGACGACG CCGGCGCGTG GGTCGAACGC AACTGA
|
Protein sequence | MCPPEKGVLM QSISLSGPWA LRLNPDRDRH VVDPPDDVID GTMYLPGSTD EYGYGTTVDD RPRDHLQRTH RYEGSAWYQR TVSVPDSWAG KRVTLTLERT RPTEVWVDGE RIGSRVCLST PHVYDLTEAI GPGEHEIAVR VDNADDSMDR PGVRRSHAST EHTQTNWNGI IGDLRLEATP NVWIESVDPF PEPSENAVDL EITLGNATET DFQGTVSAAA RSTTTDEIHV ADTVDRPVSV SAGDGSAPGR TTLEFTYDLG SDALTWDEFS PAVYELTVSL ETAGDENAAG HEFETAFGLR EFEADGTQFS INGTTIFLRG RTDCCVFPDT AYPPTTVAEW VDHMETARAY GINHYRFHSW CPPEAAFEAA DRVGMYLQPE CSQWNNGTSL ADADDYEYYE REAERILDAY GHHPSFVCFT LGNENRGDEE RMTELVRHCR ALDDRRLYAY GANNFLTSPH PGDADDFFVT ANVPEDAEAS HWDVDRTPIR GTGHVNDSPP STAVDYEDEL EPYDLPVVGH EIGQYQVHPD YDETRKYRGV LRARNLERFE RSLADRYLEG SDADFRAASG ALAVTCYREE IEAAFRTAGF GGFQLLGLDD FPGQGTAMVG ILDSFMESKG LIEPHEWRRF CSARVPLLSF DRYTFTTDDA FEAESLLANY GPGPIADATA TWSIAELDGT ETASGALERD VLEQGELTSL GTIDAPLEDV DAPARFEVTL AVEGADLEDG SAVDLRTTYP IWVYPETPEI AANTDDIDVS WRFDERTRTR LEDGETVLLL PEPSALRYGL EGSFQPDFWN YEIFKQNGKP GTLGMTTDPD HPLFDAFPTE DYADWQWWPL LRHSRPIVLD DAPADFEPTV QIIDTIYRNH KLGVYFETAV GDGALAVCTL DLSGDAPAVR QFRHSLESYL ASEAFDPDSA LSTGVLDSLL NAGRDDERDY GDDAGAWVER N
|
| |