Gene Information |
Locus tag | Htur_2399 |
Symbol | |
ID | 8743006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 2463927 |
End bp | 2466509 |
Gene Length | 2583 bp |
Protein Length | 860 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646512983 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003403950 |
Protein GI | 284165671 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
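The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; neither is stated in the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp / End bp rows of the record.
length = gene_length_bp(2463927, 2466509)
print(length, protein_length_aa(length))  # prints: 2583 860
```

Under those assumptions the result agrees with the Gene Length (2583 bp) and Protein Length (860 aa) rows.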
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGGGA TGTCAACTGA TAACAATCAC GATGGTGTCG ACGAATCGCG TCGAACGTTT ATGAAGGCGA CCGGTGCGGC GACGGCCGCG GCCGGGCTCG GAGCGACCAG TACGGCGGCC GGCGACGAGA AGCGATCGAA GCGACTCGAG GACCTGATCG ACGGGATGAC CATAGAGCAG AAGGTCGGCC AGATGGCCCA GGTGGCGATC GACAACCTCG GCGAGGGGTT CGGTCCCGAC ACCGCGTTCA ACGACCACGA CGACGCGGGC ACGCTCGGTA AACTGTTCTC CGAGCTCCAC GTCGGATCGA TCCTCAACGG CGGCGCCACG GGGCCGACGT TCGACGGCGA GGAGTTCGTC GAGGGGCTCA ACGGCCTCCA GGAATACAAT CTCGAGGTCA ACGAGCCGGC GATCCCGTTC GTCTGGGGCT GTGACGCGCT CCACGGAAAC TGCCTGCTCG ACGGCTGTAC GAGCTTCCCG CAGCGGCTCA ACATGGGTGC GACGCGCGAC GTCGACCTCG TCGAGGCGGC GGCCACCCAC ACCGGCGACT CCGTCGCGGC GATCGGCGGC CACTGGAACT TCGGGCCGAC GCTGGACGTC CTCCGAGACA TGCGCTGGGG TCGCTACTTC GAGGGCCACA GCGAGGACGC GATGCTGCTC GGCGAGATGG GGAGGGCCCG GGCCCGGGGC TTCCAGCGAA ACGAGCGGGT CGCGGCGACG GTCAAACACT TCGCCGGCTA CGGCACTCCG AACACCGGTT CGGACCGGAC CCACGCTCGG ACCTCGATGC GGGACCTGCG GACCCGCCAG TTCGAGCCCT ACCGGCGCGG GCTCGAGGAG GCGAAGACGG TGATGGTCAA CAGCGGCGCG GTCAACGGGA AGCCGGCCCA CGCCTCCTCG TGGCTGCTGA CGACGGTCCT GCGCGACCGG TTCGGCTTCG ACGGCGTGGT GCTGACCGAC TGGGACGACT TCGAGCGGAT GCTCTCGAAC CACGAGTACC TCCCCGACAC CGACGACGGC TGGCGAGAGG CGGTCCGACA GGGCATCGAA GCCGGCGTCG ACATGCACAT GTGCGGCGGC GAGACGGCGC CGACCGAGTT CATCGACACC GTGATCGATC TCGTCGAGAG CGGGGATCTC TCCGAGGCGC GCATCGACGA GTCGGTCCGC CGAATCCTCG AGCTCAAGGC CGATCTCGGG CTGTTCGCGG ACCCGCTCGC GCCCGAAGAC GAGATCGGCG ACATCGTCGG CGGCGCCGCC GATATCTCCG AACAGCTCGC CAAGGAGTCG CTGGTCCTGT TGCAAAACGA GGACGACGCA CTCCCGCTCG CCCTCGAGGA CGTCGACGAC CTGCTGCTGA CCGGGCCCGG CGTCCACGAG GGGACGCCGA ACCGGTTCCT GATGCAACAC GGCGGCTGGA CGCTCGGCTG GCAGGGCATC GAGGACGGGA ACCTGACCGA GGACGGCCCG CGGCCGCGCC AGAACACGAT CGAGGGCGAA CTGACGGCCC GGCTCGGCGA CGGGCTCACG CACGTACCGA CGGAGTACGA GCCCGCGGTC TACGAGAGCC TCTACGAGAA CTTCGACAAC GGCTTCTTCG ACGTCACCGA CGAGCAGGCG GCGGCGATCA GCGAGGCCGC ACCGGGCTCC GACGCCGTCG TCGTCGTCCT CGGCGAGGGG ACGCACAACG AGGGCTTCGG CGACCGGGAC AAGATGCGGT TCCTCGAGGC CCAGCGGGAG CTCGTCGAAC TCGTCGATAG CGAGACGGGC GATGACGTTC CGATCATCGG CGTCATCCTC 
GCCGGGAGCC CGCGCGGCAC GGCGGAGACG TTCCAGCACT TAGATGCCGT ACTCTTCGCC GGCCAGCCGG GCAGCGATAC CGGCGTCGCG GTCGTCGATA CGCTCTTCGG CGACTACAAC CCCTCGGGGA AGCTCCCGTT CACCTGGGAG TCCCATGTCG GACACGTTCC CCAGATCTAC GACGAGTACC CGCCGCGTCA CCCGGACGGG GCGGGCGATC AGATGGTCCA GTTCGAGTTC GGCCACGGCC TCTCCTACAC CGACTGGGAG TACGTCGACC TGTCGCTGGC GACCGATTCG GTTAAAAACC CCGCCTCGAG GCCGACCGTG ACGGCCCACG TCACCGTCGA GAACGCCGGC GAAACGGCCG GCGAGCACGT CGTCGAGATC TACAACACCG AGTCCTACGG CTCCGTGCTC CAGCCTCATC GCCGCCTGAT GGGATTCGAA CGGATCGCCC TCGAGCCGGG CGAGCGCGAG ACCGTCGCCG TCGACCTCGA CCTCTCGACG CTCGAGGTCG TCCCCGGCGA CGTCCCCGGC TGGGGGCCGC GGGTCGTCGA GGCCGGCGAG TACGAACTCG CGGTCGGCGC CGACTGGGGC GTGAACGCGA GCGACGACGC GGACGACGGT GCGACGGCGA CGCTGACCGT CGGGAAGACG GCCTCCATCA CCGATCCCGA GCCGACGCCC GGCCGCTACG ACATCGACGG CGACGGGGAC GAGGACTTCG AGGATGTGAT GGCGCTCCAC CGGCGGCTCA AACATCGAAA ATGGCGACGG TGA
|
Protein sequence | MDGMSTDNNH DGVDESRRTF MKATGAATAA AGLGATSTAA GDEKRSKRLE DLIDGMTIEQ KVGQMAQVAI DNLGEGFGPD TAFNDHDDAG TLGKLFSELH VGSILNGGAT GPTFDGEEFV EGLNGLQEYN LEVNEPAIPF VWGCDALHGN CLLDGCTSFP QRLNMGATRD VDLVEAAATH TGDSVAAIGG HWNFGPTLDV LRDMRWGRYF EGHSEDAMLL GEMGRARARG FQRNERVAAT VKHFAGYGTP NTGSDRTHAR TSMRDLRTRQ FEPYRRGLEE AKTVMVNSGA VNGKPAHASS WLLTTVLRDR FGFDGVVLTD WDDFERMLSN HEYLPDTDDG WREAVRQGIE AGVDMHMCGG ETAPTEFIDT VIDLVESGDL SEARIDESVR RILELKADLG LFADPLAPED EIGDIVGGAA DISEQLAKES LVLLQNEDDA LPLALEDVDD LLLTGPGVHE GTPNRFLMQH GGWTLGWQGI EDGNLTEDGP RPRQNTIEGE LTARLGDGLT HVPTEYEPAV YESLYENFDN GFFDVTDEQA AAISEAAPGS DAVVVVLGEG THNEGFGDRD KMRFLEAQRE LVELVDSETG DDVPIIGVIL AGSPRGTAET FQHLDAVLFA GQPGSDTGVA VVDTLFGDYN PSGKLPFTWE SHVGHVPQIY DEYPPRHPDG AGDQMVQFEF GHGLSYTDWE YVDLSLATDS VKNPASRPTV TAHVTVENAG ETAGEHVVEI YNTESYGSVL QPHRRLMGFE RIALEPGERE TVAVDLDLST LEVVPGDVPG WGPRVVEAGE YELAVGADWG VNASDDADDG ATATLTVGKT ASITDPEPTP GRYDIDGDGD EDFEDVMALH RRLKHRKWRR
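Both sequences are displayed in space-separated blocks of ten characters, so the spacing has to be removed before any analysis. The sketch below is a hedged illustration, not part of the original record: the helper names `normalize` and `gc_percent` are hypothetical, and only the first twenty bases of the gene sequence are used as input. It does not attempt to reproduce the 68% GC content reported for the full gene.

```python
def normalize(seq: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    cleaned = normalize(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First two ten-base blocks of the gene sequence shown above.
prefix = "ATGGACGGGA TGTCAACTGA"
print(normalize(prefix))
print(gc_percent(prefix))
```

Passing the complete gene sequence to `gc_percent` would give a value to compare against the GC content row.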
|
| |