Gene Information |
Locus tag | Huta_0988 |
Symbol | |
ID | 8383261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 950528 |
End bp | 953455 |
Gene Length | 2928 bp |
Protein Length | 975 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644972052 |
Product | glycoside hydrolase family 10 |
Protein accession | YP_003129904 |
Protein GI | 257052071 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3693] Beta-1,4-xylanase |
TIGRFAM ID | |
|
|
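The coordinate and length fields above are related by simple arithmetic, sketched below. It assumes 1-based inclusive coordinates and that the CDS ends in a stop codon; the gene sequence in this record ends in TAA. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(950528, 953455)
protein = protein_length_aa(length)
print(length, protein)  # 2928 975
```

Both results agree with the Gene Length (2928 bp) and Protein Length (975 aa) fields listed in the record.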
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.661558 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGACG ATAGATTACA CGTTCACGGC GACAGGCGGA CGTTCCTGAA GTCCGTCGGG GCACTGGGGG CGGCAACTGC AGTCGGGTCG GGGACAATTG GGTCAGTGGC AGCCGATGGC CACCTCGACG AGTATCACCA GACGCTGCAG AGTGAACTCC AGGAGCAGTA CGACTTGCCG GCTGGGTCGT TCCTGTTCGG GGCGACCGAG CAGGCGACGA TCGACAGCTT CGAATTCCAG TCGGGGGATA GCGGGAATCT CTCGGAGATC TCGATCGACA ACGACAGCGT CCCGATCACA CAGGGCGTCC AAATCGAGGT AAACGAGGAA GGCGCAGACT CCTGGTCGTA CTCCTACCAG CGGTTCCTCA CGGAGCAGGA CTTCGCGCAA GGTGACGTCC TCCTCGGTGT CGCGTACCTC CGGAGCGAGT CGGACAACGC GGAGACGGAA GGGTTCTTCA AATATCGGTA CAAGGACGCC GAAGGCGACG ACTGGAGCTA TCAGAACGCG AACTACATCA CGGACAACTC GGCCGTCCAG CCCGGCTCGG AGTGGACGCG CTATTACTTC CCGATCGAAG TCGGGTCCCG CCCGGGTTCA ATCCGGGACG CATACGTCGA ATTCTGGCTC GGGATGGCCC AGCAGACCGT CGAGTTCGGC GGGATGGCGC TGATCGACTA CAGTGACGCC GACGTCGGGA TCGGCAACCT CCCGAGCGGG GAGGCGGTGC CGCCCGAGGA GAGCAGTGGC TATCAGATCT GGACGGACAC TGACGACCCG TACTACTCGG ATCTCGTCAG CGACCTCAAG GGGTACAACC TCGGTGGGGC GGGTAAGTTC GCCTACGGGA CGACCGAAGC CGCGACCTTC GACGCCTACG AGGTCGCCGG CGGCAGTTCC GACCTCGCCA ACCAGGAGTC CATCGATGTC GGCGACGACG TCCCGTTCTC GGAGGCGACC CGGATCGAAG TGACCGAGCA AGCCGACGAC GACTGGCTCG TAAACCTCAA GGCGTACGGC GATCGGGCAC TCGAGAGTGG TGACGCGTTG CTCGGCGTCG CGTACATGCG CGCCCCCGAG GGAGACACAT CGATCACCTA CAAGATGACC TCCTCGGGTG ACGAGTCGGC CAACTACGTC ACCAAGCCGC GCCCGCCGCT CACCGGCGAG TGGAAGCGGT TCTACTTCCC GATCGAGGCC GGAAGCGCCG CCGCATCGGG CGAGTGGTGG ACCGAGATCT GGCTCGGCGC ACAGGCCCAG ACCGTCGACA TCGGCGGCCT CGCCGTGGTC GACTTCGCCA AGGGTGTCTC GGTCGGTGAC CTCCCTGCCT GGGAGCAAGA GATCAACGAG GAATGGGAAG ACGAAGCCGA TGCTCGGATC GAGGAACACC GCAAGACCGA CGTCGCGGTC GAAGTCGTCG ACGGCGACGG CAGCGCCGTC GAGGGGGCCG ACGTCGAGGT CGCGATGCAG GAACACGACT TCAGCTTCGG CACCGAGGTC ACGGCCGACC ACCTGATCCA GAACACCGAA CCGGGTGATC AATATCGACA GGTCATCACG GAGAATTTCA ACACCGCCGT CCTGGGCAAC CATCACAAGT GGCGCTTCTT CGAGGAGGCA CAGGACATCG CCGACTCGGC GACCGAATGG CTCGTCGAGC AGAACATGCG GATCCGCGGG CACGTCTGTC TGTGGGCAGC CGTCGACTCC TACGCCGTGC CGGAAGACGT CGTCGCGGCG ATGGGGCGCG AGTGGTCGGA AGTCGAGAAT CCCGAGCTCG ATCCGGAGTA CGTCCGCGAT 
CGGACGATGT CTCACATCGA GGAGATCATC AACCACTACG CGGACTTCCA GGACTACGGC AGCGTCATCG ACGAGTGGGA GGTTCACAAC GAGACCACCC ACGTACCCGG ATTCATCAAG GCGGTCCGCG GTGTCGGTCC CGACGAGGAA CTCGACATCA ACGCCGTCGA AGCACCGGTC CTCGCCGAGT GGCACAACCA CGCCGAGGAT GTCGCCCCCG ACGACGTCGG GATCGCGATC AACGACTACA ACACCATCGA GGGGCCGTAT CAGTCGACGC GTGACAACCA CAAGCGGATG GCCGAGTTCC TGATCGAGAA CGACGTCGAT CTCGACGGGA TCGGCCTCCA GAGCCACTTC AGCCAATCGT CGGCACTCAC GCCATCCGAG ATCTGGGAGG CCCTGGAGTT CTACAGCGGC CTCGGTGCCG GCATCCGGAT CACCGAGTTC GACATGGCCG ACGACACCTG GATGGAAGCC GACAAGGCCA CCTTCTTCAA GCAGTTCCTG AAGATAACGT TCAGCCATCC GAACGCGGAG ACCTTCATGG TGTGGGGCTT CCAGGACTCC CTCCACTGGC GGGACGACGC TCCGTTCTTC GACTCCCAGT GGAACCCCAA GCCGGCCCTG GACGTCTGGC AGAACCTCAT CTTCGACGAG TGGTGGACCG AGGAATCCGG CAGCACGGAC GCCGACGGAA TGTTCGCGAC GGACGCCTTC AAGGGCACGT ACCACATCAC CGCGACCCAC GGCGAGGAGA CCGTCGAGCG CGAGGTCGAG ATCAGCGACG ACACCGACAC CCTGACGATG ACGGTCGGCG AAGGCGATGG CGAGCAAGAC GACGGCGAAG CAGACGACGG CGAAACAGAC GACGGAGAGG ACGATGGCGA AGAAACGCCG CCCGGTGCAC TGCCCGGCGG CAAGGGAGCC CCCCAGGACC TCGACGGTGA CGGCCTCCAC GAGGACGTCA ACGGCGACGG CAACGCTAAC ATCGCCGACG TCCGGTCGCT GCTGAACAAT CGTAACAACG AGATGGTCCA GTCGAACGCC GACGCCTACG ACTTCACTGG AGACGGCAAA GTCGGCGTCA GCGACGTCCT GGAGCTGTTC CGGAAGCTCT ACCGGTAA
|
Protein sequence | MTDDRLHVHG DRRTFLKSVG ALGAATAVGS GTIGSVAADG HLDEYHQTLQ SELQEQYDLP AGSFLFGATE QATIDSFEFQ SGDSGNLSEI SIDNDSVPIT QGVQIEVNEE GADSWSYSYQ RFLTEQDFAQ GDVLLGVAYL RSESDNAETE GFFKYRYKDA EGDDWSYQNA NYITDNSAVQ PGSEWTRYYF PIEVGSRPGS IRDAYVEFWL GMAQQTVEFG GMALIDYSDA DVGIGNLPSG EAVPPEESSG YQIWTDTDDP YYSDLVSDLK GYNLGGAGKF AYGTTEAATF DAYEVAGGSS DLANQESIDV GDDVPFSEAT RIEVTEQADD DWLVNLKAYG DRALESGDAL LGVAYMRAPE GDTSITYKMT SSGDESANYV TKPRPPLTGE WKRFYFPIEA GSAAASGEWW TEIWLGAQAQ TVDIGGLAVV DFAKGVSVGD LPAWEQEINE EWEDEADARI EEHRKTDVAV EVVDGDGSAV EGADVEVAMQ EHDFSFGTEV TADHLIQNTE PGDQYRQVIT ENFNTAVLGN HHKWRFFEEA QDIADSATEW LVEQNMRIRG HVCLWAAVDS YAVPEDVVAA MGREWSEVEN PELDPEYVRD RTMSHIEEII NHYADFQDYG SVIDEWEVHN ETTHVPGFIK AVRGVGPDEE LDINAVEAPV LAEWHNHAED VAPDDVGIAI NDYNTIEGPY QSTRDNHKRM AEFLIENDVD LDGIGLQSHF SQSSALTPSE IWEALEFYSG LGAGIRITEF DMADDTWMEA DKATFFKQFL KITFSHPNAE TFMVWGFQDS LHWRDDAPFF DSQWNPKPAL DVWQNLIFDE WWTEESGSTD ADGMFATDAF KGTYHITATH GEETVEREVE ISDDTDTLTM TVGEGDGEQD DGEADDGETD DGEDDGEETP PGALPGGKGA PQDLDGDGLH EDVNGDGNAN IADVRSLLNN RNNEMVQSNA DAYDFTGDGK VGVSDVLELF RKLYR
|
| |