Gene Huta_0988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0988 
Symbol 
ID8383261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp950528 
End bp953455 
Gene Length2928 bp 
Protein Length975 aa 
Translation table11 
GC content64% 
IMG OID644972052 
Productglycoside hydrolase family 10 
Protein accessionYP_003129904 
Protein GI257052071 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3693] Beta-1,4-xylanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.661558 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGACG ATAGATTACA CGTTCACGGC GACAGGCGGA CGTTCCTGAA GTCCGTCGGG 
GCACTGGGGG CGGCAACTGC AGTCGGGTCG GGGACAATTG GGTCAGTGGC AGCCGATGGC
CACCTCGACG AGTATCACCA GACGCTGCAG AGTGAACTCC AGGAGCAGTA CGACTTGCCG
GCTGGGTCGT TCCTGTTCGG GGCGACCGAG CAGGCGACGA TCGACAGCTT CGAATTCCAG
TCGGGGGATA GCGGGAATCT CTCGGAGATC TCGATCGACA ACGACAGCGT CCCGATCACA
CAGGGCGTCC AAATCGAGGT AAACGAGGAA GGCGCAGACT CCTGGTCGTA CTCCTACCAG
CGGTTCCTCA CGGAGCAGGA CTTCGCGCAA GGTGACGTCC TCCTCGGTGT CGCGTACCTC
CGGAGCGAGT CGGACAACGC GGAGACGGAA GGGTTCTTCA AATATCGGTA CAAGGACGCC
GAAGGCGACG ACTGGAGCTA TCAGAACGCG AACTACATCA CGGACAACTC GGCCGTCCAG
CCCGGCTCGG AGTGGACGCG CTATTACTTC CCGATCGAAG TCGGGTCCCG CCCGGGTTCA
ATCCGGGACG CATACGTCGA ATTCTGGCTC GGGATGGCCC AGCAGACCGT CGAGTTCGGC
GGGATGGCGC TGATCGACTA CAGTGACGCC GACGTCGGGA TCGGCAACCT CCCGAGCGGG
GAGGCGGTGC CGCCCGAGGA GAGCAGTGGC TATCAGATCT GGACGGACAC TGACGACCCG
TACTACTCGG ATCTCGTCAG CGACCTCAAG GGGTACAACC TCGGTGGGGC GGGTAAGTTC
GCCTACGGGA CGACCGAAGC CGCGACCTTC GACGCCTACG AGGTCGCCGG CGGCAGTTCC
GACCTCGCCA ACCAGGAGTC CATCGATGTC GGCGACGACG TCCCGTTCTC GGAGGCGACC
CGGATCGAAG TGACCGAGCA AGCCGACGAC GACTGGCTCG TAAACCTCAA GGCGTACGGC
GATCGGGCAC TCGAGAGTGG TGACGCGTTG CTCGGCGTCG CGTACATGCG CGCCCCCGAG
GGAGACACAT CGATCACCTA CAAGATGACC TCCTCGGGTG ACGAGTCGGC CAACTACGTC
ACCAAGCCGC GCCCGCCGCT CACCGGCGAG TGGAAGCGGT TCTACTTCCC GATCGAGGCC
GGAAGCGCCG CCGCATCGGG CGAGTGGTGG ACCGAGATCT GGCTCGGCGC ACAGGCCCAG
ACCGTCGACA TCGGCGGCCT CGCCGTGGTC GACTTCGCCA AGGGTGTCTC GGTCGGTGAC
CTCCCTGCCT GGGAGCAAGA GATCAACGAG GAATGGGAAG ACGAAGCCGA TGCTCGGATC
GAGGAACACC GCAAGACCGA CGTCGCGGTC GAAGTCGTCG ACGGCGACGG CAGCGCCGTC
GAGGGGGCCG ACGTCGAGGT CGCGATGCAG GAACACGACT TCAGCTTCGG CACCGAGGTC
ACGGCCGACC ACCTGATCCA GAACACCGAA CCGGGTGATC AATATCGACA GGTCATCACG
GAGAATTTCA ACACCGCCGT CCTGGGCAAC CATCACAAGT GGCGCTTCTT CGAGGAGGCA
CAGGACATCG CCGACTCGGC GACCGAATGG CTCGTCGAGC AGAACATGCG GATCCGCGGG
CACGTCTGTC TGTGGGCAGC CGTCGACTCC TACGCCGTGC CGGAAGACGT CGTCGCGGCG
ATGGGGCGCG AGTGGTCGGA AGTCGAGAAT CCCGAGCTCG ATCCGGAGTA CGTCCGCGAT
CGGACGATGT CTCACATCGA GGAGATCATC AACCACTACG CGGACTTCCA GGACTACGGC
AGCGTCATCG ACGAGTGGGA GGTTCACAAC GAGACCACCC ACGTACCCGG ATTCATCAAG
GCGGTCCGCG GTGTCGGTCC CGACGAGGAA CTCGACATCA ACGCCGTCGA AGCACCGGTC
CTCGCCGAGT GGCACAACCA CGCCGAGGAT GTCGCCCCCG ACGACGTCGG GATCGCGATC
AACGACTACA ACACCATCGA GGGGCCGTAT CAGTCGACGC GTGACAACCA CAAGCGGATG
GCCGAGTTCC TGATCGAGAA CGACGTCGAT CTCGACGGGA TCGGCCTCCA GAGCCACTTC
AGCCAATCGT CGGCACTCAC GCCATCCGAG ATCTGGGAGG CCCTGGAGTT CTACAGCGGC
CTCGGTGCCG GCATCCGGAT CACCGAGTTC GACATGGCCG ACGACACCTG GATGGAAGCC
GACAAGGCCA CCTTCTTCAA GCAGTTCCTG AAGATAACGT TCAGCCATCC GAACGCGGAG
ACCTTCATGG TGTGGGGCTT CCAGGACTCC CTCCACTGGC GGGACGACGC TCCGTTCTTC
GACTCCCAGT GGAACCCCAA GCCGGCCCTG GACGTCTGGC AGAACCTCAT CTTCGACGAG
TGGTGGACCG AGGAATCCGG CAGCACGGAC GCCGACGGAA TGTTCGCGAC GGACGCCTTC
AAGGGCACGT ACCACATCAC CGCGACCCAC GGCGAGGAGA CCGTCGAGCG CGAGGTCGAG
ATCAGCGACG ACACCGACAC CCTGACGATG ACGGTCGGCG AAGGCGATGG CGAGCAAGAC
GACGGCGAAG CAGACGACGG CGAAACAGAC GACGGAGAGG ACGATGGCGA AGAAACGCCG
CCCGGTGCAC TGCCCGGCGG CAAGGGAGCC CCCCAGGACC TCGACGGTGA CGGCCTCCAC
GAGGACGTCA ACGGCGACGG CAACGCTAAC ATCGCCGACG TCCGGTCGCT GCTGAACAAT
CGTAACAACG AGATGGTCCA GTCGAACGCC GACGCCTACG ACTTCACTGG AGACGGCAAA
GTCGGCGTCA GCGACGTCCT GGAGCTGTTC CGGAAGCTCT ACCGGTAA
 
Protein sequence
MTDDRLHVHG DRRTFLKSVG ALGAATAVGS GTIGSVAADG HLDEYHQTLQ SELQEQYDLP 
AGSFLFGATE QATIDSFEFQ SGDSGNLSEI SIDNDSVPIT QGVQIEVNEE GADSWSYSYQ
RFLTEQDFAQ GDVLLGVAYL RSESDNAETE GFFKYRYKDA EGDDWSYQNA NYITDNSAVQ
PGSEWTRYYF PIEVGSRPGS IRDAYVEFWL GMAQQTVEFG GMALIDYSDA DVGIGNLPSG
EAVPPEESSG YQIWTDTDDP YYSDLVSDLK GYNLGGAGKF AYGTTEAATF DAYEVAGGSS
DLANQESIDV GDDVPFSEAT RIEVTEQADD DWLVNLKAYG DRALESGDAL LGVAYMRAPE
GDTSITYKMT SSGDESANYV TKPRPPLTGE WKRFYFPIEA GSAAASGEWW TEIWLGAQAQ
TVDIGGLAVV DFAKGVSVGD LPAWEQEINE EWEDEADARI EEHRKTDVAV EVVDGDGSAV
EGADVEVAMQ EHDFSFGTEV TADHLIQNTE PGDQYRQVIT ENFNTAVLGN HHKWRFFEEA
QDIADSATEW LVEQNMRIRG HVCLWAAVDS YAVPEDVVAA MGREWSEVEN PELDPEYVRD
RTMSHIEEII NHYADFQDYG SVIDEWEVHN ETTHVPGFIK AVRGVGPDEE LDINAVEAPV
LAEWHNHAED VAPDDVGIAI NDYNTIEGPY QSTRDNHKRM AEFLIENDVD LDGIGLQSHF
SQSSALTPSE IWEALEFYSG LGAGIRITEF DMADDTWMEA DKATFFKQFL KITFSHPNAE
TFMVWGFQDS LHWRDDAPFF DSQWNPKPAL DVWQNLIFDE WWTEESGSTD ADGMFATDAF
KGTYHITATH GEETVEREVE ISDDTDTLTM TVGEGDGEQD DGEADDGETD DGEDDGEETP
PGALPGGKGA PQDLDGDGLH EDVNGDGNAN IADVRSLLNN RNNEMVQSNA DAYDFTGDGK
VGVSDVLELF RKLYR