Gene Information |
Locus tag | Huta_2836 |
Symbol | |
ID | 8385144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 2908051 |
End bp | 2910324 |
Gene Length | 2274 bp |
Protein Length | 757 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644973913 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003131730 |
Protein GI | 257053897 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
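The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Huta_2836.
length_bp = gene_length(2908051, 2910324)
print(length_bp)                  # 2274
print(protein_length(length_bp))  # 757
```

Both results match the Gene Length and Protein Length rows of the record.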
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.154925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACCC AAGACCCAGA CACGACGACT GATCGGATCG AATCACTGAT CGACCGGCTC ACGCTCGAAG AGAAAATAGA CTTCGTCCAC GGTGAGGACG ATCCGGACGA ACGGGCGACA GGGTTCCTCC CGGGCGTCGA GCGGCTCGAT ATCCCATCGC TCTCGATGGT CGACGGGCCG CTGGGCGTCC GACCGGGGAC GGCGACCGCG TTCCCGGCGT CGATCGCCCT GGCCGCCTCG TGGGACGTCG ATCTCGCCCG TGAACAGGGC GCGGCACTCG GTCGGGAGGT GCTGGGTGCC GACCAGGACG TTCTGCTCGC ACCGGGGTTC AACATCATCC GAGTGCCCCA GTGCGGTCGC AGCTTCGAGT ACTACAGCGA GGACCCGTAC CTGTCGAGTC GACTCGCCGT CGGGACCATC GACGGCGTCC AAAAGGACGC CGGCGCGATC GCCACCGCCA AGCACTTCGT CGCCAACAAC CAGGAGCAGG ACCGTCACGA GGTAAGCGCC GAAGTGAGCG AGCGCGCACT GCGAGAGATC TACCTGCCGG CCTTCGAGGC GGCGGTCACG GAGGGCGAGG TCGGCTCGGT GATGGCCGCC TACAACCGGA TCAACGGGAC ATACGCGACC GAACACGAGT GGCTGTTGAG CGACGTCCTC AAAGACGAGT GGGGCTTTTC GGGCTACGTC GTCAGCGACT GGTGGGCGAC GACCGATGGC GTGGCAGCCG CCAACGCCGG CCTCGATGTC GACATGCCGG GGATTCCGGT ACCGCAATGG CACGTCACGG AGAATCGAAT CCACGACGTG ATCGAGGGGC TCCCTGACGC CCTCCCGAAG CGATCGATTG CCAAACTCGT CTCGACGCCG TGGTTGCCGG AGAACGTGAA TCCGAACCTC TTCGATCGAA GTCCCTTCGA AGTGCAGCTG CGGGACGCCG TCGAACACGG ACAGGTGGCC GAGTCGACGC TAGACGAGAA GATCAGACGG GTCCTCGGAC AGATGAACCG TTTCGGGTTG TTCGACGATG AGCAACCCGA GGGGGCCGTC GACGCAGCCG AGCATCGCGA CCGGTCACGG CGCGTCGCCG AGCGCGGGGC AGTCGTCCTC CAGAACGACG ACGAGGTGCT CCCACTGTCG CCCGAGATCG ACTCGATCGC CGTGATCGGC CCGAACGCCG ACACGGCCAA GATCGGCGGC GGCGGGAGTT CCGCGGTCAC GCCGTCTTCG ACGGTCAGTC CACTGGCAGG GGTCCGTGAG CGCGTCGACG GCGACACCCG GGTTGCGTTC GCCCGTGGGA CCGAACGGAT CGAGGATCAT CACGACGCTT CGGAGTCGAT CGTCGATCTC TCGCTGTCGG CTTCGGAAAC GCCAGCGGTC GACACCGTGC TGGGCAACGA CACACCGGAA CGTGACGATG CCGTCATTGC TGCCCGGCAG GCGGATGTCG CCGTGGTGGT AGTCCAGGAC GACGCGACCG AGGGCGAGGA TCGATCGCTC TGGCTGCCGG GCGAGCAGGA TCGACTCGTC GCTGCCGTCG CCGACGCCGC CGACCGGACC GTCGTCGTCT GTAACACCGC CGGCCCGATC CGGATGCCCT GGGCCGAAGA TGTCGACGGG ATTGTCGAGA TGTGGTATCC CGGCCAGGAG GACGGACACG CCACGGCGGC GATTCTCTTC GGCGACAGCG ATCCCGGCGG CCGGTTGCCG GTCACCTTCG GTCGGCGACT CGACGACTAC CCGGCGGCGA CCGAGGAACG GTATCCGGGG GTCGGGCTAG AAGCCGAGTA TGACGAAGGC 
GTCTTCGTCG GCTATCGTCA CTTCGACGAC GAGGGGATCG AACCTCAGTT CGCGTTCGGG CACGGGCTGA GCTACACCGA CTTCACGTAT TCCGACGTGA CAGTCGAGGC TGACGGCGAA GAAGGCGCAA CCATCGAGGG CGGCGACGAA CCGGGCGTGA GTGTCGAGGT GACAGTCGAG AACGTCGGTG ACCGTCCGGG TCGGGATGTC GTACAGGTGT ATCTCGGCCC GGCCGAGGCC GCAGTCGAAC GACCGCCGAA AGCGCTTGCC GGCTTCGAAC CCATCACACT CGACGCGGGC GAGACGACAA CAGTCACGCT TTCCATCGAC GCGAGAGCGT TCGCGTACTA CGACGTCGAG GCGGGCGAGT GGGTCGCAAC TGAAGGGGAA TACACTGTTC TTGTCGGTCG CTCCGCCCAG GATATTGTCG ACGAAGAGAC AATAGCCATC GAGGAATCGA CGATCGTCGA GTAG
|
Protein sequence | MATQDPDTTT DRIESLIDRL TLEEKIDFVH GEDDPDERAT GFLPGVERLD IPSLSMVDGP LGVRPGTATA FPASIALAAS WDVDLAREQG AALGREVLGA DQDVLLAPGF NIIRVPQCGR SFEYYSEDPY LSSRLAVGTI DGVQKDAGAI ATAKHFVANN QEQDRHEVSA EVSERALREI YLPAFEAAVT EGEVGSVMAA YNRINGTYAT EHEWLLSDVL KDEWGFSGYV VSDWWATTDG VAAANAGLDV DMPGIPVPQW HVTENRIHDV IEGLPDALPK RSIAKLVSTP WLPENVNPNL FDRSPFEVQL RDAVEHGQVA ESTLDEKIRR VLGQMNRFGL FDDEQPEGAV DAAEHRDRSR RVAERGAVVL QNDDEVLPLS PEIDSIAVIG PNADTAKIGG GGSSAVTPSS TVSPLAGVRE RVDGDTRVAF ARGTERIEDH HDASESIVDL SLSASETPAV DTVLGNDTPE RDDAVIAARQ ADVAVVVVQD DATEGEDRSL WLPGEQDRLV AAVADAADRT VVVCNTAGPI RMPWAEDVDG IVEMWYPGQE DGHATAAILF GDSDPGGRLP VTFGRRLDDY PAATEERYPG VGLEAEYDEG VFVGYRHFDD EGIEPQFAFG HGLSYTDFTY SDVTVEADGE EGATIEGGDE PGVSVEVTVE NVGDRPGRDV VQVYLGPAEA AVERPPKALA GFEPITLDAG ETTTVTLSID ARAFAYYDVE AGEWVATEGE YTVLVGRSAQ DIVDEETIAI EESTIVE
|
| |
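The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a string might be normalized before computing a GC fraction comparable to the GC content field; the helper names are hypothetical, and only the first two blocks of the gene sequence are used as sample input.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


# First two ten-base blocks of the gene sequence, as displayed in the record.
fragment = "ATGGCAACCC AAGACCCAGA"
print(normalize(fragment))
print(gc_fraction(fragment))
```

Applying `gc_fraction` to the full gene sequence should give a value close to the reported GC content, assuming that field is computed over the gene itself rather than the whole replicon.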