Gene Information |
Locus tag | Huta_1163 |
Symbol | |
ID | 8383438 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 1138882 |
End bp | 1141170 |
Gene Length | 2289 bp |
Protein Length | 762 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644972222 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003130072 |
Protein GI | 257052239 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGAAG ACCGGCCAAC AGACGAGGAT CTGTCGATGA ACGACGATCC ACCGTACATG GATTCGAACC GGTCCACCGG GGATCGGATC GAAGACCTGC TCTCGCGCAT GACGCCCGCG GAGAAAGTCG GCCAGCTCGT CGGCACGGCA CCGACGCTCC GACCAGGTCG TGAAACTGTC TCGGGCATCG CCGAGGCTGT CACCGAGCAC CACCTCGGCG CGGTCTCGCC GTTCGGTCAC GGCGGCTCCC CCTGGGAGAC CCCAGCGGAG TGCGTCGAGG TCGCGAATGC GCTCCAGCGC GAGGCCATCC AGAACACCAG ACTCGGTATC CCGGTACTGT TCTACGTCGA CGCCGACCAC GGCCACGGGT TCGTGAAGGG TACCACCGTC TTCCCACATA ACCTCGGCAT GGCCGCAACC CGCGATCCCG CTCTCGTCGA ACGGGCGGCG AGCGTCACCG CGACTGAAGT CGCCGCGACC GGCGCACACC AGAACCTCAA TCCGGTCGCC GACGTGGGGC GTGAGGCCCG CTGGGGCCGC ATCTACGAGA CCTTCGGCGA GAGTCCCTCC CTCTGCGCGT CGATGAGCGC GGCCGCCGTT CGGGGCTATC AAGGCGACGA TATCGGTGAC GAAGGGAACG TGATCGCGAC GCCGAAACAC TTCCCGGCCT ACAGCGATCC AGTCCGGGGC GAGGACGGCT CGCCGGTCGA CGTCTCCGAG TACACCCTCC GGCGGGTGTT CCGACCGCCG TTCGAGGCTG CGATCGACGC CGGCGCGGGC TCGATCATGC CCGCGTACAA CGAACTCAAC GGGTATCCCG TCCACGGCTC GACGGAGTAT CTGGAAGGCT GGCTCCGGGG AGAACTGGAT TTCGATGGGT ACGTCGTCTC GGACTGGAAC GGCATCAACA TGCTCCATCA CGACCACCGG ACCGCCCGCT CGATGGACGA AGCCGTCTGG CAGGCGACGA CCGCCGGCGT CGACGTCGCG AGCGTCGGCG GGGTCGAACA CGCCGAGCGA TTGCTCGATC TGCTCGAATC GGGGGATCTC TCCGAGAACC GGATCGACGA GAGCGTCCGG CGCGTCCTCG AAGCGAAGTT CCGGCTGGGG CTGTTCGAGG ACCCCTACGT CGAGGCGGAC CGGGTCGAGC AGGTCGGAAC CGACGACCAC CGCGCGGTCG CTCGCGAGGC CGCCCGCGAG TCGATGACGT TGCTCCGGAA CGAAGACGAG GTGCTCCCGC TCGATGCAAG TCTCGACTCG ATCGCTGTCC TCGGCCCGAA CGCCGACAAC CTCCGCAACC AGTTCGGTGG CTGGAGCACC ATCTCCGAGC CCGAACCACC GGGGACGACC ATTCGCGAGG GGATCGAGCG GGCCGTTCCA GTCGAAACGA CGGTCCGGTA CGAACAGGGC GCGTCGATGA CCGAGACCGT CGATCTCGAC GCTGCCCGCG AAGCTGCAGA CGCGAGCGAG GCCGCCGTGG TCGTCGTCGG CGAGACCGGG TATCGCCACG AGTTCCACCG CAGCGAAACC GACCGCGGCG AGTTCCCGAC CCGATCAGAA CTCGAACTTC CCGAGGCACA GCGTGAGTTG CTCGGGGCGG TCCGAGAAAC CGGAACGCCG ACCGTCGCCG TCTTCGTCGC CGGCCGCCCG CTCGCCATGG AGTGGACGGC CGAGCACGTG CCGGCGATCC TGTTCGCCTA CCTGCCCGGC TCAGAGGGCG GGAACGCAGT CGCCGACGTG CTCTTCGGCG ACGCGGACCC CGGCGGGTCG CTGCCGGTCT CGATTCCGCG GTCGAGTGGT 
CACCTGCCGA CCCATTTCGA CTACCGCCCA CACCCCCATC CCATCGAAGG CAGCCCCCGC GAGGAGAACC CGCGCCCGCC GGAGCATCCC GAGACATACG ATCCGCTGTT TCCATTCGGC CACGGCCTGA GCTACGCGGC CTTCGAGGCC GGCGAGCTGT CGGTGTCGAC GGAGCGGGTC GGCCCGGAGG GAAGTCTCAC GACGACCGTC GCGGTCGAAA ACGTCAGCGA CCGAGGGGGA TCGACGACGC TGCACCTCTA TGGGACCGAC GAGTTCAGTT CCCGGGTAAC GCCCGTCCGG GAACTGGTCG GCTTCCAGCG GGTCGAGCTA GCGGCCGGCG AAGGGACCGA AGTGACCTTC GAGATCAACT TGGCGGATCT GGGTGTCCTC ACGGAGAACG GTGAGCGGCG CGCGGAAGCC GGATCTATTA CGCTGTCGTG CGCTGGCGAG TCCGTCGAAG TCGTCGTCGA GGGCCGATTC GACCGCTGA
|
Protein sequence | MDEDRPTDED LSMNDDPPYM DSNRSTGDRI EDLLSRMTPA EKVGQLVGTA PTLRPGRETV SGIAEAVTEH HLGAVSPFGH GGSPWETPAE CVEVANALQR EAIQNTRLGI PVLFYVDADH GHGFVKGTTV FPHNLGMAAT RDPALVERAA SVTATEVAAT GAHQNLNPVA DVGREARWGR IYETFGESPS LCASMSAAAV RGYQGDDIGD EGNVIATPKH FPAYSDPVRG EDGSPVDVSE YTLRRVFRPP FEAAIDAGAG SIMPAYNELN GYPVHGSTEY LEGWLRGELD FDGYVVSDWN GINMLHHDHR TARSMDEAVW QATTAGVDVA SVGGVEHAER LLDLLESGDL SENRIDESVR RVLEAKFRLG LFEDPYVEAD RVEQVGTDDH RAVAREAARE SMTLLRNEDE VLPLDASLDS IAVLGPNADN LRNQFGGWST ISEPEPPGTT IREGIERAVP VETTVRYEQG ASMTETVDLD AAREAADASE AAVVVVGETG YRHEFHRSET DRGEFPTRSE LELPEAQREL LGAVRETGTP TVAVFVAGRP LAMEWTAEHV PAILFAYLPG SEGGNAVADV LFGDADPGGS LPVSIPRSSG HLPTHFDYRP HPHPIEGSPR EENPRPPEHP ETYDPLFPFG HGLSYAAFEA GELSVSTERV GPEGSLTTTV AVENVSDRGG STTLHLYGTD EFSSRVTPVR ELVGFQRVEL AAGEGTEVTF EINLADLGVL TENGERRAEA GSITLSCAGE SVEVVVEGRF DR
|
| |
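The coordinate, length, and GC fields in the record above are related by simple arithmetic, and the following minimal Python sketch shows one way to cross-check them. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (both assumptions are consistent with the 2289 bp and 762 aa values listed, but are not stated by the record itself). The helper names `clean_sequence`, `gene_length`, `protein_length`, and `gc_percent` are hypothetical and introduced only for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Strip the spaces and line breaks that group the sequence into blocks of 10."""
    return "".join(raw.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a (possibly space-grouped) nucleotide string."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Values copied from the Start bp / End bp fields of the record.
    length = gene_length(1138882, 1141170)
    print("gene length (bp):", length)
    print("protein length (aa):", protein_length(length))
    # Only the first two 10-base blocks of the gene sequence, as a small demo;
    # paste the full "Gene sequence" field to compare against the GC content field.
    print("GC % of first 20 bases:", gc_percent("ATGGACGAAG ACCGGCCAAC"))
```

Pasting the full gene sequence field into `gc_percent` gives a figure to compare against the rounded GC content listed in the record. The full sequence can be passed in as-is because `clean_sequence` removes the block spacing and line breaks.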