Gene Information |
Locus tag | Huta_1047 |
Symbol | |
ID | 8383321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 1012162 |
End bp | 1014363 |
Gene Length | 2202 bp |
Protein Length | 733 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644972112 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003129963 |
Protein GI | 257052130 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGGAG TCAGCAACAC GGCGCTCACC GAATACGTGG ACGAGCAGCG ACGCGAGGCT ATTGCCGAGC GAGTCGAGGA ACTGCTCGCA GCGATGACCA TCGAGGAAAA AGTCGGGCAG CTAAACCAGC GGTCGGTCAG TTTTGTCACG GGGTCGGAAG ACGACACCGA CGATCTCGAA ACGGCGATCG CCGATGGGGA GGTCGGATCC GTACTCAATG CACAGGGACT CGATGCAAAG CGGCACCTCC AGGAGATCGC CGTCGAGGAA TCCAGACTCG GCATCCCGCT GGTGATGGCT TTCGACGTCA TCCATGGCTA CCGGACGGTG TTCCCGACGC CGCTGGGCCA GGCAGCCAGT TGGGAACCGG ACCTCGCCGA GCGAGCGGAA CGCGTCGCCG CGACGGAAGC CAGCGCCGAC GGCCACCACT GGACGTTCGC CCCGATGGTC GACGTCTCCC GGGACCCACG GTGGGGCCGG GTCATGGAAG GGTCGGGCGA GTCCCCCGTC CTCGGGAGTG CGTTCGCCCG GGCGCGCGTC CGGGGATTCC AGGGCGATGA TCTCGCTGAT ACCGACACAA TGCTCGCCTG CGCGAAGCAC TTCGCCGGGT ACGGCGCGAG CGAGGCCGGG CGGGACTACA ACACCGTCAA CGTCTCCGAG ACGGCGCTCC GAGACAGGCA TCTCCCGCCG TTCGAAGCCG CGGTCGAGAC GGGCGTCGCG ACGGTCATGA ACGCGTTCAA TACCATCGAG CGAATCCCTG CGAGCGGTAA CGAGAGTCTG GTCTCGGGCG TCCTCAAGGG CGAGTGGGGA TTCGAGGGCG CAATGGTGTC CGATTGGGAC TCCTTCGGTG AGCAAATGCC ACACGGTGTC GCGGCGGACG AACGCGAGGC CGCCAAACGC GCCATGCTGG CCGGGTCGGA CGTCGACATG GTGAGTGAAG TCCTGCTCGA GGAGCTGCCC GAGCTCGTCC GCGACGGCGA GGTGCCCGAG TCGCGACTCG ACGACGCCGT CGCGCGCGTC CTCTGGATGA AAGGCCTCCT CGGTCTCTTC GAGGACCCGT ATCAGTACTT CGACGAGGAT CGGCGTGAAG CCGTCACTCG CACGGACGAA CAACGCGAGA CTGCCCGGGA GGTCGCCGAA CGATCGTTCG TCCTGCTGAA AAACGAGGGC GTCCTCCCGC TCGAGGACGA CGCTGAGGTC GGAGTCGTCG GTGCACTGGC CGACAGCGAC GAGGACACCC TCGGCGCGTG GGCGTGGGGC GGCGATCCCG AGGACGTGAC CACGATCCGC GCGGGACTGG ACGATCACTT CGACGGCGTT CCCTACGCGG CTGGCTACGA CCTGCCGGGC GAGGTGACCG ACGAAACGCT GGCTGACGCC CGCGAAGTCG CCGAGGCGTC GGACGTGGTC GTGTGCGTCG TCGGCGAGCC AGCAGACATG ACCGGCGAGG CTGCGAGTCG GGCGCACGTC GATCTGCCCG ACGAGCAGCG TCGGCTGCTG GAAGCCCTCC ACGACACCGG AACGCCGGTG GTCGCGCTGC TGATGAACGG CCGCCCGCTG GCGGTCGAGT GGCTCGACGA GCACCTCCCG GTCATTCTGG ACATCTGGCA TCCGGGCACC GAAGCAGGCC CGGCGGTCGC CCGAGTGCTC GCCGGCGACA CCTCCCCCGG TGGTCACCTG CCGATGAGTG TCCCCTACAC CGAGGGCCAG ATCCCGGTCG CTCACGACCG ACTGCCGACG GGCCGACCGG CGGACCAGGC CGAACGCGAG GAGGAGTACG TCTCGGCATA TCTCGACGTG 
CCCAACGAAC CGCTGTACGC CTTCGGCCAC GGCGAGAGCT ACACCGACTT TGCCTATAGC GACCTCTCGC TGTCGACGGA CACCCTCGTG CCCGGTGCGA CGCTCGAAGC GAGCGTCACC GTCGAGAACA CCGGCGATGT CGCGGGGCGT GACGTTGTCC AGTGGTACGT CCATGACCTC GTCGGCAGTC GGTCACGGCC GGAGAAAGAA CTGATCGCCT TCGAGACAGT GGATCTCGAA CCGGGCGAAT CAGCGACCGT CACGGTCGAG ATCGAGGAGA GCGACCTGGC GTTCTGGACT GCCGAGGAAG CGTGGGCCGC CGAACCCGGC GAATTCGATC TCATGGTCGG CCATGCGGCC GATGACATCG TCGATACCGA GCGCTTCGCG TTCGAAGCGT AG
|
Protein sequence | MTGVSNTALT EYVDEQRREA IAERVEELLA AMTIEEKVGQ LNQRSVSFVT GSEDDTDDLE TAIADGEVGS VLNAQGLDAK RHLQEIAVEE SRLGIPLVMA FDVIHGYRTV FPTPLGQAAS WEPDLAERAE RVAATEASAD GHHWTFAPMV DVSRDPRWGR VMEGSGESPV LGSAFARARV RGFQGDDLAD TDTMLACAKH FAGYGASEAG RDYNTVNVSE TALRDRHLPP FEAAVETGVA TVMNAFNTIE RIPASGNESL VSGVLKGEWG FEGAMVSDWD SFGEQMPHGV AADEREAAKR AMLAGSDVDM VSEVLLEELP ELVRDGEVPE SRLDDAVARV LWMKGLLGLF EDPYQYFDED RREAVTRTDE QRETAREVAE RSFVLLKNEG VLPLEDDAEV GVVGALADSD EDTLGAWAWG GDPEDVTTIR AGLDDHFDGV PYAAGYDLPG EVTDETLADA REVAEASDVV VCVVGEPADM TGEAASRAHV DLPDEQRRLL EALHDTGTPV VALLMNGRPL AVEWLDEHLP VILDIWHPGT EAGPAVARVL AGDTSPGGHL PMSVPYTEGQ IPVAHDRLPT GRPADQAERE EEYVSAYLDV PNEPLYAFGH GESYTDFAYS DLSLSTDTLV PGATLEASVT VENTGDVAGR DVVQWYVHDL VGSRSRPEKE LIAFETVDLE PGESATVTVE IEESDLAFWT AEEAWAAEPG EFDLMVGHAA DDIVDTERFA FEA
|
| |
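
The coordinate and length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported Protein Length excludes the stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1012162
end_bp = 1014363
gene_length_bp = 2202
protein_length_aa = 733

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# The record says the gene is not spliced, so the CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein length.
expected_protein_aa = codons - 1

print("span matches Gene Length:", span_bp == gene_length_bp)
print("span is a multiple of 3:", remainder == 0)
print("codons minus stop matches Protein Length:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the table: the span is 2202 bp, which is 734 codons, or 733 residues plus one stop codon. This matches the gene sequence ending in TAG.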