Gene Information |
Locus tag | Huta_0159 |
Symbol | |
ID | 8382421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 159573 |
End bp | 160997 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644971217 |
Product | glycoside hydrolase family 4 |
Protein accession | YP_003129080 |
Protein GI | 257051247 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.075998 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGTGAGT CCCACGAGGA AGACGGCCGG TTCGAGGACG TGTCGATCGG GTTCGTCGGC GGCGGTTCTC GCGACTGGGC GGGCAAGATG ATGACCGACC TCGCCAGACA GCACACTCTC GAGGGCGAGG TTCGCCTCTA CGACGTCGAC CAGGAGAGCG CCGAACAGAA CGCCCGCCTC GGCGAGCTGA TCCAGGATCG CGAGGAAGCG ATCGCCGAGT GGGACTACCG GGCCGTCCCG TCCCTCGCCG ACGCGCTTTC GGGCGCGGAC GTCGTTGTCC TCTCGACGCA GGACCCGCCG GCCGAGACGT TCGCCCACGA CCTCGACATC CCCGCCGAGT ACGGCATCTA CCAGTCCGTC GGCGACACGG TCGGCCCGGG CGGAACCTTT CGGGCGATGC GGGCCATCCC CCAGTATCGC GAGATCGCGG CCGCGATCCG CGAACACTGT CCCGACGCCT GGGTGCTCAA CTACACCAAC CCGATGACCG TCTGCACCCG GACGCTCTAT GAGGAATTCC CCGATATCAA GGCCGTCGGG CTCTGTCACG AAGTGCTCCA CGTCAAGGAG GACCTCGCCG CCTATGTCGA GAAGCACCGC GACGTCGCGG ACGTCGACGG CGACGACCTC CGGGTGAACG TCAAGGGAAT CAACCACTTC ACCTGGATCG ACGACGTCCG CTTCCGAAGC GAGGGCGTCT TCGACGTGAT CGACGCCGAA CTCGATTCCC AGCTCCCGCT CCCTGGCGGA TTCGAACCCG GCGACCTCGA CGGCGAGACC TTCTACGTCG ACAACGATCA GATCGCGCTG GATCTCTATC GACGCTTCGG GCTCTTCCCC GCCGCGGGCG ACCGCCACCT CGCCGAGTTC GTCCCGTGGT ACCTGAACAT CGACGATCCG CAAGACGTCC AGCGGTGGGG GATCCGCCTT ACGCCGAGCG ACCACCGGAT CGAGCACTGG CCGACGAACG AGCGCCAGCG CGAGCGCCAT CTGGAAGGCA CCGAGGAGTT CGAATTCACC GACACCGGCG AGAAGATGGT CGAGCTCATG ACGGCACTGC TCGGCGGCGA GGAACTGGTC ACGAACGTCA ACCTCCCCAA CCGGGGGCAA CTTTCCGGGG TTCGCGAGGG TGCGATCGTC GAGACCAACG CGCTGGTGAC GGGCGACGAC ATCGTCCCGC ACGCCGCCGG CGACCTGCCG GAGCAGGTCC GGAGCATGGT CAGAACGCAC GTGAGCAATC AGGAGACGCT GATCGAGGCC GGATTCGCTG GCGACCTCGA TCTGGCGTAC CGGGCGTTCC TGAACGATCC ACTCGTGACG CTGCCGCCCG AAGACGCCCG AAGCCTCTTT GTCGACCTCG TCGACGCTGA ACGCCCCTAT CTCACCGACT GGAACCTGGA GGAGGCAACT GTCCTCGAAG CATAA
|
Protein sequence | MCESHEEDGR FEDVSIGFVG GGSRDWAGKM MTDLARQHTL EGEVRLYDVD QESAEQNARL GELIQDREEA IAEWDYRAVP SLADALSGAD VVVLSTQDPP AETFAHDLDI PAEYGIYQSV GDTVGPGGTF RAMRAIPQYR EIAAAIREHC PDAWVLNYTN PMTVCTRTLY EEFPDIKAVG LCHEVLHVKE DLAAYVEKHR DVADVDGDDL RVNVKGINHF TWIDDVRFRS EGVFDVIDAE LDSQLPLPGG FEPGDLDGET FYVDNDQIAL DLYRRFGLFP AAGDRHLAEF VPWYLNIDDP QDVQRWGIRL TPSDHRIEHW PTNERQRERH LEGTEEFEFT DTGEKMVELM TALLGGEELV TNVNLPNRGQ LSGVREGAIV ETNALVTGDD IVPHAAGDLP EQVRSMVRTH VSNQETLIEA GFAGDLDLAY RAFLNDPLVT LPPEDARSLF VDLVDAERPY LTDWNLEEAT VLEA
|
| |
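The length fields in this record follow from the coordinates and the sequence, and can be cross-checked with a short script. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; both assumptions agree with the values listed above. The function names (gene_length, protein_length, gc_percent) are hypothetical helpers written for this illustration, not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Coordinates taken from the record above (Start bp, End bp).
length = gene_length(159573, 160997)
print(length)                  # 1425
print(protein_length(length))  # 474
```

Passing the full Gene sequence string, spaces included, to gc_percent gives a value to compare against the GC content field.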