Gene Huta_0159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0159 
Symbol 
ID8382421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp159573 
End bp160997 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content66% 
IMG OID644971217 
Productglycoside hydrolase family 4 
Protein accessionYP_003129080 
Protein GI257051247 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.075998 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGTGAGT CCCACGAGGA AGACGGCCGG TTCGAGGACG TGTCGATCGG GTTCGTCGGC 
GGCGGTTCTC GCGACTGGGC GGGCAAGATG ATGACCGACC TCGCCAGACA GCACACTCTC
GAGGGCGAGG TTCGCCTCTA CGACGTCGAC CAGGAGAGCG CCGAACAGAA CGCCCGCCTC
GGCGAGCTGA TCCAGGATCG CGAGGAAGCG ATCGCCGAGT GGGACTACCG GGCCGTCCCG
TCCCTCGCCG ACGCGCTTTC GGGCGCGGAC GTCGTTGTCC TCTCGACGCA GGACCCGCCG
GCCGAGACGT TCGCCCACGA CCTCGACATC CCCGCCGAGT ACGGCATCTA CCAGTCCGTC
GGCGACACGG TCGGCCCGGG CGGAACCTTT CGGGCGATGC GGGCCATCCC CCAGTATCGC
GAGATCGCGG CCGCGATCCG CGAACACTGT CCCGACGCCT GGGTGCTCAA CTACACCAAC
CCGATGACCG TCTGCACCCG GACGCTCTAT GAGGAATTCC CCGATATCAA GGCCGTCGGG
CTCTGTCACG AAGTGCTCCA CGTCAAGGAG GACCTCGCCG CCTATGTCGA GAAGCACCGC
GACGTCGCGG ACGTCGACGG CGACGACCTC CGGGTGAACG TCAAGGGAAT CAACCACTTC
ACCTGGATCG ACGACGTCCG CTTCCGAAGC GAGGGCGTCT TCGACGTGAT CGACGCCGAA
CTCGATTCCC AGCTCCCGCT CCCTGGCGGA TTCGAACCCG GCGACCTCGA CGGCGAGACC
TTCTACGTCG ACAACGATCA GATCGCGCTG GATCTCTATC GACGCTTCGG GCTCTTCCCC
GCCGCGGGCG ACCGCCACCT CGCCGAGTTC GTCCCGTGGT ACCTGAACAT CGACGATCCG
CAAGACGTCC AGCGGTGGGG GATCCGCCTT ACGCCGAGCG ACCACCGGAT CGAGCACTGG
CCGACGAACG AGCGCCAGCG CGAGCGCCAT CTGGAAGGCA CCGAGGAGTT CGAATTCACC
GACACCGGCG AGAAGATGGT CGAGCTCATG ACGGCACTGC TCGGCGGCGA GGAACTGGTC
ACGAACGTCA ACCTCCCCAA CCGGGGGCAA CTTTCCGGGG TTCGCGAGGG TGCGATCGTC
GAGACCAACG CGCTGGTGAC GGGCGACGAC ATCGTCCCGC ACGCCGCCGG CGACCTGCCG
GAGCAGGTCC GGAGCATGGT CAGAACGCAC GTGAGCAATC AGGAGACGCT GATCGAGGCC
GGATTCGCTG GCGACCTCGA TCTGGCGTAC CGGGCGTTCC TGAACGATCC ACTCGTGACG
CTGCCGCCCG AAGACGCCCG AAGCCTCTTT GTCGACCTCG TCGACGCTGA ACGCCCCTAT
CTCACCGACT GGAACCTGGA GGAGGCAACT GTCCTCGAAG CATAA
 
Protein sequence
MCESHEEDGR FEDVSIGFVG GGSRDWAGKM MTDLARQHTL EGEVRLYDVD QESAEQNARL 
GELIQDREEA IAEWDYRAVP SLADALSGAD VVVLSTQDPP AETFAHDLDI PAEYGIYQSV
GDTVGPGGTF RAMRAIPQYR EIAAAIREHC PDAWVLNYTN PMTVCTRTLY EEFPDIKAVG
LCHEVLHVKE DLAAYVEKHR DVADVDGDDL RVNVKGINHF TWIDDVRFRS EGVFDVIDAE
LDSQLPLPGG FEPGDLDGET FYVDNDQIAL DLYRRFGLFP AAGDRHLAEF VPWYLNIDDP
QDVQRWGIRL TPSDHRIEHW PTNERQRERH LEGTEEFEFT DTGEKMVELM TALLGGEELV
TNVNLPNRGQ LSGVREGAIV ETNALVTGDD IVPHAAGDLP EQVRSMVRTH VSNQETLIEA
GFAGDLDLAY RAFLNDPLVT LPPEDARSLF VDLVDAERPY LTDWNLEEAT VLEA