Gene Huta_2836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_2836 
Symbol 
ID8385144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp2908051 
End bp2910324 
Gene Length2274 bp 
Protein Length757 aa 
Translation table11 
GC content65% 
IMG OID644973913 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_003131730 
Protein GI257053897 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.154925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACCC AAGACCCAGA CACGACGACT GATCGGATCG AATCACTGAT CGACCGGCTC 
ACGCTCGAAG AGAAAATAGA CTTCGTCCAC GGTGAGGACG ATCCGGACGA ACGGGCGACA
GGGTTCCTCC CGGGCGTCGA GCGGCTCGAT ATCCCATCGC TCTCGATGGT CGACGGGCCG
CTGGGCGTCC GACCGGGGAC GGCGACCGCG TTCCCGGCGT CGATCGCCCT GGCCGCCTCG
TGGGACGTCG ATCTCGCCCG TGAACAGGGC GCGGCACTCG GTCGGGAGGT GCTGGGTGCC
GACCAGGACG TTCTGCTCGC ACCGGGGTTC AACATCATCC GAGTGCCCCA GTGCGGTCGC
AGCTTCGAGT ACTACAGCGA GGACCCGTAC CTGTCGAGTC GACTCGCCGT CGGGACCATC
GACGGCGTCC AAAAGGACGC CGGCGCGATC GCCACCGCCA AGCACTTCGT CGCCAACAAC
CAGGAGCAGG ACCGTCACGA GGTAAGCGCC GAAGTGAGCG AGCGCGCACT GCGAGAGATC
TACCTGCCGG CCTTCGAGGC GGCGGTCACG GAGGGCGAGG TCGGCTCGGT GATGGCCGCC
TACAACCGGA TCAACGGGAC ATACGCGACC GAACACGAGT GGCTGTTGAG CGACGTCCTC
AAAGACGAGT GGGGCTTTTC GGGCTACGTC GTCAGCGACT GGTGGGCGAC GACCGATGGC
GTGGCAGCCG CCAACGCCGG CCTCGATGTC GACATGCCGG GGATTCCGGT ACCGCAATGG
CACGTCACGG AGAATCGAAT CCACGACGTG ATCGAGGGGC TCCCTGACGC CCTCCCGAAG
CGATCGATTG CCAAACTCGT CTCGACGCCG TGGTTGCCGG AGAACGTGAA TCCGAACCTC
TTCGATCGAA GTCCCTTCGA AGTGCAGCTG CGGGACGCCG TCGAACACGG ACAGGTGGCC
GAGTCGACGC TAGACGAGAA GATCAGACGG GTCCTCGGAC AGATGAACCG TTTCGGGTTG
TTCGACGATG AGCAACCCGA GGGGGCCGTC GACGCAGCCG AGCATCGCGA CCGGTCACGG
CGCGTCGCCG AGCGCGGGGC AGTCGTCCTC CAGAACGACG ACGAGGTGCT CCCACTGTCG
CCCGAGATCG ACTCGATCGC CGTGATCGGC CCGAACGCCG ACACGGCCAA GATCGGCGGC
GGCGGGAGTT CCGCGGTCAC GCCGTCTTCG ACGGTCAGTC CACTGGCAGG GGTCCGTGAG
CGCGTCGACG GCGACACCCG GGTTGCGTTC GCCCGTGGGA CCGAACGGAT CGAGGATCAT
CACGACGCTT CGGAGTCGAT CGTCGATCTC TCGCTGTCGG CTTCGGAAAC GCCAGCGGTC
GACACCGTGC TGGGCAACGA CACACCGGAA CGTGACGATG CCGTCATTGC TGCCCGGCAG
GCGGATGTCG CCGTGGTGGT AGTCCAGGAC GACGCGACCG AGGGCGAGGA TCGATCGCTC
TGGCTGCCGG GCGAGCAGGA TCGACTCGTC GCTGCCGTCG CCGACGCCGC CGACCGGACC
GTCGTCGTCT GTAACACCGC CGGCCCGATC CGGATGCCCT GGGCCGAAGA TGTCGACGGG
ATTGTCGAGA TGTGGTATCC CGGCCAGGAG GACGGACACG CCACGGCGGC GATTCTCTTC
GGCGACAGCG ATCCCGGCGG CCGGTTGCCG GTCACCTTCG GTCGGCGACT CGACGACTAC
CCGGCGGCGA CCGAGGAACG GTATCCGGGG GTCGGGCTAG AAGCCGAGTA TGACGAAGGC
GTCTTCGTCG GCTATCGTCA CTTCGACGAC GAGGGGATCG AACCTCAGTT CGCGTTCGGG
CACGGGCTGA GCTACACCGA CTTCACGTAT TCCGACGTGA CAGTCGAGGC TGACGGCGAA
GAAGGCGCAA CCATCGAGGG CGGCGACGAA CCGGGCGTGA GTGTCGAGGT GACAGTCGAG
AACGTCGGTG ACCGTCCGGG TCGGGATGTC GTACAGGTGT ATCTCGGCCC GGCCGAGGCC
GCAGTCGAAC GACCGCCGAA AGCGCTTGCC GGCTTCGAAC CCATCACACT CGACGCGGGC
GAGACGACAA CAGTCACGCT TTCCATCGAC GCGAGAGCGT TCGCGTACTA CGACGTCGAG
GCGGGCGAGT GGGTCGCAAC TGAAGGGGAA TACACTGTTC TTGTCGGTCG CTCCGCCCAG
GATATTGTCG ACGAAGAGAC AATAGCCATC GAGGAATCGA CGATCGTCGA GTAG
 
Protein sequence
MATQDPDTTT DRIESLIDRL TLEEKIDFVH GEDDPDERAT GFLPGVERLD IPSLSMVDGP 
LGVRPGTATA FPASIALAAS WDVDLAREQG AALGREVLGA DQDVLLAPGF NIIRVPQCGR
SFEYYSEDPY LSSRLAVGTI DGVQKDAGAI ATAKHFVANN QEQDRHEVSA EVSERALREI
YLPAFEAAVT EGEVGSVMAA YNRINGTYAT EHEWLLSDVL KDEWGFSGYV VSDWWATTDG
VAAANAGLDV DMPGIPVPQW HVTENRIHDV IEGLPDALPK RSIAKLVSTP WLPENVNPNL
FDRSPFEVQL RDAVEHGQVA ESTLDEKIRR VLGQMNRFGL FDDEQPEGAV DAAEHRDRSR
RVAERGAVVL QNDDEVLPLS PEIDSIAVIG PNADTAKIGG GGSSAVTPSS TVSPLAGVRE
RVDGDTRVAF ARGTERIEDH HDASESIVDL SLSASETPAV DTVLGNDTPE RDDAVIAARQ
ADVAVVVVQD DATEGEDRSL WLPGEQDRLV AAVADAADRT VVVCNTAGPI RMPWAEDVDG
IVEMWYPGQE DGHATAAILF GDSDPGGRLP VTFGRRLDDY PAATEERYPG VGLEAEYDEG
VFVGYRHFDD EGIEPQFAFG HGLSYTDFTY SDVTVEADGE EGATIEGGDE PGVSVEVTVE
NVGDRPGRDV VQVYLGPAEA AVERPPKALA GFEPITLDAG ETTTVTLSID ARAFAYYDVE
AGEWVATEGE YTVLVGRSAQ DIVDEETIAI EESTIVE