Gene Huta_1163 details

Gene Information

Locus tag: Huta_1163
Symbol:
ID: 8383438
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 1138882
End bp: 1141170
Gene Length: 2289 bp
Protein Length: 762 aa
Translation table: 11
GC content: 67%
IMG OID: 644972222
Product: glycoside hydrolase family 3 domain protein
Protein accession: YP_003130072
Protein GI: 257052239
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
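
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (the helper names `gene_length` and `protein_length` are illustrative and not part of the record):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1138882, 1141170)
print(length_bp)                  # 2289
print(protein_length(length_bp))  # 762
```

Both values agree with the Gene Length and Protein Length fields of the record.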


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGAAG ACCGGCCAAC AGACGAGGAT CTGTCGATGA ACGACGATCC ACCGTACATG 
GATTCGAACC GGTCCACCGG GGATCGGATC GAAGACCTGC TCTCGCGCAT GACGCCCGCG
GAGAAAGTCG GCCAGCTCGT CGGCACGGCA CCGACGCTCC GACCAGGTCG TGAAACTGTC
TCGGGCATCG CCGAGGCTGT CACCGAGCAC CACCTCGGCG CGGTCTCGCC GTTCGGTCAC
GGCGGCTCCC CCTGGGAGAC CCCAGCGGAG TGCGTCGAGG TCGCGAATGC GCTCCAGCGC
GAGGCCATCC AGAACACCAG ACTCGGTATC CCGGTACTGT TCTACGTCGA CGCCGACCAC
GGCCACGGGT TCGTGAAGGG TACCACCGTC TTCCCACATA ACCTCGGCAT GGCCGCAACC
CGCGATCCCG CTCTCGTCGA ACGGGCGGCG AGCGTCACCG CGACTGAAGT CGCCGCGACC
GGCGCACACC AGAACCTCAA TCCGGTCGCC GACGTGGGGC GTGAGGCCCG CTGGGGCCGC
ATCTACGAGA CCTTCGGCGA GAGTCCCTCC CTCTGCGCGT CGATGAGCGC GGCCGCCGTT
CGGGGCTATC AAGGCGACGA TATCGGTGAC GAAGGGAACG TGATCGCGAC GCCGAAACAC
TTCCCGGCCT ACAGCGATCC AGTCCGGGGC GAGGACGGCT CGCCGGTCGA CGTCTCCGAG
TACACCCTCC GGCGGGTGTT CCGACCGCCG TTCGAGGCTG CGATCGACGC CGGCGCGGGC
TCGATCATGC CCGCGTACAA CGAACTCAAC GGGTATCCCG TCCACGGCTC GACGGAGTAT
CTGGAAGGCT GGCTCCGGGG AGAACTGGAT TTCGATGGGT ACGTCGTCTC GGACTGGAAC
GGCATCAACA TGCTCCATCA CGACCACCGG ACCGCCCGCT CGATGGACGA AGCCGTCTGG
CAGGCGACGA CCGCCGGCGT CGACGTCGCG AGCGTCGGCG GGGTCGAACA CGCCGAGCGA
TTGCTCGATC TGCTCGAATC GGGGGATCTC TCCGAGAACC GGATCGACGA GAGCGTCCGG
CGCGTCCTCG AAGCGAAGTT CCGGCTGGGG CTGTTCGAGG ACCCCTACGT CGAGGCGGAC
CGGGTCGAGC AGGTCGGAAC CGACGACCAC CGCGCGGTCG CTCGCGAGGC CGCCCGCGAG
TCGATGACGT TGCTCCGGAA CGAAGACGAG GTGCTCCCGC TCGATGCAAG TCTCGACTCG
ATCGCTGTCC TCGGCCCGAA CGCCGACAAC CTCCGCAACC AGTTCGGTGG CTGGAGCACC
ATCTCCGAGC CCGAACCACC GGGGACGACC ATTCGCGAGG GGATCGAGCG GGCCGTTCCA
GTCGAAACGA CGGTCCGGTA CGAACAGGGC GCGTCGATGA CCGAGACCGT CGATCTCGAC
GCTGCCCGCG AAGCTGCAGA CGCGAGCGAG GCCGCCGTGG TCGTCGTCGG CGAGACCGGG
TATCGCCACG AGTTCCACCG CAGCGAAACC GACCGCGGCG AGTTCCCGAC CCGATCAGAA
CTCGAACTTC CCGAGGCACA GCGTGAGTTG CTCGGGGCGG TCCGAGAAAC CGGAACGCCG
ACCGTCGCCG TCTTCGTCGC CGGCCGCCCG CTCGCCATGG AGTGGACGGC CGAGCACGTG
CCGGCGATCC TGTTCGCCTA CCTGCCCGGC TCAGAGGGCG GGAACGCAGT CGCCGACGTG
CTCTTCGGCG ACGCGGACCC CGGCGGGTCG CTGCCGGTCT CGATTCCGCG GTCGAGTGGT
CACCTGCCGA CCCATTTCGA CTACCGCCCA CACCCCCATC CCATCGAAGG CAGCCCCCGC
GAGGAGAACC CGCGCCCGCC GGAGCATCCC GAGACATACG ATCCGCTGTT TCCATTCGGC
CACGGCCTGA GCTACGCGGC CTTCGAGGCC GGCGAGCTGT CGGTGTCGAC GGAGCGGGTC
GGCCCGGAGG GAAGTCTCAC GACGACCGTC GCGGTCGAAA ACGTCAGCGA CCGAGGGGGA
TCGACGACGC TGCACCTCTA TGGGACCGAC GAGTTCAGTT CCCGGGTAAC GCCCGTCCGG
GAACTGGTCG GCTTCCAGCG GGTCGAGCTA GCGGCCGGCG AAGGGACCGA AGTGACCTTC
GAGATCAACT TGGCGGATCT GGGTGTCCTC ACGGAGAACG GTGAGCGGCG CGCGGAAGCC
GGATCTATTA CGCTGTCGTG CGCTGGCGAG TCCGTCGAAG TCGTCGTCGA GGGCCGATTC
GACCGCTGA
 
Protein sequence
MDEDRPTDED LSMNDDPPYM DSNRSTGDRI EDLLSRMTPA EKVGQLVGTA PTLRPGRETV 
SGIAEAVTEH HLGAVSPFGH GGSPWETPAE CVEVANALQR EAIQNTRLGI PVLFYVDADH
GHGFVKGTTV FPHNLGMAAT RDPALVERAA SVTATEVAAT GAHQNLNPVA DVGREARWGR
IYETFGESPS LCASMSAAAV RGYQGDDIGD EGNVIATPKH FPAYSDPVRG EDGSPVDVSE
YTLRRVFRPP FEAAIDAGAG SIMPAYNELN GYPVHGSTEY LEGWLRGELD FDGYVVSDWN
GINMLHHDHR TARSMDEAVW QATTAGVDVA SVGGVEHAER LLDLLESGDL SENRIDESVR
RVLEAKFRLG LFEDPYVEAD RVEQVGTDDH RAVAREAARE SMTLLRNEDE VLPLDASLDS
IAVLGPNADN LRNQFGGWST ISEPEPPGTT IREGIERAVP VETTVRYEQG ASMTETVDLD
AAREAADASE AAVVVVGETG YRHEFHRSET DRGEFPTRSE LELPEAQREL LGAVRETGTP
TVAVFVAGRP LAMEWTAEHV PAILFAYLPG SEGGNAVADV LFGDADPGGS LPVSIPRSSG
HLPTHFDYRP HPHPIEGSPR EENPRPPEHP ETYDPLFPFG HGLSYAAFEA GELSVSTERV
GPEGSLTTTV AVENVSDRGG STTLHLYGTD EFSSRVTPVR ELVGFQRVEL AAGEGTEVTF
EINLADLGVL TENGERRAEA GSITLSCAGE SVEVVVEGRF DR
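
The gene and protein sequences above can be checked against each other, and against the listed GC content, with a short script. The sketch below is illustrative and not the tool that produced this record. It assumes the sequences are pasted as plain strings with the 10-character grouping spaces still present, and it uses the amino-acid assignments of translation table 11, which are the same as those of the standard code. Alternative start codons are not handled specially, which is sufficient here because the gene begins with ATG. The function names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order. The amino-acid assignments of
# translation table 11 match the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove grouping spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGACGAAG ACCGGCCAAC AGACGAGGAT CTGTCGATGA ACGACGATCC ACCGTACATG"
print(translate(first_line))  # MDEDRPTDEDLSMNDDPPYM
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.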