Gene Huta_1047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1047 
Symbol 
ID8383321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1012162 
End bp1014363 
Gene Length2202 bp 
Protein Length733 aa 
Translation table11 
GC content67% 
IMG OID644972112 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_003129963 
Protein GI257052130 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGGAG TCAGCAACAC GGCGCTCACC GAATACGTGG ACGAGCAGCG ACGCGAGGCT 
ATTGCCGAGC GAGTCGAGGA ACTGCTCGCA GCGATGACCA TCGAGGAAAA AGTCGGGCAG
CTAAACCAGC GGTCGGTCAG TTTTGTCACG GGGTCGGAAG ACGACACCGA CGATCTCGAA
ACGGCGATCG CCGATGGGGA GGTCGGATCC GTACTCAATG CACAGGGACT CGATGCAAAG
CGGCACCTCC AGGAGATCGC CGTCGAGGAA TCCAGACTCG GCATCCCGCT GGTGATGGCT
TTCGACGTCA TCCATGGCTA CCGGACGGTG TTCCCGACGC CGCTGGGCCA GGCAGCCAGT
TGGGAACCGG ACCTCGCCGA GCGAGCGGAA CGCGTCGCCG CGACGGAAGC CAGCGCCGAC
GGCCACCACT GGACGTTCGC CCCGATGGTC GACGTCTCCC GGGACCCACG GTGGGGCCGG
GTCATGGAAG GGTCGGGCGA GTCCCCCGTC CTCGGGAGTG CGTTCGCCCG GGCGCGCGTC
CGGGGATTCC AGGGCGATGA TCTCGCTGAT ACCGACACAA TGCTCGCCTG CGCGAAGCAC
TTCGCCGGGT ACGGCGCGAG CGAGGCCGGG CGGGACTACA ACACCGTCAA CGTCTCCGAG
ACGGCGCTCC GAGACAGGCA TCTCCCGCCG TTCGAAGCCG CGGTCGAGAC GGGCGTCGCG
ACGGTCATGA ACGCGTTCAA TACCATCGAG CGAATCCCTG CGAGCGGTAA CGAGAGTCTG
GTCTCGGGCG TCCTCAAGGG CGAGTGGGGA TTCGAGGGCG CAATGGTGTC CGATTGGGAC
TCCTTCGGTG AGCAAATGCC ACACGGTGTC GCGGCGGACG AACGCGAGGC CGCCAAACGC
GCCATGCTGG CCGGGTCGGA CGTCGACATG GTGAGTGAAG TCCTGCTCGA GGAGCTGCCC
GAGCTCGTCC GCGACGGCGA GGTGCCCGAG TCGCGACTCG ACGACGCCGT CGCGCGCGTC
CTCTGGATGA AAGGCCTCCT CGGTCTCTTC GAGGACCCGT ATCAGTACTT CGACGAGGAT
CGGCGTGAAG CCGTCACTCG CACGGACGAA CAACGCGAGA CTGCCCGGGA GGTCGCCGAA
CGATCGTTCG TCCTGCTGAA AAACGAGGGC GTCCTCCCGC TCGAGGACGA CGCTGAGGTC
GGAGTCGTCG GTGCACTGGC CGACAGCGAC GAGGACACCC TCGGCGCGTG GGCGTGGGGC
GGCGATCCCG AGGACGTGAC CACGATCCGC GCGGGACTGG ACGATCACTT CGACGGCGTT
CCCTACGCGG CTGGCTACGA CCTGCCGGGC GAGGTGACCG ACGAAACGCT GGCTGACGCC
CGCGAAGTCG CCGAGGCGTC GGACGTGGTC GTGTGCGTCG TCGGCGAGCC AGCAGACATG
ACCGGCGAGG CTGCGAGTCG GGCGCACGTC GATCTGCCCG ACGAGCAGCG TCGGCTGCTG
GAAGCCCTCC ACGACACCGG AACGCCGGTG GTCGCGCTGC TGATGAACGG CCGCCCGCTG
GCGGTCGAGT GGCTCGACGA GCACCTCCCG GTCATTCTGG ACATCTGGCA TCCGGGCACC
GAAGCAGGCC CGGCGGTCGC CCGAGTGCTC GCCGGCGACA CCTCCCCCGG TGGTCACCTG
CCGATGAGTG TCCCCTACAC CGAGGGCCAG ATCCCGGTCG CTCACGACCG ACTGCCGACG
GGCCGACCGG CGGACCAGGC CGAACGCGAG GAGGAGTACG TCTCGGCATA TCTCGACGTG
CCCAACGAAC CGCTGTACGC CTTCGGCCAC GGCGAGAGCT ACACCGACTT TGCCTATAGC
GACCTCTCGC TGTCGACGGA CACCCTCGTG CCCGGTGCGA CGCTCGAAGC GAGCGTCACC
GTCGAGAACA CCGGCGATGT CGCGGGGCGT GACGTTGTCC AGTGGTACGT CCATGACCTC
GTCGGCAGTC GGTCACGGCC GGAGAAAGAA CTGATCGCCT TCGAGACAGT GGATCTCGAA
CCGGGCGAAT CAGCGACCGT CACGGTCGAG ATCGAGGAGA GCGACCTGGC GTTCTGGACT
GCCGAGGAAG CGTGGGCCGC CGAACCCGGC GAATTCGATC TCATGGTCGG CCATGCGGCC
GATGACATCG TCGATACCGA GCGCTTCGCG TTCGAAGCGT AG
 
Protein sequence
MTGVSNTALT EYVDEQRREA IAERVEELLA AMTIEEKVGQ LNQRSVSFVT GSEDDTDDLE 
TAIADGEVGS VLNAQGLDAK RHLQEIAVEE SRLGIPLVMA FDVIHGYRTV FPTPLGQAAS
WEPDLAERAE RVAATEASAD GHHWTFAPMV DVSRDPRWGR VMEGSGESPV LGSAFARARV
RGFQGDDLAD TDTMLACAKH FAGYGASEAG RDYNTVNVSE TALRDRHLPP FEAAVETGVA
TVMNAFNTIE RIPASGNESL VSGVLKGEWG FEGAMVSDWD SFGEQMPHGV AADEREAAKR
AMLAGSDVDM VSEVLLEELP ELVRDGEVPE SRLDDAVARV LWMKGLLGLF EDPYQYFDED
RREAVTRTDE QRETAREVAE RSFVLLKNEG VLPLEDDAEV GVVGALADSD EDTLGAWAWG
GDPEDVTTIR AGLDDHFDGV PYAAGYDLPG EVTDETLADA REVAEASDVV VCVVGEPADM
TGEAASRAHV DLPDEQRRLL EALHDTGTPV VALLMNGRPL AVEWLDEHLP VILDIWHPGT
EAGPAVARVL AGDTSPGGHL PMSVPYTEGQ IPVAHDRLPT GRPADQAERE EEYVSAYLDV
PNEPLYAFGH GESYTDFAYS DLSLSTDTLV PGATLEASVT VENTGDVAGR DVVQWYVHDL
VGSRSRPEKE LIAFETVDLE PGESATVTVE IEESDLAFWT AEEAWAAEPG EFDLMVGHAA
DDIVDTERFA FEA