Gene Huta_2707 details

Gene Information

Locus tag: Huta_2707
Symbol:
ID: 8385012
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 2775870
End bp: 2777744
Gene Length: 1875 bp
Protein Length: 624 aa
Translation table: 11
GC content: 67%
IMG OID: 644973781
Product: glycoside hydrolase family 2 sugar binding
Protein accession: YP_003131601
Protein GI: 257053768
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
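
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon; neither convention is stated in the record itself, although the sequence does end in TAA. The function name `check_cds_lengths` is hypothetical and used only for illustration.

```python
def check_cds_lengths(start_bp: int, end_bp: int,
                      gene_length_bp: int, protein_length_aa: int) -> bool:
    """Return True if coordinates, gene length and protein length agree.

    Assumes 1-based inclusive coordinates and a CDS that includes
    one stop codon (assumptions, not stated in the record).
    """
    span = end_bp - start_bp + 1          # inclusive coordinate span
    if span != gene_length_bp:
        return False
    if span % 3 != 0:                     # a CDS should be a whole number of codons
        return False
    return span // 3 - 1 == protein_length_aa  # subtract the stop codon


# Values copied from the Gene Information fields of this record.
print(check_cds_lengths(2775870, 2777744, 1875, 624))  # prints True
```

With these values, 2777744 - 2775870 + 1 = 1875 bp, which is 625 codons, or 624 amino acids plus one stop codon.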


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCGCGG CGGCCCCCGC CACACGTATG ATGAACGAGT GGCGGGCGGC GAGAGTCGAG 
CCGGGCGGCG ACCGGCCCGA TCCGGCGACC TGGGAGCCGG TCGAGGTGCC GGGGCGGCCG
GCGGCATTTG CTGGCGCTGA CGCCGTGGCC TACCGATCGA CGTTCGCGGA TCCGACAGCG
GAAGACGAAC ACGCGACGCT CGTCCTGGAA GGAGTCTACG CGCACGCTCG CGTCTGGCTC
AACGACACGT TTCTGGGTGA ACACGACGCC TACTTCGAGC CGCTCCGACT CCGCCTCGAC
GCCGCGCTCG CTGCCGAAAA CGAGCTGCTT GTCGAATGCC GACCGCCGGA GGACCGTTTC
GGTGGGAGTT ACGAAACCGA CGAGGTCCCC GACGAACTCG CGGTCCCCGG TATCTGGTGG
GATGTCGATG TCGAGACATA CACGGACCAC CACATCCTCG ATCTCTCGGC CCGCCCGCGG
GTCGACGACG AGGATGCGCG CTTCGACGTC CGGGCGACCG TCCTCGCCGA GACAGCGCTC
GACGATCGGC TGACGTTCTC CGTCAAGCCC GAGGGGTCAC GCCGTGGTCG GGGAATGATG
GATCGCACGG CGATCGAGGC GGATGCCGGC GAACGGACGA CGGTCGAATA CACGATCGAT
ATCCGGGATC CCGCCCTGTG GTGGCCCCAC GACCTCGGCG AGCAGAACCG CTACGTCATC
CGGGCGAAGC TCGACGACGA CGAACGCACG CTGACCACGG GGCTGTGCTC GGTCGACGAC
GACGAAAGCG ATGGGCTCCG GGTCAACGAC ACGCCGATGA CGGCCCGTGG CGTTGGTCTC
CTCGCAGCCG AGCCAGAGGA CATCGAACGA GCGGTCGACC TCAACGCCAA CCTGGTCCGG
GCACACGCTC ACGTCCCGTC GCCGGCCGTC TACGAGGCGG CCGACGAGGC CGGCGTCTTG
GTCTGGCAGG ACCTCCCGCT GACCGGGCCG GGCCCGTTCG ACATCGAGCG TGGCCGGGAC
CTGACCGGTC GACTCGTCTC AGCGTACGAA CATCACCCCG GCTTCGCCGC GATCAGCGTC
CACGACGATC CGGTCACGGT GAGTGACGGA CCGCTCGGGT CCGGGTTCCT CGACCGGTTG
CGTCTCCGGT GGCGGCGTTT TCGGGCGGAC TACGATCACG AACCCGCCGA AACCGTCGCC
GCCGAGGTTC CCGACGGTAT CGTCACGCTG CCAGTCGTCG GGCCACCCGG GATCGAGCCG
GACGCGACGT CGCTGTATCC CGGCTGGAGA TACGGCCAGG CGGTGGACGT CGATCAGCTA
CTGGAGCGGA ATCCCTCGCT CGATTCCGTG GTTGGGGAGT ACGGTGCGGG ATCGCTGGGC
GTCGAAGACC CCGTCGACGT CGACGGGTTC GATCGTGAGG TTCACGACTA TCACGTCTCC
GGGGGCGTCG AGGACTCCCA GGCCTACCAG CGGTCCGTGA TCGCGACGGT CACCGAACGA
CTTCGCCTGC GGGGGACGGA CGTACTCGTG GCTGGATCGC TCCGTGATCT TGGCGACGCC
GGCATGGGCG TCCTGGCGCG CGACGGTACA CCCAAGGATG CACACGACGC ACTCGCCAGC
GCCTTCGAGC CAGTGCAAGC GATGCTTGCC GATCCGCGTG CGGGTGGAGA ATCGGACGTC
GTCGTCCACA ACGACCTGCC AACCGACGTC ACCGACCGCC TCACCTGGGA GGTTGGCGGT
GAAACCGGAG AAGCCGACGT CGCCATCGGC GCTGCGAGCA GTGAGACAGT GACGTCGATC
TCGATTCCGC AAGATGCCGA GACGATCACG CTGTCGCTGG CCGGCCACTC AGTCTCAAAC
ACATATCAGT TATAA
 
Protein sequence
MPAAAPATRM MNEWRAARVE PGGDRPDPAT WEPVEVPGRP AAFAGADAVA YRSTFADPTA 
EDEHATLVLE GVYAHARVWL NDTFLGEHDA YFEPLRLRLD AALAAENELL VECRPPEDRF
GGSYETDEVP DELAVPGIWW DVDVETYTDH HILDLSARPR VDDEDARFDV RATVLAETAL
DDRLTFSVKP EGSRRGRGMM DRTAIEADAG ERTTVEYTID IRDPALWWPH DLGEQNRYVI
RAKLDDDERT LTTGLCSVDD DESDGLRVND TPMTARGVGL LAAEPEDIER AVDLNANLVR
AHAHVPSPAV YEAADEAGVL VWQDLPLTGP GPFDIERGRD LTGRLVSAYE HHPGFAAISV
HDDPVTVSDG PLGSGFLDRL RLRWRRFRAD YDHEPAETVA AEVPDGIVTL PVVGPPGIEP
DATSLYPGWR YGQAVDVDQL LERNPSLDSV VGEYGAGSLG VEDPVDVDGF DREVHDYHVS
GGVEDSQAYQ RSVIATVTER LRLRGTDVLV AGSLRDLGDA GMGVLARDGT PKDAHDALAS
AFEPVQAMLA DPRAGGESDV VVHNDLPTDV TDRLTWEVGG ETGEADVAIG AASSETVTSI
SIPQDAETIT LSLAGHSVSN TYQL
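
As a spot check that the protein sequence corresponds to the gene sequence, the first and last blocks of the gene sequence can be translated by hand. The following is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard amino-acid assignments, which translation table 11 shares for all 64 codons (table 11 differs from the standard table only in its set of permitted start codons). The helper name `translate` is an illustrative choice.

```python
from itertools import product

# Codon table in TCAG order; the amino-acid assignments are the same
# for the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 15 nucleotides, copied from the gene sequence above.
print(translate("ATGCCCGCGG CGGCCCCCGC CACACGTATG"))  # prints MPAAAPATRM
print(translate("ACATATCAGT TATAA"))                  # prints TYQL*
```

Both results match the protein sequence shown above: it begins with MPAAAPATRM and ends with TYQL, followed by the TAA stop codon.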