Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2707 |
Symbol | |
ID | 8385012 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 2775870 |
End bp | 2777744 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644973781 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003131601 |
Protein GI | 257053768 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGCGG CGGCCCCCGC CACACGTATG ATGAACGAGT GGCGGGCGGC GAGAGTCGAG CCGGGCGGCG ACCGGCCCGA TCCGGCGACC TGGGAGCCGG TCGAGGTGCC GGGGCGGCCG GCGGCATTTG CTGGCGCTGA CGCCGTGGCC TACCGATCGA CGTTCGCGGA TCCGACAGCG GAAGACGAAC ACGCGACGCT CGTCCTGGAA GGAGTCTACG CGCACGCTCG CGTCTGGCTC AACGACACGT TTCTGGGTGA ACACGACGCC TACTTCGAGC CGCTCCGACT CCGCCTCGAC GCCGCGCTCG CTGCCGAAAA CGAGCTGCTT GTCGAATGCC GACCGCCGGA GGACCGTTTC GGTGGGAGTT ACGAAACCGA CGAGGTCCCC GACGAACTCG CGGTCCCCGG TATCTGGTGG GATGTCGATG TCGAGACATA CACGGACCAC CACATCCTCG ATCTCTCGGC CCGCCCGCGG GTCGACGACG AGGATGCGCG CTTCGACGTC CGGGCGACCG TCCTCGCCGA GACAGCGCTC GACGATCGGC TGACGTTCTC CGTCAAGCCC GAGGGGTCAC GCCGTGGTCG GGGAATGATG GATCGCACGG CGATCGAGGC GGATGCCGGC GAACGGACGA CGGTCGAATA CACGATCGAT ATCCGGGATC CCGCCCTGTG GTGGCCCCAC GACCTCGGCG AGCAGAACCG CTACGTCATC CGGGCGAAGC TCGACGACGA CGAACGCACG CTGACCACGG GGCTGTGCTC GGTCGACGAC GACGAAAGCG ATGGGCTCCG GGTCAACGAC ACGCCGATGA CGGCCCGTGG CGTTGGTCTC CTCGCAGCCG AGCCAGAGGA CATCGAACGA GCGGTCGACC TCAACGCCAA CCTGGTCCGG GCACACGCTC ACGTCCCGTC GCCGGCCGTC TACGAGGCGG CCGACGAGGC CGGCGTCTTG GTCTGGCAGG ACCTCCCGCT GACCGGGCCG GGCCCGTTCG ACATCGAGCG TGGCCGGGAC CTGACCGGTC GACTCGTCTC AGCGTACGAA CATCACCCCG GCTTCGCCGC GATCAGCGTC CACGACGATC CGGTCACGGT GAGTGACGGA CCGCTCGGGT CCGGGTTCCT CGACCGGTTG CGTCTCCGGT GGCGGCGTTT TCGGGCGGAC TACGATCACG AACCCGCCGA AACCGTCGCC GCCGAGGTTC CCGACGGTAT CGTCACGCTG CCAGTCGTCG GGCCACCCGG GATCGAGCCG GACGCGACGT CGCTGTATCC CGGCTGGAGA TACGGCCAGG CGGTGGACGT CGATCAGCTA CTGGAGCGGA ATCCCTCGCT CGATTCCGTG GTTGGGGAGT ACGGTGCGGG ATCGCTGGGC GTCGAAGACC CCGTCGACGT CGACGGGTTC GATCGTGAGG TTCACGACTA TCACGTCTCC GGGGGCGTCG AGGACTCCCA GGCCTACCAG CGGTCCGTGA TCGCGACGGT CACCGAACGA CTTCGCCTGC GGGGGACGGA CGTACTCGTG GCTGGATCGC TCCGTGATCT TGGCGACGCC GGCATGGGCG TCCTGGCGCG CGACGGTACA CCCAAGGATG CACACGACGC ACTCGCCAGC GCCTTCGAGC CAGTGCAAGC GATGCTTGCC GATCCGCGTG CGGGTGGAGA ATCGGACGTC GTCGTCCACA ACGACCTGCC AACCGACGTC ACCGACCGCC TCACCTGGGA GGTTGGCGGT GAAACCGGAG AAGCCGACGT CGCCATCGGC GCTGCGAGCA GTGAGACAGT GACGTCGATC TCGATTCCGC AAGATGCCGA GACGATCACG CTGTCGCTGG CCGGCCACTC AGTCTCAAAC ACATATCAGT TATAA
|
Protein sequence | MPAAAPATRM MNEWRAARVE PGGDRPDPAT WEPVEVPGRP AAFAGADAVA YRSTFADPTA EDEHATLVLE GVYAHARVWL NDTFLGEHDA YFEPLRLRLD AALAAENELL VECRPPEDRF GGSYETDEVP DELAVPGIWW DVDVETYTDH HILDLSARPR VDDEDARFDV RATVLAETAL DDRLTFSVKP EGSRRGRGMM DRTAIEADAG ERTTVEYTID IRDPALWWPH DLGEQNRYVI RAKLDDDERT LTTGLCSVDD DESDGLRVND TPMTARGVGL LAAEPEDIER AVDLNANLVR AHAHVPSPAV YEAADEAGVL VWQDLPLTGP GPFDIERGRD LTGRLVSAYE HHPGFAAISV HDDPVTVSDG PLGSGFLDRL RLRWRRFRAD YDHEPAETVA AEVPDGIVTL PVVGPPGIEP DATSLYPGWR YGQAVDVDQL LERNPSLDSV VGEYGAGSLG VEDPVDVDGF DREVHDYHVS GGVEDSQAYQ RSVIATVTER LRLRGTDVLV AGSLRDLGDA GMGVLARDGT PKDAHDALAS AFEPVQAMLA DPRAGGESDV VVHNDLPTDV TDRLTWEVGG ETGEADVAIG AASSETVTSI SIPQDAETIT LSLAGHSVSN TYQL
|
| |