Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4671 |
Symbol | |
ID | 8745272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 254970 |
End bp | 257507 |
Gene Length | 2538 bp |
Protein Length | 845 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 646515180 |
Product | glycoside hydrolase family 2 sugar binding protein |
Protein accession | YP_003406127 |
Protein GI | 284172745 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGATAG AGTCGCTCAA CGGTAGCTGG AAGTTACGTC AGTCGGATAC CGATCGCTGG TTAGATGCGT CGGTTCCCGG CGGAGTCTAT ACGGACCTCC TCAACGCAGG TGAAATCCCC GATCCGTACG ACGACGACAA CGAACTCGAC CTCCAGTGGG TCGGGACGTC CGACTGGGTG TATCGACACA CCGTGACGCT CGACGATGAC TTTCTCGACG AGGAACGCGT ACGCTTGCGC TGTGCCGGCC TTGACACTAT CGCGACGGTA CGCATCAACG GCACGGTCGT GGGCGAAGCC GCTAACATGC ACCGCAAGTA CGAGTTCGAC GTCGGTGATG CCCTCACTCC CGGGGAGAAC CAGGTCGAAA TCACGTTCCA CTCTCCGGTC GAGTATAGCG TTCGTCACTC AGAGAATCAC GGGTATCAGG TTCCAACACT TCGATATCCG GTCGATCAGC CGGGACGGAA CTTTATCCGG AAAGCCCAGT GCCATTACGG GTGGGACTGG GGGCCGTGTC TTCCGACCTC GGGAATCTGG CGAGACATCG ACCTCCTCGC CTACTCTGAA CCGCGGATCG AGTACACGAA GACTGTACAG GACCACGACG GCAACAGCGT CAGCCTCGAC GTGACCGTTG GCCTCGACGC ACCGGCCGAC GGTGACGTAT TGCTCGCTGC CGAGGTCGCA AATACGGCGA CACATAAAGT CGTAGATGTC GTCGAGGGGC ACAATGAGGT TACGATCACC CTCGACGTTT CAGATCCCGA TCTCTGGTGG CCTAATGGGT ACGGCGACCA ACCGCTTTAC GACCTCATCA TAGCCGTCGA CACGAAACCC GAGTCGGTTG CGGACGACAC GGACGCGGTG ACAGCCGACG GCGGCGTGAC GACGGCCGCC TCGTCGCTCC TGCCCGACCC GGCTCACGAG ACGTCCACTC GCATCGGGTT CCGAGAGCTC GAACTCGTCC GCGAACCGGA CGGGGAGGGC GACGGCGAGT CGTTCACGTT CGAGGTCAAT GGGGTATCGG TGTTCGCGAA GGGTGCCAAC TGGATCCCGG CGGACGCGCT GTACGGACGG ATCACGCGCG ATCGATACGA ATCGCTGCTC GACAGCGCGA TCGAGGCCAA CATGAACATG ATTCGTGTCT GGGGCGGCGG TTACTACGAG CGAGACGGGT TCTACGAGGC GTGCGACGAG CGGGGACTGC TCGTCTGGCA GGACTTCATG TTCGCCTGCG CACTGTACCC GAGCGACGAC GATTATCTGG CGTCCGTCGA GGAGGAGGTC CGGTACCAAG TTCGCCGGCT CGCCGACCAC CCGTCGATCG CGCTCTGGTG TGGAAATAAC GAAGTCGAAA TGGGCCTTGA AAGCTGGTTC GATGACGCCG ACGAACTGGA ACAGTTGAAG GAGGACTACG AGACGCTGTT CTACGACGTG ATCGGCGATA CCGTTGCTGA AGAGGACGAG ACACGGACGT ACTGGCCCGG ATCACCATCC AGTGGCACCG GGAGGCAAGA CCCCTACCCG GCGAACAAGG GCGACATCCA CTACTGGGAC GTCTGGCATG ACGGCGCGGA CTTCGAGGAG TACGAGACGG TCGAACCGCG ATTCGTCTCC GAGTTCGGAT ACCAGTCATT CCCCTCGGTC GACGCCCTCT CGTCGGTGCT CCCCGACAAC GAACTCAACC CGACCGCGCC GCTGATGGAA CACCATCAGC GACACGAGGA GGGAAATCGG ACGATCCTCC AGCGGATGGC GGCGTTGTTC CGCATCCCGT TTAGCTTCGC AGACTTCGTC TATCTCAGTC AAGTGCAGCA AGGACTGGCG ATGAAGGTCG CCATCGAACA CTGGCGGCGG CTGAAACCCG ATTGCATGGG GACGCTCTAC TGGCAGTTGA ACGATCTCTG GCCCTGCGCG TCGTGGTCAT CTATCGAGTA CGGCGGCGAC TGGAAGGCGC TCCAGCACGT CAGCCGCCGT ATCTACGCAC CGGTCCTGCT CTCGACGACG ATGACGGACG ACGGCGACGA GGTCGAAATC TGGCTCACGA ACGACGAACG CGACCACCTG AAAGGGAATG TCGCTGTCGA AGCATACACT TTCGACGGGG AACGCATCGA CGGGACCGAC GAGCGCGTCT CGGTCGCGGC GCTCGACAGC GCCCGCGTCG CGACCGTCAA CGCGGATCGA TTACTCGGCG ACATCCCACG GGAGGAGGCA TTCCTCCGCG TCACTTTCGA CGGGAGCGAC GAGACGTATC CGGCGTTCAC GTTCTTCGAG GAGTACAAGC ACCTCGAACT CCCGGAGCCG AACTTCGACG TCGCTGTCGA CAGGAACGAG GTGACGATTA AGGCGGACGC CGCCGCCCTG TTCGTCGAAC TGAACGTCCC GCTCGACGGC CAGTTCTCGG ACAACTACTT CCACCTGACG CCTGGCGAAG AGCAAAGGGT CGCGTTCAAC GCCGCGGACC CACCCGACGA TCTCGAACGG CGACTCACTG AGGAACTGTC GCTGAACCAC CTCCGTGCAA CCTACTGA
|
Protein sequence | MRIESLNGSW KLRQSDTDRW LDASVPGGVY TDLLNAGEIP DPYDDDNELD LQWVGTSDWV YRHTVTLDDD FLDEERVRLR CAGLDTIATV RINGTVVGEA ANMHRKYEFD VGDALTPGEN QVEITFHSPV EYSVRHSENH GYQVPTLRYP VDQPGRNFIR KAQCHYGWDW GPCLPTSGIW RDIDLLAYSE PRIEYTKTVQ DHDGNSVSLD VTVGLDAPAD GDVLLAAEVA NTATHKVVDV VEGHNEVTIT LDVSDPDLWW PNGYGDQPLY DLIIAVDTKP ESVADDTDAV TADGGVTTAA SSLLPDPAHE TSTRIGFREL ELVREPDGEG DGESFTFEVN GVSVFAKGAN WIPADALYGR ITRDRYESLL DSAIEANMNM IRVWGGGYYE RDGFYEACDE RGLLVWQDFM FACALYPSDD DYLASVEEEV RYQVRRLADH PSIALWCGNN EVEMGLESWF DDADELEQLK EDYETLFYDV IGDTVAEEDE TRTYWPGSPS SGTGRQDPYP ANKGDIHYWD VWHDGADFEE YETVEPRFVS EFGYQSFPSV DALSSVLPDN ELNPTAPLME HHQRHEEGNR TILQRMAALF RIPFSFADFV YLSQVQQGLA MKVAIEHWRR LKPDCMGTLY WQLNDLWPCA SWSSIEYGGD WKALQHVSRR IYAPVLLSTT MTDDGDEVEI WLTNDERDHL KGNVAVEAYT FDGERIDGTD ERVSVAALDS ARVATVNADR LLGDIPREEA FLRVTFDGSD ETYPAFTFFE EYKHLELPEP NFDVAVDRNE VTIKADAAAL FVELNVPLDG QFSDNYFHLT PGEEQRVAFN AADPPDDLER RLTEELSLNH LRATY
|
| |