Gene Htur_4671 details


Gene Information

Locus tag: Htur_4671
Symbol:
ID: 8745272
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013745
Strand:
Start bp: 254970
End bp: 257507
Gene Length: 2538 bp
Protein Length: 845 aa
Translation table: 11
GC content: 61%
IMG OID: 646515180
Product: glycoside hydrolase family 2 sugar binding protein
Protein accession: YP_003406127
Protein GI: 284172745
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
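
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 254970
END_BP = 257507
GENE_LENGTH_BP = 2538
PROTEIN_LENGTH_AA = 845


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 257507 - 254970 + 1 gives 2538 bp, and 2538 / 3 gives 846 codons, one of which is the stop codon, leaving 845 residues.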


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGATAG AGTCGCTCAA CGGTAGCTGG AAGTTACGTC AGTCGGATAC CGATCGCTGG 
TTAGATGCGT CGGTTCCCGG CGGAGTCTAT ACGGACCTCC TCAACGCAGG TGAAATCCCC
GATCCGTACG ACGACGACAA CGAACTCGAC CTCCAGTGGG TCGGGACGTC CGACTGGGTG
TATCGACACA CCGTGACGCT CGACGATGAC TTTCTCGACG AGGAACGCGT ACGCTTGCGC
TGTGCCGGCC TTGACACTAT CGCGACGGTA CGCATCAACG GCACGGTCGT GGGCGAAGCC
GCTAACATGC ACCGCAAGTA CGAGTTCGAC GTCGGTGATG CCCTCACTCC CGGGGAGAAC
CAGGTCGAAA TCACGTTCCA CTCTCCGGTC GAGTATAGCG TTCGTCACTC AGAGAATCAC
GGGTATCAGG TTCCAACACT TCGATATCCG GTCGATCAGC CGGGACGGAA CTTTATCCGG
AAAGCCCAGT GCCATTACGG GTGGGACTGG GGGCCGTGTC TTCCGACCTC GGGAATCTGG
CGAGACATCG ACCTCCTCGC CTACTCTGAA CCGCGGATCG AGTACACGAA GACTGTACAG
GACCACGACG GCAACAGCGT CAGCCTCGAC GTGACCGTTG GCCTCGACGC ACCGGCCGAC
GGTGACGTAT TGCTCGCTGC CGAGGTCGCA AATACGGCGA CACATAAAGT CGTAGATGTC
GTCGAGGGGC ACAATGAGGT TACGATCACC CTCGACGTTT CAGATCCCGA TCTCTGGTGG
CCTAATGGGT ACGGCGACCA ACCGCTTTAC GACCTCATCA TAGCCGTCGA CACGAAACCC
GAGTCGGTTG CGGACGACAC GGACGCGGTG ACAGCCGACG GCGGCGTGAC GACGGCCGCC
TCGTCGCTCC TGCCCGACCC GGCTCACGAG ACGTCCACTC GCATCGGGTT CCGAGAGCTC
GAACTCGTCC GCGAACCGGA CGGGGAGGGC GACGGCGAGT CGTTCACGTT CGAGGTCAAT
GGGGTATCGG TGTTCGCGAA GGGTGCCAAC TGGATCCCGG CGGACGCGCT GTACGGACGG
ATCACGCGCG ATCGATACGA ATCGCTGCTC GACAGCGCGA TCGAGGCCAA CATGAACATG
ATTCGTGTCT GGGGCGGCGG TTACTACGAG CGAGACGGGT TCTACGAGGC GTGCGACGAG
CGGGGACTGC TCGTCTGGCA GGACTTCATG TTCGCCTGCG CACTGTACCC GAGCGACGAC
GATTATCTGG CGTCCGTCGA GGAGGAGGTC CGGTACCAAG TTCGCCGGCT CGCCGACCAC
CCGTCGATCG CGCTCTGGTG TGGAAATAAC GAAGTCGAAA TGGGCCTTGA AAGCTGGTTC
GATGACGCCG ACGAACTGGA ACAGTTGAAG GAGGACTACG AGACGCTGTT CTACGACGTG
ATCGGCGATA CCGTTGCTGA AGAGGACGAG ACACGGACGT ACTGGCCCGG ATCACCATCC
AGTGGCACCG GGAGGCAAGA CCCCTACCCG GCGAACAAGG GCGACATCCA CTACTGGGAC
GTCTGGCATG ACGGCGCGGA CTTCGAGGAG TACGAGACGG TCGAACCGCG ATTCGTCTCC
GAGTTCGGAT ACCAGTCATT CCCCTCGGTC GACGCCCTCT CGTCGGTGCT CCCCGACAAC
GAACTCAACC CGACCGCGCC GCTGATGGAA CACCATCAGC GACACGAGGA GGGAAATCGG
ACGATCCTCC AGCGGATGGC GGCGTTGTTC CGCATCCCGT TTAGCTTCGC AGACTTCGTC
TATCTCAGTC AAGTGCAGCA AGGACTGGCG ATGAAGGTCG CCATCGAACA CTGGCGGCGG
CTGAAACCCG ATTGCATGGG GACGCTCTAC TGGCAGTTGA ACGATCTCTG GCCCTGCGCG
TCGTGGTCAT CTATCGAGTA CGGCGGCGAC TGGAAGGCGC TCCAGCACGT CAGCCGCCGT
ATCTACGCAC CGGTCCTGCT CTCGACGACG ATGACGGACG ACGGCGACGA GGTCGAAATC
TGGCTCACGA ACGACGAACG CGACCACCTG AAAGGGAATG TCGCTGTCGA AGCATACACT
TTCGACGGGG AACGCATCGA CGGGACCGAC GAGCGCGTCT CGGTCGCGGC GCTCGACAGC
GCCCGCGTCG CGACCGTCAA CGCGGATCGA TTACTCGGCG ACATCCCACG GGAGGAGGCA
TTCCTCCGCG TCACTTTCGA CGGGAGCGAC GAGACGTATC CGGCGTTCAC GTTCTTCGAG
GAGTACAAGC ACCTCGAACT CCCGGAGCCG AACTTCGACG TCGCTGTCGA CAGGAACGAG
GTGACGATTA AGGCGGACGC CGCCGCCCTG TTCGTCGAAC TGAACGTCCC GCTCGACGGC
CAGTTCTCGG ACAACTACTT CCACCTGACG CCTGGCGAAG AGCAAAGGGT CGCGTTCAAC
GCCGCGGACC CACCCGACGA TCTCGAACGG CGACTCACTG AGGAACTGTC GCTGAACCAC
CTCCGTGCAA CCTACTGA
 
Protein sequence
MRIESLNGSW KLRQSDTDRW LDASVPGGVY TDLLNAGEIP DPYDDDNELD LQWVGTSDWV 
YRHTVTLDDD FLDEERVRLR CAGLDTIATV RINGTVVGEA ANMHRKYEFD VGDALTPGEN
QVEITFHSPV EYSVRHSENH GYQVPTLRYP VDQPGRNFIR KAQCHYGWDW GPCLPTSGIW
RDIDLLAYSE PRIEYTKTVQ DHDGNSVSLD VTVGLDAPAD GDVLLAAEVA NTATHKVVDV
VEGHNEVTIT LDVSDPDLWW PNGYGDQPLY DLIIAVDTKP ESVADDTDAV TADGGVTTAA
SSLLPDPAHE TSTRIGFREL ELVREPDGEG DGESFTFEVN GVSVFAKGAN WIPADALYGR
ITRDRYESLL DSAIEANMNM IRVWGGGYYE RDGFYEACDE RGLLVWQDFM FACALYPSDD
DYLASVEEEV RYQVRRLADH PSIALWCGNN EVEMGLESWF DDADELEQLK EDYETLFYDV
IGDTVAEEDE TRTYWPGSPS SGTGRQDPYP ANKGDIHYWD VWHDGADFEE YETVEPRFVS
EFGYQSFPSV DALSSVLPDN ELNPTAPLME HHQRHEEGNR TILQRMAALF RIPFSFADFV
YLSQVQQGLA MKVAIEHWRR LKPDCMGTLY WQLNDLWPCA SWSSIEYGGD WKALQHVSRR
IYAPVLLSTT MTDDGDEVEI WLTNDERDHL KGNVAVEAYT FDGERIDGTD ERVSVAALDS
ARVATVNADR LLGDIPREEA FLRVTFDGSD ETYPAFTFFE EYKHLELPEP NFDVAVDRNE
VTIKADAAAL FVELNVPLDG QFSDNYFHLT PGEEQRVAFN AADPPDDLER RLTEELSLNH
LRATY
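
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any length or composition check. The following is a rough sketch of that cleanup, with a GC percentage helper; `normalize` and `gc_percent` are hypothetical names, and whether the record's GC content field was computed this way is an assumption.

```python
def normalize(seq_text: str) -> str:
    """Remove all whitespace from a grouped sequence and upper-case it."""
    return "".join(seq_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence (whitespace ignored)."""
    cleaned = normalize(dna)
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First row of the gene sequence in this record: six groups of ten bases.
first_row = "ATGCGGATAG AGTCGCTCAA CGGTAGCTGG AAGTTACGTC AGTCGGATAC CGATCGCTGG"
print(len(normalize(first_row)))
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence through `normalize` should give a string whose length can be compared with the Gene Length field, and the full protein sequence can be compared with the Protein Length field in the same way.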