Gene Htur_3896 details


Gene Information

Locus tag: Htur_3896
Symbol:
ID: 8744524
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013744
Strand:
Start bp: 134317
End bp: 137172
Gene Length: 2856 bp
Protein Length: 951 aa
Translation table: 11
GC content: 65%
IMG OID: 646514480
Product: glycoside hydrolase family 2 sugar binding protein
Protein accession: YP_003405427
Protein GI: 284167149
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
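
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information record.
start_bp = 134317
end_bp = 137172
gene_length_bp = 2856
protein_length_aa = 951

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the coding sequence ends with a single stop codon,
# which occupies one codon but contributes no residue.
codons = span_bp // 3
expected_protein_length = codons - 1

print(span_bp, span_bp % 3, expected_protein_length)  # 2856 0 951
```

Under those assumptions the span matches the stated gene length, is a whole number of codons, and gives the stated protein length.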


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0623338
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGTCCTC CGGAGAAAGG GGTGCTCATG CAATCGATCT CGTTATCGGG ACCGTGGGCT 
CTGCGGCTCA ATCCCGACCG CGACCGACAC GTCGTTGATC CGCCTGACGA CGTTATCGAC
GGGACGATGT ACCTTCCCGG ATCCACCGAC GAATACGGCT ACGGGACAAC CGTCGACGAT
CGACCGCGGG ACCACCTCCA ACGGACGCAT CGATACGAGG GGTCGGCGTG GTACCAGCGC
ACCGTCTCCG TCCCCGATTC CTGGGCGGGC AAACGCGTGA CACTAACGCT CGAGCGAACC
AGACCGACCG AGGTCTGGGT CGACGGGGAG CGTATCGGCT CGCGAGTGTG CCTGAGCACC
CCGCACGTAT ACGATCTCAC CGAGGCGATC GGCCCCGGCG AGCACGAAAT CGCCGTTCGA
GTGGACAACG CCGACGACTC GATGGATCGT CCGGGAGTCA GGCGATCGCA CGCGAGCACT
GAACACACGC AGACCAACTG GAACGGCATC ATCGGGGACC TCCGGCTGGA GGCGACACCG
AACGTCTGGA TCGAGAGCGT CGACCCGTTC CCGGAACCGA GCGAGAACGC CGTCGACCTC
GAGATAACCC TCGGGAACGC CACGGAGACC GATTTTCAGG GAACCGTCTC TGCCGCTGCC
CGGAGTACGA CCACCGATGA GATCCACGTC GCGGACACCG TCGATCGCCC CGTCTCGGTC
AGTGCCGGCG ATGGCTCGGC GCCGGGTCGA ACGACTCTCG AGTTCACATA CGATCTCGGA
TCCGACGCGC TCACCTGGGA CGAGTTCTCG CCGGCGGTCT ACGAGTTGAC CGTCTCGCTC
GAGACGGCAG GGGACGAGAA CGCCGCCGGC CACGAGTTCG AGACCGCGTT CGGCCTCCGC
GAGTTCGAAG CCGACGGCAC GCAGTTCTCG ATCAACGGGA CGACGATCTT CCTGCGCGGC
CGGACGGACT GCTGTGTCTT CCCCGACACC GCGTACCCGC CGACGACGGT CGCCGAATGG
GTCGATCACA TGGAGACCGC CAGAGCGTAC GGCATCAACC ACTATCGGTT CCACAGCTGG
TGCCCGCCGG AGGCGGCCTT CGAGGCGGCC GACCGCGTCG GGATGTACCT GCAACCGGAG
TGTTCCCAGT GGAACAACGG CACCTCGCTC GCCGACGCCG ACGACTACGA GTACTACGAG
CGGGAGGCCG AGCGCATTCT CGACGCCTAT GGACACCACC CCTCGTTCGT CTGCTTCACC
CTCGGCAACG AAAATCGGGG CGACGAGGAG CGAATGACCG AACTCGTTCG CCACTGTCGA
GCGCTCGACG ACCGACGGCT GTACGCCTAC GGGGCGAACA ATTTTCTTAC CTCACCGCAC
CCGGGAGACG CCGACGACTT CTTTGTTACC GCCAACGTCC CCGAAGACGC CGAGGCCAGT
CACTGGGACG TCGATCGAAC GCCCATCCGC GGGACCGGAC ACGTCAACGA TTCTCCGCCG
TCGACGGCCG TCGACTACGA GGACGAACTC GAGCCCTACG ATCTCCCCGT CGTCGGTCAC
GAGATCGGCC AGTACCAGGT CCATCCCGAC TACGACGAGA CGCGCAAGTA TCGCGGCGTC
CTCCGGGCTC GCAACCTCGA ACGGTTCGAG CGCTCGCTCG CCGATCGCTA TCTCGAAGGC
TCGGACGCGG ATTTTCGGGC CGCGTCCGGC GCGCTCGCAG TCACCTGTTA CCGCGAAGAG
ATCGAGGCCG CGTTCCGAAC GGCGGGCTTC GGCGGCTTTC AATTGCTCGG CCTCGACGAC
TTTCCGGGCC AGGGGACCGC GATGGTCGGG ATCCTCGACT CCTTCATGGA GTCGAAGGGA
CTGATCGAGC CCCACGAGTG GCGTCGCTTC TGTAGCGCAC GCGTCCCGCT CCTCTCGTTC
GACCGGTACA CCTTCACGAC CGACGACGCG TTCGAGGCCG AGTCGCTGCT CGCCAATTAC
GGCCCCGGCC CGATCGCGGA CGCGACCGCG ACCTGGTCGA TCGCCGAACT CGACGGCACG
GAAACCGCTT CCGGAGCCCT CGAGCGAGAC GTCCTCGAGC AGGGGGAGCT GACGTCCCTC
GGGACGATCG ACGCCCCGCT CGAGGACGTC GACGCGCCTG CCCGGTTCGA GGTAACGCTC
GCGGTCGAGG GAGCGGACCT CGAGGACGGC TCCGCCGTCG ATCTCCGGAC GACCTACCCG
ATCTGGGTCT ATCCCGAGAC ACCCGAAATA GCCGCCAACA CAGACGATAT CGACGTTTCC
TGGCGCTTCG ACGAGCGAAC TCGCACGCGA CTGGAGGACG GCGAGACGGT CCTCTTACTG
CCGGAACCGA GCGCGCTCCG GTACGGTCTC GAGGGCTCGT TCCAGCCGGA CTTCTGGAAC
TACGAGATCT TCAAGCAAAA CGGAAAGCCG GGGACCCTCG GAATGACGAC GGACCCCGAT
CATCCGCTGT TCGACGCCTT CCCGACCGAG GACTACGCCG ACTGGCAGTG GTGGCCCCTC
CTGCGTCACT CGCGGCCGAT CGTTCTCGAC GACGCGCCCG CCGACTTCGA ACCCACTGTA
CAGATAATCG ATACGATCTA CCGGAACCAC AAACTCGGCG TCTACTTCGA AACCGCCGTC
GGCGACGGAG CGCTCGCCGT CTGCACGCTG GATCTCTCCG GGGACGCTCC CGCCGTCCGG
CAGTTTCGAC ACAGCCTCGA GTCCTACCTC GCGTCCGAGG CGTTCGATCC GGATTCCGCC
CTCTCTACCG GCGTCCTCGA CAGCCTCCTC AACGCCGGAC GGGACGACGA ACGCGACTAC
GGGGACGACG CCGGCGCGTG GGTCGAACGC AACTGA
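
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks need to be removed before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the sequence are used as a short demo; running it on the full sequence would be the way to compare against the GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, used only as a short demo.
first_line = "ATGTGTCCTC CGGAGAAAGG GGTGCTCATG CAATCGATCT CGTTATCGGG ACCGTGGGCT"
last_line = "GGGGACGACG CCGGCGCGTG GGTCGAACGC AACTGA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

# The sequence starts with an ATG start codon and ends with a TGA stop codon.
print(len(head), head[:3], tail[-3:])  # 60 ATG TGA
print(round(gc_fraction(head), 2))
```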
 
Protein sequence
MCPPEKGVLM QSISLSGPWA LRLNPDRDRH VVDPPDDVID GTMYLPGSTD EYGYGTTVDD 
RPRDHLQRTH RYEGSAWYQR TVSVPDSWAG KRVTLTLERT RPTEVWVDGE RIGSRVCLST
PHVYDLTEAI GPGEHEIAVR VDNADDSMDR PGVRRSHAST EHTQTNWNGI IGDLRLEATP
NVWIESVDPF PEPSENAVDL EITLGNATET DFQGTVSAAA RSTTTDEIHV ADTVDRPVSV
SAGDGSAPGR TTLEFTYDLG SDALTWDEFS PAVYELTVSL ETAGDENAAG HEFETAFGLR
EFEADGTQFS INGTTIFLRG RTDCCVFPDT AYPPTTVAEW VDHMETARAY GINHYRFHSW
CPPEAAFEAA DRVGMYLQPE CSQWNNGTSL ADADDYEYYE REAERILDAY GHHPSFVCFT
LGNENRGDEE RMTELVRHCR ALDDRRLYAY GANNFLTSPH PGDADDFFVT ANVPEDAEAS
HWDVDRTPIR GTGHVNDSPP STAVDYEDEL EPYDLPVVGH EIGQYQVHPD YDETRKYRGV
LRARNLERFE RSLADRYLEG SDADFRAASG ALAVTCYREE IEAAFRTAGF GGFQLLGLDD
FPGQGTAMVG ILDSFMESKG LIEPHEWRRF CSARVPLLSF DRYTFTTDDA FEAESLLANY
GPGPIADATA TWSIAELDGT ETASGALERD VLEQGELTSL GTIDAPLEDV DAPARFEVTL
AVEGADLEDG SAVDLRTTYP IWVYPETPEI AANTDDIDVS WRFDERTRTR LEDGETVLLL
PEPSALRYGL EGSFQPDFWN YEIFKQNGKP GTLGMTTDPD HPLFDAFPTE DYADWQWWPL
LRHSRPIVLD DAPADFEPTV QIIDTIYRNH KLGVYFETAV GDGALAVCTL DLSGDAPAVR
QFRHSLESYL ASEAFDPDSA LSTGVLDSLL NAGRDDERDY GDDAGAWVER N
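
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which codons may act as starts; alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. The `translate` helper is hypothetical, accepts only unambiguous A/C/G/T bases, and stops at the first stop codon.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last line of the gene sequence from this record.
print(translate("ATGTGTCCTC CGGAGAAAGG GGTGCTCATG"))  # MCPPEKGVLM
print(translate("GGGGACGACG CCGGCGCGTG GGTCGAACGC AACTGA"))  # GDDAGAWVERN
```

Both outputs agree with the start and end of the protein sequence listed above.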