Gene Htur_3895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3895 
Symbol 
ID8744523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp132719 
End bp134146 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content67% 
IMG OID646514479 
Productglycoside hydrolase family 4 
Protein accessionYP_003405426 
Protein GI284167148 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.206792 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCGAC TCGAGGAGAG AGTGCCGTCG CGGTCGCGTT CGTCGGTGAA GATCGGCTAC 
GTCGGCGGCG GCAGTCACGG CTGGGCGCAC ACGCTCATCA ACGACCTGCT CCAGTGCGAC
GATCTCGCGG GGACGGTATC GCTCTACGAC GTCGACTACG AAGCGGCCGA GCAGAACGCG
AGGCTGGCGA ACGGCCTCGC GGAGCGGTCG GACGCCAACG GGGCGTGGAC GTTCGAGGCG
CGTCGGGAGG TCGACGACGC GCTCGCGGAC GCCGACTTCG TCATCTGCTC GATTCAGGAT
CCGGTCGGGG AGACGTTCGT CCACGACATC GACGTCCCCC AGGAGTACGG CATTTACCAG
ACCGTCGCGG ACACGGTCGG CCCCGGCGGC GTCCTGCGCT CGCTGCGAGC GATCCCGCAG
TACCGCGAGA TCGCGGCGAC GGTCCGCGAA CAGTGTCCCG ACGCGTGGGT GATCAACTAC
ACGAACCCGA TGACGGTCTG TACGCGGGCG CTCTACGAGG AGTTCCCCGA CATCAACGCG
ATCGGGCTCT GCCACGAGGT GTTCGGTACC CAGCGGCTGC TGGCCGACAT CGCCGAGCGC
TACGTCGACG AGGCCGAAGA CGTCGCGGCC GACGAGATCG ACGTGAACGT CAAAGGGATC
AACCACTTCA CGTGGGTCGA CGAGGCCTAC TGGAACGGAC ACGACCTCTT CCAGTATCTC
GATCGCGAAC TCGAGGAGCG GAAACCGATT CCGGGGTTCG AACCCGGCGA ACTGAACGAC
GAGTCCTACT GGACGAACCA CCACCAGATC GCCTTCGATC TGTACGACCG GTTCGGGGTG
CTCGGCGCGG CGGGCGACCG CCACCTCGCC GAGTTCGTCC CCTGGTATCT CGACATCGAC
GAGCCCGAGG AGATCCAGCG CTGGGGGATC CGGCTGACCC CCAGTTCCGC CCGGACCGGC
GACAGCGAGG GGCCGGCGAA GATGGAACGA TACCTGTCCG GCGACGAGGA GTTCGAGTTC
ACCGAGTCCG GCGAGGAGGT CGTCGATATC ATGCGCGCGC TCGAGGGACT CGAGCCGATC
AAGACGCACG TCAACCACCC GAATCGGGGC CAGACGCCCG ACCTGCCGAC GGGCGCCGTC
GTCGAGACCA ACGCCGTCAT CACCGGCGGC GGCGTCGCGC CGATCACCGC CGGCGAACTC
CCCCGCGAAG TGCGGTCGAT GGTACTGACG GCCGTGCACA ACCAGGAGAC GCTTATCGAG
GCCGGCTTCG CCGGTGATCT GGACCTTGCC TTCCAGGCGT TCCTCAACGA ACCGCTGGTC
ACCATTCAGC GCGACGAGGC CCGCGACCTG TTCGCCGACC TCGTCGCCCT CGAGCGCGAC
TACCTCCGGG ACTACGACCT CGAGAACGCC GACGTCCTCG AGGGCTGA
 
Protein sequence
MHRLEERVPS RSRSSVKIGY VGGGSHGWAH TLINDLLQCD DLAGTVSLYD VDYEAAEQNA 
RLANGLAERS DANGAWTFEA RREVDDALAD ADFVICSIQD PVGETFVHDI DVPQEYGIYQ
TVADTVGPGG VLRSLRAIPQ YREIAATVRE QCPDAWVINY TNPMTVCTRA LYEEFPDINA
IGLCHEVFGT QRLLADIAER YVDEAEDVAA DEIDVNVKGI NHFTWVDEAY WNGHDLFQYL
DRELEERKPI PGFEPGELND ESYWTNHHQI AFDLYDRFGV LGAAGDRHLA EFVPWYLDID
EPEEIQRWGI RLTPSSARTG DSEGPAKMER YLSGDEEFEF TESGEEVVDI MRALEGLEPI
KTHVNHPNRG QTPDLPTGAV VETNAVITGG GVAPITAGEL PREVRSMVLT AVHNQETLIE
AGFAGDLDLA FQAFLNEPLV TIQRDEARDL FADLVALERD YLRDYDLENA DVLEG