Gene Htur_4787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4787 
Symbol 
ID8745377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013745 
Strand
Start bp405476 
End bp406906 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content66% 
IMG OID646515285 
Productglycoside hydrolase family 4 
Protein accessionYP_003406232 
Protein GI284172850 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0488645 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCAAC TCGGTAGTCG TTCGATCGAG GAGATCCCTC GCGTGAAGAT CGGATACGTC 
GGGGGCGGCA GCCAGGGGTG GGCCCACACC CTCATCAACG ATCTCGCGCA GTGTGGCGAC
ATCGCCGGAT CGGTGGCGCT GTACGACGTC GACCACGAGG CCGCGACGAA GAACGCCGAA
CTTGGCAATC GGATCGTCGA GCGCGAGGAC GCCGACGGCG ACTGGACGTT CGAGGCCTAC
CGCGAGATGG ACGACGCGCT CGCGGACGCC GACTTCGTCG TCTGCTCGAT CCAGGACCCG
CCCGCGGAGA CGTTCGTCCA CGACATCGAC GTTCCCAAAC AGTACGGCAT CCACCAGCCG
GTCGCCGACA CCGTCGGTCC CGGCGGGGTC CTCCGCTCGA TGCGGGCGAT CCCGCAGTAC
CGCGAGATCG CGGCGACGGT TCGCGAACAG TGTCCCGATG CGTGGGTAAT CAACTACACC
AACCCGATGA CCGTCTGCAC TCGGACGCTC TACGAGGAGT ACCCCGACAT CAACGCGATC
GGGCTCTGCC ACGAGGTGTT CAAGTTCCAG GAGCAGTTCG CCGACATCGC CGAGCGGTAC
GTCGACGACG CCGAGGACGT CGCCCGCGAG GAGATCCACG TCACCGTCAA GGGGATCAAC
CACTTCACGT GGATCGACGA GGCCCGATGG CGCGATACCG ACCTGTTCGG CTACCTCGAG
GCCGAACTCG AGGAGCGGAA ACCGCTGAAG GACTTCGATC CGGGTTCGAT GGCCGACGCG
TCCTACTGGG TCAACAACTA CAACGTCGCC TTCGACCTCT ACGACCGGTT CGGCCTGCTC
GGCGCGGCCG GCGACCGCCA CCTCGTCGAG TTCGTCCCGT GGTACCTCCA GCTCGACGAC
CCCGAGGACC TCCATCGATG GGGGATCCGG TTCACCCCGA GTTCGGCTCG CCTCCCCGAC
GACGACGGGC CGACGCAGAC CGAGCGGTAC CTCTCTGGCG ACGAGGAGTT CGAGTTCTAC
GACTCCGGCG AGGAGGCCGT CGACATCTTC CGGGCCCTGC TGGGACTCGA GCCCGTCGAG
ACCCACCTGA ACTACCCCAA CGAGGGGCAG GTCGCGGGGC TGCCCGAGGG CGCCGTCGTC
GAGACGAACG CGTTGCTCAC CGGCGACGAC GTCTCGCCGC TGGCCGCCGG CTCGTTCCCT
CGCGAAATCC GATCAATGGT GATGACCCAC GTGAACAACC AGGAGACGCT CGTCGAGGCC
GGGTTCGAGG GCGACCTCGA TCGGGCGTTC CGGGCGTTCC TCAACGATCC GCTCGTCTCG
ATCGAACGCG ACGCCGCCGC GGACCTTTTC GTCGAACTCG TCGACCGCGA ACGCGACTAC
CTCGAGGTGT GGGACCTCGA GGACGCCGAC GTCCTCGCGG CGTCGCGCTG A
 
Protein sequence
MHQLGSRSIE EIPRVKIGYV GGGSQGWAHT LINDLAQCGD IAGSVALYDV DHEAATKNAE 
LGNRIVERED ADGDWTFEAY REMDDALADA DFVVCSIQDP PAETFVHDID VPKQYGIHQP
VADTVGPGGV LRSMRAIPQY REIAATVREQ CPDAWVINYT NPMTVCTRTL YEEYPDINAI
GLCHEVFKFQ EQFADIAERY VDDAEDVARE EIHVTVKGIN HFTWIDEARW RDTDLFGYLE
AELEERKPLK DFDPGSMADA SYWVNNYNVA FDLYDRFGLL GAAGDRHLVE FVPWYLQLDD
PEDLHRWGIR FTPSSARLPD DDGPTQTERY LSGDEEFEFY DSGEEAVDIF RALLGLEPVE
THLNYPNEGQ VAGLPEGAVV ETNALLTGDD VSPLAAGSFP REIRSMVMTH VNNQETLVEA
GFEGDLDRAF RAFLNDPLVS IERDAAADLF VELVDRERDY LEVWDLEDAD VLAASR