Gene Htur_4685 details


Gene Information

Locus tag: Htur_4685
Symbol:
ID: 8745281
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013745
Strand:
Start bp: 271521
End bp: 272570
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 65%
IMG OID: 646515189
Product: glycosyl hydrolase family 88
Protein accession: YP_003406136
Protein GI: 284172754
COG category: [R] General function prediction only
COG ID: [COG4225] Predicted unsaturated glucuronyl hydrolase involved in regulation of bacterial surface properties, and related proteins
TIGRFAM ID:
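
The start, end, gene length, and protein length fields are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(271521, 272570)
print(length, protein_length_aa(length))  # 1050 349
```

Under these assumptions the values agree with the Gene Length and Protein Length fields listed above.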


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.558962
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTATGG AACGTCTGCT AGAAGAACTG CTTCCAACCG TCGCCGAGTA CACGATCGAC 
CTCGATTTCG AGGATTTCAT CCAGGGAGAA CGGCCGATGG TGCTCCGCGG CCTGCTGGCG
ACGGACGACG ACGAACACAC CGAGATCGCT GAGCGATATA TCGACTGGGC CGTCCGATCG
CAGTCGAGCG ACGGGCTGAT GGCCTACGGA TCGATCGACG TCGTCCCGGA GTGGGACGAA
CACCGCATCT TCCGTCCCAT TCCGGACGCC GGAGCTGTCG CCGTCCTCGC ACTCGAGGCC
TACGAGAACG GCGGTCCCGA CTCCTATCTC GACGCCTGCC GACGACAGTT CGAGTACCTC
CACGAGGACG CGCCTCGGAG CGAGGACGGG GGGATTACCC ACCACATGGA CGACATCGAA
CTGTGGATCG ACGCCATCTA CATGATGTGT CCGTTCATGG CCCGCTACGG CCAGCTTACG
GACACGCCCG AAGCGATCGA CGAAGCCGTC GACCAGATTC TCGTTCACGC GAAGCGCCTC
CGTGACCCCC ATAGCGACCT CTACCGGCAC ATCTGGCGCG AGCAGCCCAA CAGCTACCCC
GACGGTTCGC TCTGGCTGCG CGGCAACGGC TGGTTCGCCA CGGGCGTCCT CGACACGTTA
GACTACGTGC CCGAAGACCA CCCCGAGCGA GATCGCCTCG AGGAACTCGT CCGTGACCAC
CTCCTGAGCA TGGCCGAGTA TCAAGACGCC AGCGGCTTCT GGCACCACCT CATCGACGAC
GACACGATGT ATCTGGAGAC CTCGGGGACC CTCCAGTACG CCTACGCCTT CACCGAGGCC
GTCGATCGCG GCCTGCTCGA GGAGGAGTAC CGCGAGGTCG CCGAGGACGC GATCGGTGCG
GCCAAGACGG TCGTGACCCC CGACGGCGCG GTCCAGCGCA ACGCGGCGAT GCCCGGCGGT
CCGGAGGCTC CGCAGGCGAT CAACCTCTAC GGACAGGGCT GGTTCCTGAT CGCCGGCAAG
CGGGTCCTCG AGTCGGACGC TGACGTCTGA
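
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before it can be used programmatically. The sketch below shows one way to strip the layout whitespace and compute a GC fraction of the kind reported in the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied with its block spacing.
first_line = "ATGCCTATGG AACGTCTGCT AGAAGAACTG CTTCCAACCG TCGCCGAGTA CACGATCGAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq) * 100))
```

Passing the full multi-line sequence to `clean_sequence` works the same way, since line breaks are treated as whitespace.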
 
Protein sequence
MPMERLLEEL LPTVAEYTID LDFEDFIQGE RPMVLRGLLA TDDDEHTEIA ERYIDWAVRS 
QSSDGLMAYG SIDVVPEWDE HRIFRPIPDA GAVAVLALEA YENGGPDSYL DACRRQFEYL
HEDAPRSEDG GITHHMDDIE LWIDAIYMMC PFMARYGQLT DTPEAIDEAV DQILVHAKRL
RDPHSDLYRH IWREQPNSYP DGSLWLRGNG WFATGVLDTL DYVPEDHPER DRLEELVRDH
LLSMAEYQDA SGFWHHLIDD DTMYLETSGT LQYAYAFTEA VDRGLLEEEY REVAEDAIGA
AKTVVTPDGA VQRNAAMPGG PEAPQAINLY GQGWFLIAGK RVLESDADV
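
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs in which start codons are permitted, which does not matter here because the gene starts with ATG). The `translate` helper is hypothetical; a library such as Biopython could be used instead.

```python
from itertools import product

# Codon table built in the conventional TCAG ordering; '*' marks stop codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(raw_dna: str) -> str:
    """Translate a DNA string (whitespace allowed) until the first stop codon."""
    dna = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence and its final line.
print(translate("ATGCCTATGG AACGTCTGCT AGAAGAACTG"))  # MPMERLLEEL
print(translate("CGGGTCCTCG AGTCGGACGC TGACGTCTGA"))  # RVLESDADV
```

Both outputs match the start and end of the protein sequence listed above.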