Gene Htur_4496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4496 
Symbol 
ID8745125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013745 
Strand
Start bp95195 
End bp96706 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content64% 
IMG OID646515033 
Productglycoside hydrolase family 43 
Protein accessionYP_003405980 
Protein GI284172598 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0241272 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTGATA GTCATGATCA CACAGGGTTG AATCGGCGAA GCGTACTGAA GACGATCGGC 
GCCGGCGCGC TCGGTGCGGG CGTCATCGCG ACGAGCGGGA CGACGGTCGC CGACGAGAGT
TCGACGCATT ACCACAATCC GGTCGGACCG GTCGGGTTCG GAGACGTAAC CGTCATTCAG
GCCGGCGACG GAACCTACTA CGCGTACGGA ACCGAGACGC CGGAGGACAT CGTTCCGATC
GCGACCTCGG ACGATCTGGT AGACTGGACG TACATCGACT CGGCGTTCGA CAGCTACCCC
GACTGGCGGG ACGATCCGGA CGCCGGGGTC TGGGCGCCCG ACATCAACTA CTACAACGGC
CAGTACTACC TCTATTACTC CTACTCGACG TGGGGGAGCC AGGACAATCC GGGGATCGGC
GTCGCGCTCT CGGACACACC GGACGGTCCG TTCGAGGATC AGGGGCCGGT GTTCAGGGCC
GAAGACCTCG GGATGACCAA CTGCATCGAC TCGGAGTTCC GCGTCGTCGA CGGCACTCCC
TACATGATCT GGGGGAGTTT CTACGGGTTC TACGGCGTCG AACTCACGAG CGACGGGATG
GACTACGTCC CGGATACGAC GTTCCACTTG GCGGGTGACA ATCGCGAAGG CCCGATGGTC
ATCGAAGAGA ACGGCTACTA CTACCTGTTC TACTCGACTG GCCACTGCTG TGAGGGGTAC
GACAGCACCT ACGAGGTCGA AGTCGGCCGC TCCGAATCGT TCTTCGGCCC CTACTACAAC
CAGAACGGGA CCGACCTGCG CGACCTGAAC GAGCACCGCA GCGGCGTGTC GGTCCTCAAC
GGGACGGACG AGTTCACCGG TCCGGGTCAC AACACCGCGA TCCAGGACGA GAACGGCGAC
TGGTGGATGC TCTACCACGT CGAGGCCACG GCGGACCAGG AGACCCGCAT CATGATGATC
GATCGAATCC AGTGGGAGAA CGGCTGGCCG GTCGTCGCCT GTGACGGGAC GCCGAGCACG
CAGAGCCCGA TGCCGAACAC CGGCAGCTAC GACTGCGGCG CCGTCACCAG CGGTATCGGC
ATCAGCGAGG GGACCTACGC GATCACGAAC GTCAACAGCG GCAAGCGCCT CGAGGTCGCC
AGCGCCGGAA CGAGTGACGG CGACAACGTT CAGCAGTACA GCGATACGGG ACACGCCTGC
CAGCAGTGGG ACGTAATCGA GACGGACGAC CACGAGACGT TCCACCTTCG AAACGTCAAC
AGCGGGAAGC TCATGGAGGT GGCCGGCGCC GACACGAGCG ACGGAGCGAC CGTCCAGCAG
TACGCCGACA CCGGGCACGC GACCCAAGAC TGGCACATCG TCGACAACGG TGACGGTACC
TACCGCATCG AGAACGCCAA CAGCGGCAAG GTCGCCGAGG TCAACGGTGC GTCGACGGAC
GACGGTGCCG ACGTCATCCA GTGGTCGTGG AACGGCGGCG CGAACCAGCG GTGGACGTTC
GATCTGGTGT AA
 
Protein sequence
MVDSHDHTGL NRRSVLKTIG AGALGAGVIA TSGTTVADES STHYHNPVGP VGFGDVTVIQ 
AGDGTYYAYG TETPEDIVPI ATSDDLVDWT YIDSAFDSYP DWRDDPDAGV WAPDINYYNG
QYYLYYSYST WGSQDNPGIG VALSDTPDGP FEDQGPVFRA EDLGMTNCID SEFRVVDGTP
YMIWGSFYGF YGVELTSDGM DYVPDTTFHL AGDNREGPMV IEENGYYYLF YSTGHCCEGY
DSTYEVEVGR SESFFGPYYN QNGTDLRDLN EHRSGVSVLN GTDEFTGPGH NTAIQDENGD
WWMLYHVEAT ADQETRIMMI DRIQWENGWP VVACDGTPST QSPMPNTGSY DCGAVTSGIG
ISEGTYAITN VNSGKRLEVA SAGTSDGDNV QQYSDTGHAC QQWDVIETDD HETFHLRNVN
SGKLMEVAGA DTSDGATVQQ YADTGHATQD WHIVDNGDGT YRIENANSGK VAEVNGASTD
DGADVIQWSW NGGANQRWTF DLV