Gene Htur_4504 details


Gene Information

Locus tag: Htur_4504
Symbol:
ID: 8745133
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013745
Strand:
Start bp: 105537
End bp: 107069
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 64%
IMG OID: 646515041
Product: glycoside hydrolase family 43
Protein accession: YP_003405988
Protein GI: 284172606
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3507] Beta-xylosidase
TIGRFAM ID:
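
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the arithmetic is consistent with it) and that the last codon of the CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 105537
END_BP = 107069
GENE_LENGTH_BP = 1533
PROTEIN_LENGTH_AA = 510


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

For this gene, 107069 - 105537 + 1 = 1533 bp, and 1533 / 3 - 1 = 510 aa, matching the listed lengths.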


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGATA GTCATGATTG CGGTGATCCG AATCGACGTA CCGTGCTGCA AGCGATCGGC 
GCCGGCGCGA TCGGTACTGC GGGCGCGCTC GGGGGTAGCG GATTGGGGAG TGCCGACCAC
GGCGGCGAGC ATTACTATAA CCCGCTGTAC GAACCGCACT TTCCGGATCC GGAAGTTCAC
CGCGCTGACG ACGGTACGTG GTGGGCCTAC GGGACGAACA TGAACCGGGA AAATACCGAC
GACGAACTCC TCGTTCCGGT CCTCTCCTCG ACGGATCTGG TGAACTGGAC CTACGAGGGC
GAGGCGTTCG ACGCGCGACC CGGTTGGACC GAGGGCTCGA TCTGGGCACC CAACATCCAC
TACTACAACG ACGAGTGGAT CATGTTCTAC TCGCTCGAGC CGCGTCCGTG GGAGAGCGGC
GAGTTCGGTA TCGGCCTCGC GACGTCCGAC ACGCCGAGGG GACCGTTTAC CGATCACGGA
CAAGTCATCG GTGACAGCGA CACGGGCGGC GGAACCATCG ACGCCTACTT CGTCGAGTAT
CAGGGGACGC CGTACCTCTT CTGGGGGAGC TTCCAGGGGA TCTACGTCGC GGAACTGACG
CCCGACCTGC GGGACGTCGA TATGGCGACG GTCACGCAGG TCGCCGGCGA CGCCTACGAG
GGAACGATCC ACTACGAGCG CAATGGGTAC CACTATCTGT TCGTTTCGAC CGGAACCTGC
TGTGAAGGGC ACAGCAGCAC GTACGAGTAC GAAGTCGGCC GGTCGACGGA CTTCTTCGGG
CCGTACGTCG ATCAGAACGG CATCGACTTG ATGGAGTACA ACGAGTATAA CCAGGGGACG
CCGGTCCTCA CCGGTACCGA TCGGTTCCCG GGCGCAGCAC ACGGCGACAT CACGACGTAC
GACGACGGCT CGGAATGGCT GCTCTATCAC GCGTACGACG CGACCGATCC GGAGTTCATC
GACGGCGTCC CGCGGCGCGT GCTCATGATG GACCGGATCG ACTGGGAGAA CGGCTGGCCG
GTGATCGGCT GCGACGGGAC GCCGAGCGAG GTCTCCCCGG TACCGGGTAG CGGCACACAC
TGCGGCGATA ACGGCGGCGA CGGACCGCTT TCCGCGGGAA CGTACCGAAT CTCGAACGTC
AATAGCGGTC TACTGCTGGA GGTGGGGGAC GCGGACACCA CCGAGGGAGC GACCGTCAAT
CAGTGGTCGG ATACCGGTCA TCCGTGCCAG GAGTGGGAGC TCATCGAGCA CGACGACGGG
ACCTACCGAC TGGAGAACGT TCACTCCGGA CACGTGCTGT CGGTCGCAGA CGGTTCGACG
AGCGAGGGGG CCAGTTTGGT CCAGCGCGCC TGGGGGGACG CCGCCGATCA GCGCTGGCGT
CTCATCGAGG GCGACGGCTC GTATCGACTC GAGAACGGCG CCAGCGGGTA CGTCGCGGAC
GTGCTAGAGG CGTCGACCGA CGACGGCGCC GACGTCGTCC AGTGGAGTTG GCTGGACGGC
GATAACCAGC GGTGGAACTT CGACCCAGTC TGA
 
Protein sequence
MTDSHDCGDP NRRTVLQAIG AGAIGTAGAL GGSGLGSADH GGEHYYNPLY EPHFPDPEVH 
RADDGTWWAY GTNMNRENTD DELLVPVLSS TDLVNWTYEG EAFDARPGWT EGSIWAPNIH
YYNDEWIMFY SLEPRPWESG EFGIGLATSD TPRGPFTDHG QVIGDSDTGG GTIDAYFVEY
QGTPYLFWGS FQGIYVAELT PDLRDVDMAT VTQVAGDAYE GTIHYERNGY HYLFVSTGTC
CEGHSSTYEY EVGRSTDFFG PYVDQNGIDL MEYNEYNQGT PVLTGTDRFP GAAHGDITTY
DDGSEWLLYH AYDATDPEFI DGVPRRVLMM DRIDWENGWP VIGCDGTPSE VSPVPGSGTH
CGDNGGDGPL SAGTYRISNV NSGLLLEVGD ADTTEGATVN QWSDTGHPCQ EWELIEHDDG
TYRLENVHSG HVLSVADGST SEGASLVQRA WGDAADQRWR LIEGDGSYRL ENGASGYVAD
VLEASTDDGA DVVQWSWLDG DNQRWNFDPV
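
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before use. The following is a minimal sketch, not a tool used by the database, showing how the gene sequence could be cleaned, translated, and checked for GC content. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code; start-codon handling is ignored. All function names are illustrative.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons; stop codons are written as '*'."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        first, second, third = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * first + 4 * second + third])
    return "".join(protein)


# First and last blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGACGGATA GTCATGATTG CGGTGATCCG")
last_blocks = clean_sequence("GATAACCAGC GGTGGAACTT CGACCCAGTC TGA")

print(translate(first_blocks))  # MTDSHDCGDP
print(translate(last_blocks))   # DNQRWNFDPV*
```

The two outputs match the first and last ten residues of the protein sequence, with the final TGA codon read as the stop. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the record.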