Gene Information |
Locus tag | Htur_4504 |
Symbol | |
ID | 8745133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | - |
Start bp | 105537 |
End bp | 107069 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 646515041 |
Product | glycoside hydrolase family 43 |
Protein accession | YP_003405988 |
Protein GI | 284172606 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3507] Beta-xylosidase |
TIGRFAM ID | |
|
|
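The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; neither convention is stated in the record, but both are consistent with the numbers shown. All variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 105537
end_bp = 107069
gene_length_bp = 1533
protein_length_aa = 510

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon, which is not
# translated, so the protein is one residue shorter than the codon count.
codon_count, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codon_count - 1

print(span_bp, remainder, expected_protein_aa)  # 1533 0 510
```

Under these assumptions the span matches the reported gene length, the length is a whole number of codons, and the expected protein length matches the reported 510 aa.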
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGATA GTCATGATTG CGGTGATCCG AATCGACGTA CCGTGCTGCA AGCGATCGGC GCCGGCGCGA TCGGTACTGC GGGCGCGCTC GGGGGTAGCG GATTGGGGAG TGCCGACCAC GGCGGCGAGC ATTACTATAA CCCGCTGTAC GAACCGCACT TTCCGGATCC GGAAGTTCAC CGCGCTGACG ACGGTACGTG GTGGGCCTAC GGGACGAACA TGAACCGGGA AAATACCGAC GACGAACTCC TCGTTCCGGT CCTCTCCTCG ACGGATCTGG TGAACTGGAC CTACGAGGGC GAGGCGTTCG ACGCGCGACC CGGTTGGACC GAGGGCTCGA TCTGGGCACC CAACATCCAC TACTACAACG ACGAGTGGAT CATGTTCTAC TCGCTCGAGC CGCGTCCGTG GGAGAGCGGC GAGTTCGGTA TCGGCCTCGC GACGTCCGAC ACGCCGAGGG GACCGTTTAC CGATCACGGA CAAGTCATCG GTGACAGCGA CACGGGCGGC GGAACCATCG ACGCCTACTT CGTCGAGTAT CAGGGGACGC CGTACCTCTT CTGGGGGAGC TTCCAGGGGA TCTACGTCGC GGAACTGACG CCCGACCTGC GGGACGTCGA TATGGCGACG GTCACGCAGG TCGCCGGCGA CGCCTACGAG GGAACGATCC ACTACGAGCG CAATGGGTAC CACTATCTGT TCGTTTCGAC CGGAACCTGC TGTGAAGGGC ACAGCAGCAC GTACGAGTAC GAAGTCGGCC GGTCGACGGA CTTCTTCGGG CCGTACGTCG ATCAGAACGG CATCGACTTG ATGGAGTACA ACGAGTATAA CCAGGGGACG CCGGTCCTCA CCGGTACCGA TCGGTTCCCG GGCGCAGCAC ACGGCGACAT CACGACGTAC GACGACGGCT CGGAATGGCT GCTCTATCAC GCGTACGACG CGACCGATCC GGAGTTCATC GACGGCGTCC CGCGGCGCGT GCTCATGATG GACCGGATCG ACTGGGAGAA CGGCTGGCCG GTGATCGGCT GCGACGGGAC GCCGAGCGAG GTCTCCCCGG TACCGGGTAG CGGCACACAC TGCGGCGATA ACGGCGGCGA CGGACCGCTT TCCGCGGGAA CGTACCGAAT CTCGAACGTC AATAGCGGTC TACTGCTGGA GGTGGGGGAC GCGGACACCA CCGAGGGAGC GACCGTCAAT CAGTGGTCGG ATACCGGTCA TCCGTGCCAG GAGTGGGAGC TCATCGAGCA CGACGACGGG ACCTACCGAC TGGAGAACGT TCACTCCGGA CACGTGCTGT CGGTCGCAGA CGGTTCGACG AGCGAGGGGG CCAGTTTGGT CCAGCGCGCC TGGGGGGACG CCGCCGATCA GCGCTGGCGT CTCATCGAGG GCGACGGCTC GTATCGACTC GAGAACGGCG CCAGCGGGTA CGTCGCGGAC GTGCTAGAGG CGTCGACCGA CGACGGCGCC GACGTCGTCC AGTGGAGTTG GCTGGACGGC GATAACCAGC GGTGGAACTT CGACCCAGTC TGA
|
Protein sequence | MTDSHDCGDP NRRTVLQAIG AGAIGTAGAL GGSGLGSADH GGEHYYNPLY EPHFPDPEVH RADDGTWWAY GTNMNRENTD DELLVPVLSS TDLVNWTYEG EAFDARPGWT EGSIWAPNIH YYNDEWIMFY SLEPRPWESG EFGIGLATSD TPRGPFTDHG QVIGDSDTGG GTIDAYFVEY QGTPYLFWGS FQGIYVAELT PDLRDVDMAT VTQVAGDAYE GTIHYERNGY HYLFVSTGTC CEGHSSTYEY EVGRSTDFFG PYVDQNGIDL MEYNEYNQGT PVLTGTDRFP GAAHGDITTY DDGSEWLLYH AYDATDPEFI DGVPRRVLMM DRIDWENGWP VIGCDGTPSE VSPVPGSGTH CGDNGGDGPL SAGTYRISNV NSGLLLEVGD ADTTEGATVN QWSDTGHPCQ EWELIEHDDG TYRLENVHSG HVLSVADGST SEGASLVQRA WGDAADQRWR LIEGDGSYRL ENGASGYVAD VLEASTDDGA DVVQWSWLDG DNQRWNFDPV
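The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence could be cleaned, translated, and checked for GC content. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database API.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
prefix = "ATGACGGATA GTCATGATTG CGGTGATCCG"
print(translate(prefix))  # MTDSHDCGDP
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the reported GC content and protein sequence to be checked in the same way.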