Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4649 |
Symbol | |
ID | 8745252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 229586 |
End bp | 230893 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 646515160 |
Product | glycoside hydrolase family 4 |
Protein accession | YP_003406107 |
Protein GI | 284172725 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAAGG TCGCGTTCAT TGGAGCGGGA AGTATGGTCT TCGCCAAGAA CCTAGTCGGA GACATCCTCT CGTTCGAAGC GCTCAAGGAC AGCACGATCG CGCTCATGGA TATCGACGAG CACCGCCTCG CTCAGACAAC CGAGGTTGCC GAACAGATAG TCGAGAACAG TCAGATCGAC GCAACGATCG AGTCGACGAC TGATCGCCGC GAGGCACTCG ACGGTGCTGA CTATGTGCTC AACATGATCA ATGTCGGTGG GACGGAACCA TTCGAAAATG AGATCCGCAT TCCCGAGGAG TACGGCGTCA AGCAATCGAT CGGGGATACA CTGGGACCAG GGGGAATCTT CAGGGGACTC CGGACGATTC CCACGATGCT CGACATCGCT CGAGACATGG AGGAGCTCTG TCCTAACGCA TTGCTTATGA ACTACACCAA TCCCATGGCG ATCGTCTGCT GGGCTGTAGA CGAGGCCACA GATATAGATA TCGTCGGACT CTGCCATAGC GTCCCCCACA CCGCGGAGGC GATTGCTGAG TACGCCGACA TCCCGCTGGA GGAACTCAAT TACTGGGTCG CCGGAATCAA CCATATGGCG TGGTTCCTTG AGTGTGACTG GGACGGACAA GACATCTATC CGCTGCTCGA GGATGCGACA GACGACGAGG AGACATATCG GAAGGACACC GTCCGGTTCG AGTTGCTGAA ACACTTCGGT GCTTTCGTCA CTGAATCGAG CCACCACAAC TCGGAATACC TTCCCTACTT CCGCACGGAC GAAGATCTCA TCGACGAGTT GACGGGGACG AACTACGCCG AGCGCATGTC AACGGCGACG TACCTTGAGG GTTGGAAGAA ACGCTCTGAG GAACGGGACG ACGCGCTGAC CGGCGTCAAT CCCGACGATG TCTCAATCGA GCGCTCCGAG GAGTACGCCT CGCGGCTGAT CCACTCGATC GAGACAGACA CGCCGCGACG GCTCAACTTG AACGTGCGAA ACGAAGCAGG TCACATCCAG AACTTGGAGA ACGACGCCTG CATTGAAGTG CCCTGTCTGG TGGACGGCAC GGGAATTCGT CCGTGTTCAG TCGGCGAGCT GCCACCGCAG CTCGCCGCGC TCAACCGAAC GAACGTGAAC GTTCAGCGCC TCGCGGTCGA GGGTGCGCTT AAGGGCGACC GCGACGTCGT TCACCAGGCC GTCAAACTTG ATCCGTTAAC GGCGGCCGAG CTCGACCTCG ATGAGATTCA CGAGATGACC GAGGAACTGA TCGCAGCGAA CAAAGCATAT CTGCCGGCCC TCGACTAA
|
Protein sequence | MPKVAFIGAG SMVFAKNLVG DILSFEALKD STIALMDIDE HRLAQTTEVA EQIVENSQID ATIESTTDRR EALDGADYVL NMINVGGTEP FENEIRIPEE YGVKQSIGDT LGPGGIFRGL RTIPTMLDIA RDMEELCPNA LLMNYTNPMA IVCWAVDEAT DIDIVGLCHS VPHTAEAIAE YADIPLEELN YWVAGINHMA WFLECDWDGQ DIYPLLEDAT DDEETYRKDT VRFELLKHFG AFVTESSHHN SEYLPYFRTD EDLIDELTGT NYAERMSTAT YLEGWKKRSE ERDDALTGVN PDDVSIERSE EYASRLIHSI ETDTPRRLNL NVRNEAGHIQ NLENDACIEV PCLVDGTGIR PCSVGELPPQ LAALNRTNVN VQRLAVEGAL KGDRDVVHQA VKLDPLTAAE LDLDEIHEMT EELIAANKAY LPALD
|
| |