Gene Information |
Locus tag | Htur_4787 |
Symbol | |
ID | 8745377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 405476 |
End bp | 406906 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646515285 |
Product | glycoside hydrolase family 4 |
Protein accession | YP_003406232 |
Protein GI | 284172850 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
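
The coordinate, length, and protein-length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; both assumptions agree with the values in this record but are not stated by it. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values copied from the record for Htur_4787.
length_bp = gene_length(405476, 406906)
print(length_bp)                  # 1431, matching "Gene Length"
print(protein_length(length_bp))  # 476, matching "Protein Length"
```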
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0488645 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATCAAC TCGGTAGTCG TTCGATCGAG GAGATCCCTC GCGTGAAGAT CGGATACGTC GGGGGCGGCA GCCAGGGGTG GGCCCACACC CTCATCAACG ATCTCGCGCA GTGTGGCGAC ATCGCCGGAT CGGTGGCGCT GTACGACGTC GACCACGAGG CCGCGACGAA GAACGCCGAA CTTGGCAATC GGATCGTCGA GCGCGAGGAC GCCGACGGCG ACTGGACGTT CGAGGCCTAC CGCGAGATGG ACGACGCGCT CGCGGACGCC GACTTCGTCG TCTGCTCGAT CCAGGACCCG CCCGCGGAGA CGTTCGTCCA CGACATCGAC GTTCCCAAAC AGTACGGCAT CCACCAGCCG GTCGCCGACA CCGTCGGTCC CGGCGGGGTC CTCCGCTCGA TGCGGGCGAT CCCGCAGTAC CGCGAGATCG CGGCGACGGT TCGCGAACAG TGTCCCGATG CGTGGGTAAT CAACTACACC AACCCGATGA CCGTCTGCAC TCGGACGCTC TACGAGGAGT ACCCCGACAT CAACGCGATC GGGCTCTGCC ACGAGGTGTT CAAGTTCCAG GAGCAGTTCG CCGACATCGC CGAGCGGTAC GTCGACGACG CCGAGGACGT CGCCCGCGAG GAGATCCACG TCACCGTCAA GGGGATCAAC CACTTCACGT GGATCGACGA GGCCCGATGG CGCGATACCG ACCTGTTCGG CTACCTCGAG GCCGAACTCG AGGAGCGGAA ACCGCTGAAG GACTTCGATC CGGGTTCGAT GGCCGACGCG TCCTACTGGG TCAACAACTA CAACGTCGCC TTCGACCTCT ACGACCGGTT CGGCCTGCTC GGCGCGGCCG GCGACCGCCA CCTCGTCGAG TTCGTCCCGT GGTACCTCCA GCTCGACGAC CCCGAGGACC TCCATCGATG GGGGATCCGG TTCACCCCGA GTTCGGCTCG CCTCCCCGAC GACGACGGGC CGACGCAGAC CGAGCGGTAC CTCTCTGGCG ACGAGGAGTT CGAGTTCTAC GACTCCGGCG AGGAGGCCGT CGACATCTTC CGGGCCCTGC TGGGACTCGA GCCCGTCGAG ACCCACCTGA ACTACCCCAA CGAGGGGCAG GTCGCGGGGC TGCCCGAGGG CGCCGTCGTC GAGACGAACG CGTTGCTCAC CGGCGACGAC GTCTCGCCGC TGGCCGCCGG CTCGTTCCCT CGCGAAATCC GATCAATGGT GATGACCCAC GTGAACAACC AGGAGACGCT CGTCGAGGCC GGGTTCGAGG GCGACCTCGA TCGGGCGTTC CGGGCGTTCC TCAACGATCC GCTCGTCTCG ATCGAACGCG ACGCCGCCGC GGACCTTTTC GTCGAACTCG TCGACCGCGA ACGCGACTAC CTCGAGGTGT GGGACCTCGA GGACGCCGAC GTCCTCGCGG CGTCGCGCTG A
Protein sequence | MHQLGSRSIE EIPRVKIGYV GGGSQGWAHT LINDLAQCGD IAGSVALYDV DHEAATKNAE LGNRIVERED ADGDWTFEAY REMDDALADA DFVVCSIQDP PAETFVHDID VPKQYGIHQP VADTVGPGGV LRSMRAIPQY REIAATVREQ CPDAWVINYT NPMTVCTRTL YEEYPDINAI GLCHEVFKFQ EQFADIAERY VDDAEDVARE EIHVTVKGIN HFTWIDEARW RDTDLFGYLE AELEERKPLK DFDPGSMADA SYWVNNYNVA FDLYDRFGLL GAAGDRHLVE FVPWYLQLDD PEDLHRWGIR FTPSSARLPD DDGPTQTERY LSGDEEFEFY DSGEEAVDIF RALLGLEPVE THLNYPNEGQ VAGLPEGAVV ETNALLTGDD VSPLAAGSFP REIRSMVMTH VNNQETLVEA GFEGDLDRAF RAFLNDPLVS IERDAAADLF VELVDRERDY LEVWDLEDAD VLAASR
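
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before use. The following is a minimal sketch of how the gene sequence could be stripped of whitespace, translated, and checked for GC fraction. It uses the standard codon assignments, which translation table 11 shares for all non-start codons; the helper names (`clean`, `translate`, `gc_fraction`) are illustrative, not taken from any library.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); identical for the
# standard code and table 11, which differ only in permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate frame 1 until the first stop codon; trailing partial codons are ignored."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last three blocks of the gene sequence from this record.
print(translate("ATGCATCAAC TCGGTAGTCG TTCGATCGAG"))  # MHQLGSRSIE
print(translate("GTCCTCGCGG CGTCGCGCTG A"))           # VLAASR
```

Passing the full gene sequence to `translate` and `gc_fraction` should allow the protein sequence and the GC content field to be compared against the record in the same way.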