Gene Information |
Locus tag | Htur_4484 |
Symbol | |
ID | 8745113 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 76912 |
End bp | 78231 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 646515021 |
Product | glycoside hydrolase family 4 |
Protein accession | YP_003405968 |
Protein GI | 284172586 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
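
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (which is what the listed gene length implies) and that the protein length excludes the terminal stop codon. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 76912
END_BP = 78231
GENE_LENGTH_BP = 1320
PROTEIN_LENGTH_AA = 439


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record: 78231 - 76912 + 1 = 1320 and 1320 / 3 - 1 = 439.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```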
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0542529 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCAGACA TCACTTTCAT CGGAGCCGGG AGTGTCGTGT TCGCGAAGAA CCTGATGACG GATATCTTTT CGTTCGAGCG CCTGCGAGAC AGCACGATCA CGCTGATGGA TATCGATCCG CAGCGGTTGG ATCGGGCCGC CGAGATCGGC GAGGCCATCG TCGATTCCCA CGATCTCCCG GGGGAGATCC GCGCGACGAC CGATAGGCGC GAGGCCCTCG AGGGCACTGA CTACGTCTTC AACATGATCA ACGTCGGCGG CGAAGCGCCG TTCGAGAACG AGATCCGAAT TCCCGAGGAG TACGGGGTCA AGCAGGCTGT CGGCGACACG CTCGGTCCCG GCGGCGTCTT CCGGGCGCTG CGAACCGCCC CGACGGTGCT CGATATCGCC CGCGATATGG AGGAACTGTG TCCCGACGCG CTCCTGTTGA ACTACACGAA CCCGATGGCG ATCCTCTGTT GGGCGGTCGA CGAGGCGACC GATATCGACG TGGTGGGACT CTGTCACAGC GTCCAGCACA CGACCGCGGC CATCGGCCGG TATCTCGACG TCCCCAGCGA AGAGCTCCAG CACTGGGTCG CCGGCATCAA CCACATGGCG TGGTACCTCG AGCTAGAACA CGACGGTCAG GACTGTTACC CGGCCCTGCG CGACGCCGCG GACGATCCCG ACGTCTACGC GCAGGATCCG GTCCGGTTCG ACGTGATGGA CCACTTCGGG GCGTTCATCA CGGAATCGAG CCACCACCTG AGCGAGTACC TCCCGTACTT CCGGACGGAC CAGGACGTCA TCGACGACCT GACCCCCGAG GAAGACTTTG GCCACTACAC GATCGAGTGG ATGCCGACGG GACGGTATCT CGAACACTGG CGCTCCTACC AGCCGGATCT CGAGGGCGAG GTCGAGATCA CCGAGGACGA CGTCTCGCTC GAGCGCTCTC CCGAGTACGG CTCTCGGATC GTCCACTCGA TCGAGACGGA CGAGGTGCGG CGCATGAATC TCAACGTTCG GAACGACACC GGCGCGATTT CGAACCTTCC GGACGACTCG TGCGTCGAGG TGCCGTGTCT GATCGACGGA CGGGGGATTC ACCCCTGTTC CGTCGGTGAC CTGCCGTCCC AGCTCGCGGC GCTGAACCGG ACGAATATCG GCGTTCAGGA ACGCGCGGTG ACCGCGATAC TCGAGCAAGA CGAAACCGCG CTTCGCCAGG CGGTGAAACT CGATCCGCTC ACCGCGGCCG AACTCGATCT CGAGACGATT GACGAGATGG TCGACGACCT GCTGGCCGTG AACGCCGAGT ATCTGCCCGA ACTGGACTGA
Protein sequence | MADITFIGAG SVVFAKNLMT DIFSFERLRD STITLMDIDP QRLDRAAEIG EAIVDSHDLP GEIRATTDRR EALEGTDYVF NMINVGGEAP FENEIRIPEE YGVKQAVGDT LGPGGVFRAL RTAPTVLDIA RDMEELCPDA LLLNYTNPMA ILCWAVDEAT DIDVVGLCHS VQHTTAAIGR YLDVPSEELQ HWVAGINHMA WYLELEHDGQ DCYPALRDAA DDPDVYAQDP VRFDVMDHFG AFITESSHHL SEYLPYFRTD QDVIDDLTPE EDFGHYTIEW MPTGRYLEHW RSYQPDLEGE VEITEDDVSL ERSPEYGSRI VHSIETDEVR RMNLNVRNDT GAISNLPDDS CVEVPCLIDG RGIHPCSVGD LPSQLAALNR TNIGVQERAV TAILEQDETA LRQAVKLDPL TAAELDLETI DEMVDDLLAV NAEYLPELD
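
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they might be processed, the code below strips the spacing, computes a GC fraction, and translates a coding sequence up to the first stop codon. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because this gene starts with ATG). All function names are illustrative.

```python
# Standard codon-to-amino-acid assignments, built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(grouped: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(grouped.split()).upper()


def gc_fraction(grouped: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(grouped)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(grouped: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains characters other than A, C, G, T.
    """
    dna = clean_sequence(grouped)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
fragment = "ATGGCAGACA TCACTTTCAT CGGAGCCGGG"
print(translate(fragment))  # MADITFIGAG, the first block of the protein sequence
```

Running `gc_fraction` and `translate` on the full gene sequence should allow the listed GC content and protein sequence to be checked against the record.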