Gene Information |
Locus tag | Htur_4675 |
Symbol | |
ID | 8745276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | + |
Start bp | 262032 |
End bp | 263657 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 646515184 |
Product | Alpha-galactosidase |
Protein accession | YP_003406131 |
Protein GI | 284172749 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
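The start, end, gene length, and protein length above are linked by simple arithmetic. The sketch below checks that relationship. It assumes 1-based, inclusive coordinates and a single terminal stop codon, neither of which the record states explicitly. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(262032, 263657)
print(length, protein_length_aa(length))  # 1626 541
```

Both values agree with the Gene Length and Protein Length fields of this record.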
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTGG CAGCGGCGAC GGGAGCCAGT AGTCTCCTCG CGAACTCGAC GGTTACCGCC GACGACAACA GCGACCGGAG CGAGAGTGCA GACAAGACAT TGGTGGAATC CCCACCGATG GGATGGAACA GCTGGAACAC GTTCTACTGT GACATCGACG AGGAATTGAT CAAGGACGCC GCCGATGCAA TGGCCGAAAG CGGGATGAAA GAAGCGGGCT ACGAGTACGT CTGTATCGAC GATTGCTGGA TGGCACCCGA ACGCGACGCG AACGGGAAAC TCCAACCAGA TCCCGAGACG TTTCCGAACG GCATCAGTGC TCTCGCCGAT TATGTTCACG ACAAAGGCCT CAAACTGGGT ATCTACGAAT CAGCGGGGAC GACGACGTGT CAGGGCCTTC CCGGCAGCCT CGGTTACGAA GAAACCGATG CACAGACGTT CGCTGACTGG GGAGTTGACT TCCTCAAATA CGATAACTGC GGGGACCATT ACGGCCTATC GGCGGTTGAG CGCTATACGC GAATGCACAA CGCGCTCGAA GCCGTTGATC GGGATATTAT TTTCAGCATC TGTGAATGGG GAGATAACGA TCCGTGGATG TGGGCCCCAG AGGTAGGCGG CGACCTCTGG CGGACAACTG GCGATATTAA ACCCCTCTGG AGGGCCCAAG AGGATCTGTG GGGGAACGGC ATTATCGACA TCATCGATCA GAATGAGCCC CTCGCCGAAT ACGCCGGTCC CGGTCGCTGG AATGATCCGG ACATGCTCGT GGTCGGCGTG GACCTGCCGG AGTATCCAAA CCTAACCGAA GCGGAAGACC GAACGCACTT CGGCATGTGG GCGATGATGG CCGCGCCGCT CATGGCTGGT AACGACATTC GCAATATGTC CGACGAGACT CGCGATATTC TCACTAACGA CGAGGTGATC GCGATCGATC AGGATCCGGC GGGCAATCAG GCGACGCGGA TTCAACACAT CAGGGGTGAG GACGGTCTCT CACGTTCAGT CTGGGCGAAA ACACTCGCGA ATGGGGATCG AGCAGTCGGG TTACTGAATC GTAGCGATAG GAGAACGACA GTTACGACCA GTGCTCAGGC GGTCGGACTT GAAGCTGCCT CTTGCTACGT TGCTCGCGAT CTCTGGAACG GAACCGACTG GCAGACCGCC GGTCTTATTA GTGCATCGGT TCCGTCACAT GGGCTTGCAT TATTCCGGGT CAGTAGTGGT AACCCGGACG GTACCAACCC ATTCGCAACG GTTTCACTCG GCGATACTGA AGCGACAGTC GCACCGGGTG AGGCAGTCAC TCGATCGCTG ACATTCACTA ATTATTCGCC AATGGCAATC GATAGCGTTC ACGTCACCTG TGATTCGCCC GATGGCTGGG AATCCGAGCC CACCTCAACG ACGTTTACCG ATATCGCGGC CGGACCAGCG ATTTCGGGTG CATCTGGTCC GCAAAACGAC GCCGCAACTG ACTGGATGAT ACGCCCCCCA AGAGATGCAC CGTCCAAAGA CTACGAACTT AGCGTAACCG CAGAATACGC AGATGGAGTC TCTATTGCAG AGCCATTTAC CGTAACTGTC GAGAATAACA CTGGCCATGA CTCTGATCCT GACTGA
|
Protein sequence | MKVAAATGAS SLLANSTVTA DDNSDRSESA DKTLVESPPM GWNSWNTFYC DIDEELIKDA ADAMAESGMK EAGYEYVCID DCWMAPERDA NGKLQPDPET FPNGISALAD YVHDKGLKLG IYESAGTTTC QGLPGSLGYE ETDAQTFADW GVDFLKYDNC GDHYGLSAVE RYTRMHNALE AVDRDIIFSI CEWGDNDPWM WAPEVGGDLW RTTGDIKPLW RAQEDLWGNG IIDIIDQNEP LAEYAGPGRW NDPDMLVVGV DLPEYPNLTE AEDRTHFGMW AMMAAPLMAG NDIRNMSDET RDILTNDEVI AIDQDPAGNQ ATRIQHIRGE DGLSRSVWAK TLANGDRAVG LLNRSDRRTT VTTSAQAVGL EAASCYVARD LWNGTDWQTA GLISASVPSH GLALFRVSSG NPDGTNPFAT VSLGDTEATV APGEAVTRSL TFTNYSPMAI DSVHVTCDSP DGWESEPTST TFTDIAAGPA ISGASGPQND AATDWMIRPP RDAPSKDYEL SVTAEYADGV SIAEPFTVTV ENNTGHDSDP D
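The protein sequence can be derived from the gene sequence by codon-wise translation. The sketch below is a minimal, standard-library-only illustration, not the database's own pipeline. It uses the standard amino acid assignments, which translation table 11 shares; table 11 differs only in its set of permitted start codons. The helpers `clean`, `gc_content`, and `translate` are hypothetical names introduced here. Alternative start codons are not handled, which is sufficient for this gene because it begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# These assignments are shared by the standard code and translation table 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate("ATGAAAGTGG CAGCGGCGAC GGGAGCCAGT"))  # MKVAAATGAS
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the Protein sequence and GC content fields of this record.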
|
| |