Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1511 |
Symbol | |
ID | 4601107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1459316 |
End bp | 1460698 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639774286 |
Product | Alpha-galactosidase |
Protein accession | YP_920911 |
Protein GI | 119720416 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.408355 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCGCGCG TGCCGAAAAT AAAAATAAGC GTTATAGGCG CAGGTAGCGT TGCGTGGAGC TCTAAGCTCG TACACGACCT CCTGCACACG CCCTCCCTGT ACGGTAGCGA CGTGTACTTC ATGGATATAA ACGAGGAGAG GCTACGCATC CTCAGGGGTC TCGCGGAGAA GTACAAGTCG GAGGTCGGGG CGGAGTACAA CTTCTTCTAC ACGACGGACA GGAAGGAGGC TGTGAGGGAC TCGGACGTCG TGATAAACAC GGCGATGTAC GGTGGGCACA GGTACTACGA GCTTATGCGC GAGGTTAGCG AGAAGCACGG CTACTACCGG GGCGTGAACA GCGTTGAGTG GAACATGGTG AGCGACTACC ACACGATATG GGGCTACTAC CAGTGGAAGC TCGCGATGGA CATAGCGAGG GACGTCGAGG AGCTGGCGCC GGGCGCCTGG CTGATCCAGA TGGCGAACCC CGTCTTCGAG CTTACAACAC TGATTTCGCG GGAGACGAGG GTCAAGGTCG TAGGGCTGTG CCACGGGCAC CTGGGCTACA AGGAGATAGC GCAGACGATA GGGCTCGACC CGGCGAGGGT GGGCTTCGAG GCTATAGGCT TCAACCACGT GATATGGCTG ACGAAGTTCA CCTACGACGG GGAGGACGCG TACCCGCTCA TAGACAGGTG GGTAGAGGAG AAGGCGGAGC AGTACTGGTC GCAGTGGAGG ATGAGGCAGG TAAACCCCTT CGACATACAG ATGTCCCCTG CGGCTGTCGA CATGTACAGG CGCTACGGGC TCTTCCCGGT AGGCGACACC GTCAGAGGAG GTACCTGGAA GTACCACTCG AGCCTCAAGA CGAAGCAGTA CTGGTACGGC CCTACGGGCG GCCCCGACAG CGAGATAGGC TGGGCTCTCT ACACCGCGCA CCAGGAGTAC TGGCTGGCAA CGCTGGCCCA GGCGGCCTCC GACCCGAGGA TACCGGTCTC CTCCCTCTTC CCGCAGACCA GGACCGAGGA GAGTGTCGTC CCGCTAATCG AGAGCCTCAT GCTCGACAAG CCGGGCGAGT ACCAGGTCAA CGTGCTCAAC GGCAACGCGA TAGAAGGCAT ACCCAGCAAC GTGGCAGTCG AGGTGCCGGC CAGGGTGGAC GCCCGCGGTA TACACCCGAA GACCGGGCTG AGGCTACCGA GGAAGATACT CTCGCTCGTC ATGCAGCCCA GGCTGCTCCG CGCGGAGATG GCGATAGCCG CCTTCCTGGA GGGCGGGAGG CAGTTCCTTA TAGACTGGCT CATGCTGGAC CCGAGGACGA GGAGCGAGGA GCAGGCAGAG AAGGTTTGGG AGGAGATACT ATCGCTACCC GGGAACGAGG AGATGAAGAG GCACTACAGC TAG
|
Protein sequence | MSRVPKIKIS VIGAGSVAWS SKLVHDLLHT PSLYGSDVYF MDINEERLRI LRGLAEKYKS EVGAEYNFFY TTDRKEAVRD SDVVINTAMY GGHRYYELMR EVSEKHGYYR GVNSVEWNMV SDYHTIWGYY QWKLAMDIAR DVEELAPGAW LIQMANPVFE LTTLISRETR VKVVGLCHGH LGYKEIAQTI GLDPARVGFE AIGFNHVIWL TKFTYDGEDA YPLIDRWVEE KAEQYWSQWR MRQVNPFDIQ MSPAAVDMYR RYGLFPVGDT VRGGTWKYHS SLKTKQYWYG PTGGPDSEIG WALYTAHQEY WLATLAQAAS DPRIPVSSLF PQTRTEESVV PLIESLMLDK PGEYQVNVLN GNAIEGIPSN VAVEVPARVD ARGIHPKTGL RLPRKILSLV MQPRLLRAEM AIAAFLEGGR QFLIDWLMLD PRTRSEEQAE KVWEEILSLP GNEEMKRHYS
|
| |