Gene Information |
Locus tag | Tpen_1110 |
Symbol | |
ID | 4601104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1047833 |
End bp | 1049386 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639773887 |
Product | glycoside hydrolase family protein |
Protein accession | YP_920512 |
Protein GI | 119720017 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | |
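
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive, and that the gene length counts the terminal stop codon, neither of which the record states explicitly. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1047833
END_BP = 1049386
GENE_LENGTH_BP = 1554
PROTEIN_LENGTH_AA = 517


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_bp = span_length(START_BP, END_BP)
computed_aa = coding_length_aa(computed_bp)
print(computed_bp, computed_aa)
```

Under these assumptions, 1049386 - 1047833 + 1 gives 1554 bp, and 1554 / 3 - 1 gives 517 aa, which agrees with the Gene Length and Protein Length fields.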
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCCCGA AGAGTTTCCT CTGGGGAGTA TCCCTAGCAG GCTTCCAGTT CGAGATGGGG GACCCCGCGG GGGAAGCTTT GGACCCTAAC ACCGACTGGT ACGTGTGGGT ACACGACGAG TACAACATAA GGGAGGGAAT AGTCAGCGGG GATCTGCCGG AGAAAGGGAT AGACTACTGG CACCTTTTCA GGGAGGACCA CTCTCTGGCG AAAAGCCTGG GGCTAAACGC CTACAGGCTT AACGTCGAGT GGAGCAGGGT GTTTCCGGAG CCGACGTTCA GCGTAGAGGT TGGGGTGGAA GAGGAGGACG GCGTTAAGAC CGGTATAGAC ATCGACGACT CCGACTTAGA GAAGCTGGAC AGCATTGCGA ACAAGAAGGC GGTGCAACAC TACAGGGAGG TCGTGGAGGA CCTCCGCGAG AAGGGCTTCT ACGTCATCCT CAACTTGGTC CACTTCACGC TTCCAACCTG GATCCACGAC CCTCTAACCG CGCGCGCCAC GAACGCGAAG AAGGGGCCAC TGGGCTACGC GGACCCCAGG TTCCCGGTGG AGTTCGCGAA GTTCGCCGCC TACGTTGCGG CGAGCTTCGG GGATCTCGTA GACGCGTGGT CAACGTTCAA CGAGCCGAGC GTGGTGACCG AGTCGGGCTT CCTGAAGAGG AGGGGGAAGT TCCCGCCCGG CATATTCAAC TTCGACGCGT ACAAGCGGGC TATGATCAAC ATCGCACAAG CACACCTACT GGCGTACATC GCTATCAAGA AGTTCGACAG GGTGAAAGCT TATTCTGACT CCGCGGAGTC AGCGTCCGTC GGAATTATAC ACAACATGAT ACCGTTCCAC CCCCTCGACC CCTCCAGGAA GCGCGACCGG GACGCATCTA TGGTAACACA CCACCTCCAT AACTCCTGGA TCCCGAACTC CCTTGTAAAC GGGTGGATAG ACAGGGACTT CGACCTCAAA CAGGAGCCCA GCGAAGTATT CGAGAAGTAC AAGTCGAGGC TTGACTGGAT GGGCATCAAC TACTACTCGA GGTCCGTCGT CAAGGGTAAG GTCAACCTCC TCAGGCCTGT AATCCCGTTC CCCGCGTTCC CCGTGCTCGT GAAGGGGTAC GGGTTTGAGT GTGCACCGAA CTCTCAGAGC CTGGCAGGGA GACCTACCAC GGACTTCGGG TGGGAAGTAT ACCCCGAGGG CATAGTAGAG GTTGTAAAAA TGGCAATGCA GTACAACGTT CCTCTACTCG TAACGGAGAA CGGGGTCGCA GACGCGCGGG ACGAGCTGAG GCCGCACTTC CTAGCCCTCC ACCTAAAGCT CCTCGAGGAC GCGTTGGAAA GCCGCGAGAT AAGCCTTAAA GGCTACCTTC ACTGGGCTCT GACGGACAAC TACGAGTGGG CGGATGGCTT CAGGATGCGC TTCGGCCTAT TCGAGGTAGA CCTCTCCAGC AAGAGAAGAG TGAAGCGCCC GAGCGCGGAT CTCTTTGCGA GGATAGTCTC GGAGGGGACT GTCCCAGACG AGGCGGTCAG GAAGGCGAGG GAAAAGCTTT CCGTCAACCT TTAA
|
Protein sequence | MFPKSFLWGV SLAGFQFEMG DPAGEALDPN TDWYVWVHDE YNIREGIVSG DLPEKGIDYW HLFREDHSLA KSLGLNAYRL NVEWSRVFPE PTFSVEVGVE EEDGVKTGID IDDSDLEKLD SIANKKAVQH YREVVEDLRE KGFYVILNLV HFTLPTWIHD PLTARATNAK KGPLGYADPR FPVEFAKFAA YVAASFGDLV DAWSTFNEPS VVTESGFLKR RGKFPPGIFN FDAYKRAMIN IAQAHLLAYI AIKKFDRVKA YSDSAESASV GIIHNMIPFH PLDPSRKRDR DASMVTHHLH NSWIPNSLVN GWIDRDFDLK QEPSEVFEKY KSRLDWMGIN YYSRSVVKGK VNLLRPVIPF PAFPVLVKGY GFECAPNSQS LAGRPTTDFG WEVYPEGIVE VVKMAMQYNV PLLVTENGVA DARDELRPHF LALHLKLLED ALESREISLK GYLHWALTDN YEWADGFRMR FGLFEVDLSS KRRVKRPSAD LFARIVSEGT VPDEAVRKAR EKLSVNL
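
As a rough consistency check between the two sequence fields, the gene sequence can be translated codon by codon and compared with the protein sequence. The sketch below is a minimal, assumption-laden example: it uses the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons), and the `translate` helper is hypothetical rather than taken from any library. Only the first 30 bases and the last 9 bases of the gene sequence are used here for brevity.

```python
BASES = "TCAG"
# Amino acid assignments in NCBI order (first, second, third base each cycle through TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon lookup once: e.g. CODON_TABLE["ATG"] is "M".
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks and the final nine bases of the gene sequence above.
first_blocks = "ATGTTCCCGA AGAGTTTCCT CTGGGGAGTA"
last_codons = "AACCTTTAA"
print(translate(first_blocks), translate(last_codons))
```

The first ten translated residues correspond to the opening block of the protein sequence (MFPKSFLWGV), and the final codons translate to NL followed by a TAA stop, matching the end of the protein sequence.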