Gene Information |
Locus tag | Tpen_1055 |
Symbol | |
ID | 4601441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 994302 |
End bp | 995558 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639773833 |
Product | extracellular solute-binding protein |
Protein accession | YP_920458 |
Protein GI | 119719963 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
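The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of any database API: the function names `gene_length` and `protein_length` are hypothetical, and it assumes 1-based inclusive coordinates, an unspliced CDS (as the record states), and a terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a whole number of codons; the stop codon,
    if present, does not contribute an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Tpen_1055.
length_bp = gene_length(994302, 995558)   # 1257
length_aa = protein_length(length_bp)     # 418
```

Both results agree with the Gene Length (1257 bp) and Protein Length (418 aa) fields listed in the record.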
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.404897 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGGCGC AGGAAAAGGG AAAGAAGGGA GTAAACAAGC TTCTTATCGC CGTTGCTGTA CTGGTTATCG TAGCACTAGC GGCTGTAGCC CTCTACCCGA TGTTCGCTCC TAAACCGGCG CCTAAACAGG TAAAGATAAC TATATGGACG GCGTGGACTG GCGGAGAGTA CGACGCACTT AAAGCCGTTA TAGACGATTT CAGGGCGAAG AACCCCAACT ACCAGATAGA CATAGTCAAC GTCCCCTTCG ACCAGCTCAA GAACAAGGTT ATACAGGCGG TACCGGTCGG CGAAGGGCCG GACCTCTTCA CCGGCCCGCA CGACTGGACG GGCGAGCTCG TCCAGGCGGG CGCTCTCGTA GACATCACGG ACAAAGTCTC CGCCTTTAAG GGAGAGTACA TGGAGAGCGC CCTCCAGGGA GTAACGCTGA AGGGCAAGAT CTACGGGCTA CCCGAGAGCA TCAAGCTACC GGCCTTGATA GTGAACAAGA AGCTCCTAGC CACTCCCCCG AAGACGCTCG ACGAGCTGTG GAGCATAATG GACCAGTTCA AGGCTAAGGG GATGTACGGC CTCGCGTACG ACGTGCAGAA CGCGTACTTC AGTAGCTGCT GGTTCTACGG GCTCGGAGCC TACTACCTCG ACCCCAACAC CCTGGAGACA GCGCTGGACA GCCCCGGCGC CGTCCAGGCG TTCCAGATAA TCGCCAAGTT CAGCAAGTAC CTCCCGCCGG ACATCAGCTA CGACATGATG ACCAACCTCT TCATGAACGG TAAAGCCGCG ATGGCCATAA ACGGTCCATG GTGGATCGGA GACCTCAAAA AGGCTTTCGG CGAGAACCTC GCGGACATCG AGATAACCCT CATACCGGCT ATAGACCCGG CGCACCCGGC CAGGCCCTTC ATGACGGTGG AGGCGGTCTT CGTGACTAAG AACGCCGCCG AGAGGGGCGT CCTCGACGAA GCAATAGCGC TGGCGCACTA CATAACGGGC GAAGCCTCCG TAAAGCTCGC CAAGATGGCT GGACACGTAC CCACGTGGAA GAACGCCATG AAGGACCCGG CAGTCTCCGG TGACAAGGTT ATCAGCGCGT TCTTCAAGCA GGCCGAGTAC GGCGTACCGA TGCCGAACGT ACCCGAGGTC GCGCAGATGT GGAACGTCGT GCCGAAGTAC ATAAGCCAGG TCTACCAGGG ACAGCTATCC CCGCAGGACG CGGCGAAGGC GGCGGCCCAG GAGCTAAGAG CGGCCCTGAA GAAATGA
|
Protein sequence | MAAQEKGKKG VNKLLIAVAV LVIVALAAVA LYPMFAPKPA PKQVKITIWT AWTGGEYDAL KAVIDDFRAK NPNYQIDIVN VPFDQLKNKV IQAVPVGEGP DLFTGPHDWT GELVQAGALV DITDKVSAFK GEYMESALQG VTLKGKIYGL PESIKLPALI VNKKLLATPP KTLDELWSIM DQFKAKGMYG LAYDVQNAYF SSCWFYGLGA YYLDPNTLET ALDSPGAVQA FQIIAKFSKY LPPDISYDMM TNLFMNGKAA MAINGPWWIG DLKKAFGENL ADIEITLIPA IDPAHPARPF MTVEAVFVTK NAAERGVLDE AIALAHYITG EASVKLAKMA GHVPTWKNAM KDPAVSGDKV ISAFFKQAEY GVPMPNVPEV AQMWNVVPKY ISQVYQGQLS PQDAAKAAAQ ELRAALKK