Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1675 |
Symbol | |
ID | 4600927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1620717 |
End bp | 1622258 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 639774448 |
Product | glycoside hydrolase family protein |
Protein accession | YP_921073 |
Protein GI | 119720578 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.256832 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACAA TGATGTTTCC GCAAAAATTT AGGTGGGGAG TCTCCGAGTC AGGTTTTCAG TTCGAGATGG GAGACGAGTA TAGGCGCTTC ATAGACACAA ACACCGATTG GTGGCACTGG GTACGTGACC CTCACAACAT CTCTTCCAGA CTTGTCAGTG GAGATCTGCC AGAAGACGGT ATAAACTACT TCGAACTATT TGGGAAGGAT CATGAATTGG CAAGGGAGCT GGGACTTAAC ACTTACAGGC TGGGAATAGA GTGGAGCAGA ATCTTTCCAC ATCCGACCTG GTTTATAGAG GTAGATTTTG AGAAGGATTC CTTGGGTTTC GTTAAGAGCG TAAGGATAGA CGAGGATACC CTTAGGGCTC TGGACAGGTA TGCTTGTCGG AAGGCCGTGC AGATGTACAG AGAAATACTG CTTGACTTAC GCAAAAGAGG ATTCAAAGTA ATCGTAAACC TTGTACACTT TACGTTACCT TACTGGATTC ACGACCCTAT AAGGGCGAAG TCTTCTGAGC TTTCTGAGGG ACCTTTAGGC CTCCTCGAGG AAAGCTTCCC TATTGAGATG GCGAAGTACG CAGCTTACGT TGCATGGAAG TTTGGCGACT TAGTAGATAT GTGGTCCACG TTTAATGAAC CAGTAGTACC TATAGAGCTA GGCTATCTAG GCACTTACAC AGGTTTTCCT CCAGGTGTCA ATAAGCCCCA GGCAGTTCCT AAAGCATTGG TAAACACTGC TATCGCTCAT GCCTTAGCTT ACGATATGAT AAAGAAGTTT GACAATGTAA AGGCTGATCC CGACTCTAAT TCTCCGGCAG AGGTGGGTCT AATCTACAAT ATAATTCCAG CATACTCTCC CGAGGGAACA AAATCAGAAA AAGCCGTTGA GCATTATTCA TATTTCCATA ACGAGTTGCT ACTAGAGGCC GTAAAGAACG GCAGATTAGA TGTCGCGCTT GACGGCAAAA ATATCCTTAA ACCCGCTGCT CTAGGCGGGA AGCTTGACTG GTTAGGTGTT AACTACTACA CTAGGATTGT TGTCAAAGAG TCTTCTCGTC GCTTTAACGG ACATCCAGTA TTAGACTTCG AAGCAGTGGC TGGTTACGGA TACGCTTGTG TTCCGTTCGG ACTCTCGAAG ATTGGAAGAG CTTGTGATGG AATGGGGTGG GAGTTCTATC CAGAAGGGCT TATTGATGCA TTGAGGATTG GGTCGACCTA CGCGAGTAAG CTTTTAGTCA CGGAGAACGG CACCTCCGAT CCTAGGGACG TAATACGTCC CAGTTATCTC GTAAACCATC TGTATGCATT ATTACTAGCC ATAGAGGAAG GAATAAATGT AGAAGGTTAT TTGCATTGGG CGTTAACGGA TAACTACGAG TGGGCTCATG GTTTTCGGCA ACGTTTCGGT TTGTTCGAAG TAGACCTCAT CACGAAGAGT AGAATTCCAA GGCATTCTTC AAGGATTTAT AAGCATATAA TCCAGCAAGG TTTTATACCA AGTGAGTATA AGAAAGACAT CGTCGAGTTT AGGGGGATAT AG
|
Protein sequence | MSTMMFPQKF RWGVSESGFQ FEMGDEYRRF IDTNTDWWHW VRDPHNISSR LVSGDLPEDG INYFELFGKD HELARELGLN TYRLGIEWSR IFPHPTWFIE VDFEKDSLGF VKSVRIDEDT LRALDRYACR KAVQMYREIL LDLRKRGFKV IVNLVHFTLP YWIHDPIRAK SSELSEGPLG LLEESFPIEM AKYAAYVAWK FGDLVDMWST FNEPVVPIEL GYLGTYTGFP PGVNKPQAVP KALVNTAIAH ALAYDMIKKF DNVKADPDSN SPAEVGLIYN IIPAYSPEGT KSEKAVEHYS YFHNELLLEA VKNGRLDVAL DGKNILKPAA LGGKLDWLGV NYYTRIVVKE SSRRFNGHPV LDFEAVAGYG YACVPFGLSK IGRACDGMGW EFYPEGLIDA LRIGSTYASK LLVTENGTSD PRDVIRPSYL VNHLYALLLA IEEGINVEGY LHWALTDNYE WAHGFRQRFG LFEVDLITKS RIPRHSSRIY KHIIQQGFIP SEYKKDIVEF RGI
|
| |