Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0177 |
Symbol | |
ID | 4600879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 152495 |
End bp | 153514 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639772931 |
Product | cellulase |
Protein accession | YP_919590 |
Protein GI | 119719095 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTCGAAA CTAGCCTGTT GTCAAAGTTG GCTAACGCCC CGGCGCCTAG CGGCTTCGAG AATAGAGTTC GAGAGATAAT CGCGGAGGAG CTTGAAGAGC TGGGCTACGA GCCTGTAACG GACTCCCTTG GAAACCTGTA CGTGGTGCTC GGAGAGGGTA GGCCGAGCCT GGTTTTAGCG GCGCATATGG ACGAGGTAGG CTTTATAGTC ACGCACGTAA CGGAGGACGG GTTCTTGAGG GTAGCCCCGC TAGGCGGGGT AGTCGCGGAG GGGCTTCCCG GTCAGGAGGT GGTCGTGCTG ACGGATGAGG GGCTTGTCGA GGGGGTTATA GGGGCTACTC CTCCGCATCT ACGGGGGGCT ACCCAGAAGG AGCTAACAGT GGAGGAGATT TTCATAGATA TAGGAGTCTT GTCCCGGGAG GAGGCGCGCT CCAAGGGTGT GGACGTTGGT TCGCCTGTAA CCTTCGCGGG GAACTTCAAG GAGAGAGGCG ATGCAGTGAT AAGCAAGGCG CTCGACGACC GCGTCGGGTG CTACGCGTTG CTGGAGGCCC TGAGAAGCGG GGCTACTCCG AAGAAGGGTA GCGTCGTCGT AGCGTTCACA GTGCAGGAGG AGGTCGGGCT GAGGGGATCC TCCGCGCTCG CGAAGGCTCT AGAGCCGAAT TTCGCCGTAG CCGTCGAGGG AACCATTGCT AACGATACCC CGGGAACTCC TCCAGAGAAG GTTGTCACCA GGCTGGGCAG AGGTCCCGCC GTACGCTTGA TGGATAAATC GATGATAGCA AGCATGGAGC TTTACAAGCA CATCAAGGCG CTAGCGGAGT CGAAGTCCAT TCCGTACCAG GTGCAGATAT CCCCCTATAG CGGGACGGAC GCTGGGAGCT TCGCCGTTCA CGGCGCCGCT GTCAGCGCAG TCTCCGTGCC CGTAAGGTAC ATTCACTCGC CAGCCTCCCT GGCCTTGAAG AAGGATGTAG ACGCCACAGT AGAGCTGTTG AAGGCTCTGA TCGAAGAGCC GTTCCCCTGA
|
Protein sequence | MVETSLLSKL ANAPAPSGFE NRVREIIAEE LEELGYEPVT DSLGNLYVVL GEGRPSLVLA AHMDEVGFIV THVTEDGFLR VAPLGGVVAE GLPGQEVVVL TDEGLVEGVI GATPPHLRGA TQKELTVEEI FIDIGVLSRE EARSKGVDVG SPVTFAGNFK ERGDAVISKA LDDRVGCYAL LEALRSGATP KKGSVVVAFT VQEEVGLRGS SALAKALEPN FAVAVEGTIA NDTPGTPPEK VVTRLGRGPA VRLMDKSMIA SMELYKHIKA LAESKSIPYQ VQISPYSGTD AGSFAVHGAA VSAVSVPVRY IHSPASLALK KDVDATVELL KALIEEPFP
|
| |