Gene Information |
Locus tag | Tpen_0002 |
Symbol | |
ID | 4600534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 887 |
End bp | 1984 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639772755 |
Product | cellulase |
Protein accession | YP_919415 |
Protein GI | 119718920 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCAGCGT ACAAATTAGA TGAAAAATCC CTGAACGTTT TAAAAGAGAT AACGGAGATA GTGGCGCCTT CCGGCTTCGA GGAGCCCGTC CTCGAGAGGA TAAAGCAGTA CTACTCGGAG TACGCAGACG AGGTGAGACG CGATAACCTT GGCTCGCTGA TCCTCGTCAA GAGGGGTTCG AGCGAGAGGC CCAAGGTTCT TGTCGCTGGT CACGTGGACG AAGTGGGCTT CCTCGTAACG GGGATAACCC CCGAAGGTTT CATCACGTTC ACCACGCTGG GAGGCTGGTT TGAGCAGGTT CTCCTGGCTC AGCGGGTCGT CATAAGGACG AAGAAGGGGG AGGTCTACGG CGTCATTACG AGCAAGCCTC CGCACCTGTT GACACCGGAG GAGAGGCAGA AGGTCGTCCA GTTCAGCCAG ATGTACATCG ACGTCGGCGC TACGAGCAAG GAGGAAGTAG AAAAGCTAGG TGTAAGAATA GGGGACCCGG TGGCGCCGTG GTCTCCCTTC ACGAGGACCG CGTTCGAGGA CAGAATCATG GCGAAAGCTT TGGACGACAG AGTGGGGGCT TTCATAGCGA TGGAGGTCCT CAAGCACCTC AGGCTTAACG GTATTGACCA CCCGAACACG CTCTACGCGG CTGCAACTGT GCAGGAGGAG GTTGGGCTTA GGGGCGCCGA GACTGTTGGA TGGGTGGCAG ACCACGACGT AGCCATAGTT ACGGAGGTAG ACATAGCCGG CGATGTGCCG GGGATAAAGC CTAGCGAGGC TCCGGCTAAA CTCGGGAAGG GACCGTCCAT AATCGTGTAC GACAGGTCGA TGATACCTAA TCCTCGCTTC AAGGAGTTCG TCATAGAGGT CGCCGAGGAG GCAAAGATCC CCTACCAGCT ATCGGCTGTG AGTGGCGGCA CGGATGCCGG CAGGCTTCAC CTTTACAAGG GAGGAAGGCC CAGCATTGTG ATAGGCGTGC CCACTAGGCA TATACACAGC CACGTCAGCA TCGTGAGTCT GAGCGACGTG GAGAACGCCG TTCGACTAGT GCTGGAACTC GTGAAGCGGC TGGACCAGGA AACCCTAAAA AGGTTCGTGA ATATATAG
|
Protein sequence | MSAYKLDEKS LNVLKEITEI VAPSGFEEPV LERIKQYYSE YADEVRRDNL GSLILVKRGS SERPKVLVAG HVDEVGFLVT GITPEGFITF TTLGGWFEQV LLAQRVVIRT KKGEVYGVIT SKPPHLLTPE ERQKVVQFSQ MYIDVGATSK EEVEKLGVRI GDPVAPWSPF TRTAFEDRIM AKALDDRVGA FIAMEVLKHL RLNGIDHPNT LYAAATVQEE VGLRGAETVG WVADHDVAIV TEVDIAGDVP GIKPSEAPAK LGKGPSIIVY DRSMIPNPRF KEFVIEVAEE AKIPYQLSAV SGGTDAGRLH LYKGGRPSIV IGVPTRHIHS HVSIVSLSDV ENAVRLVLEL VKRLDQETLK RFVNI
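
The coordinate, length, and GC fields in this record are related by simple arithmetic. The Python sketch below is one way to check them. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(field: str) -> str:
    """Remove the spaces that split a sequence field into 10-character groups."""
    return "".join(field.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Values taken from the record: Start bp 887, End bp 1984.
length = gene_length(887, 1984)
print(length)                  # 1098
print(protein_length(length))  # 365
```

Passing the Gene sequence field to `gc_percent` gives a value that can be compared with the GC content row. `clean_sequence` is applied first because the field is printed in space-separated groups.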