Gene Tpen_0002 details


Gene Information

Locus tag: Tpen_0002
Symbol:
ID: 4600534
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 887
End bp: 1984
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 57%
IMG OID: 639772755
Product: cellulase
Protein accession: YP_919415
Protein GI: 119718920
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
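
The start, end, gene length, and protein length fields can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS is complete with a terminal stop codon; the function names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 887
END_BP = 1984


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of a complete CDS: one residue per codon, minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1098 365
```

Under these assumptions the computed values agree with the listed gene length (1098 bp) and protein length (365 aa).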


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCAGCGT ACAAATTAGA TGAAAAATCC CTGAACGTTT TAAAAGAGAT AACGGAGATA 
GTGGCGCCTT CCGGCTTCGA GGAGCCCGTC CTCGAGAGGA TAAAGCAGTA CTACTCGGAG
TACGCAGACG AGGTGAGACG CGATAACCTT GGCTCGCTGA TCCTCGTCAA GAGGGGTTCG
AGCGAGAGGC CCAAGGTTCT TGTCGCTGGT CACGTGGACG AAGTGGGCTT CCTCGTAACG
GGGATAACCC CCGAAGGTTT CATCACGTTC ACCACGCTGG GAGGCTGGTT TGAGCAGGTT
CTCCTGGCTC AGCGGGTCGT CATAAGGACG AAGAAGGGGG AGGTCTACGG CGTCATTACG
AGCAAGCCTC CGCACCTGTT GACACCGGAG GAGAGGCAGA AGGTCGTCCA GTTCAGCCAG
ATGTACATCG ACGTCGGCGC TACGAGCAAG GAGGAAGTAG AAAAGCTAGG TGTAAGAATA
GGGGACCCGG TGGCGCCGTG GTCTCCCTTC ACGAGGACCG CGTTCGAGGA CAGAATCATG
GCGAAAGCTT TGGACGACAG AGTGGGGGCT TTCATAGCGA TGGAGGTCCT CAAGCACCTC
AGGCTTAACG GTATTGACCA CCCGAACACG CTCTACGCGG CTGCAACTGT GCAGGAGGAG
GTTGGGCTTA GGGGCGCCGA GACTGTTGGA TGGGTGGCAG ACCACGACGT AGCCATAGTT
ACGGAGGTAG ACATAGCCGG CGATGTGCCG GGGATAAAGC CTAGCGAGGC TCCGGCTAAA
CTCGGGAAGG GACCGTCCAT AATCGTGTAC GACAGGTCGA TGATACCTAA TCCTCGCTTC
AAGGAGTTCG TCATAGAGGT CGCCGAGGAG GCAAAGATCC CCTACCAGCT ATCGGCTGTG
AGTGGCGGCA CGGATGCCGG CAGGCTTCAC CTTTACAAGG GAGGAAGGCC CAGCATTGTG
ATAGGCGTGC CCACTAGGCA TATACACAGC CACGTCAGCA TCGTGAGTCT GAGCGACGTG
GAGAACGCCG TTCGACTAGT GCTGGAACTC GTGAAGCGGC TGGACCAGGA AACCCTAAAA
AGGTTCGTGA ATATATAG
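
A GC content figure such as the one listed for this gene can be recomputed from the sequence text. The sketch below is one way to do that, assuming the sequence is pasted in the same layout as above (blocks of 10 bases separated by spaces and line breaks); `gc_content` is a hypothetical helper, and how the database itself computes or rounds the value is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks and lines is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# First row of the gene sequence, copied as formatted in the record.
first_row = "GTGTCAGCGT ACAAATTAGA TGAAAAATCC CTGAACGTTT TAAAAGAGAT AACGGAGATA"
print(round(gc_content(first_row), 1))
```

Passing the full 1098 bp sequence instead of `first_row` gives the gene-wide percentage.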
 
Protein sequence
MSAYKLDEKS LNVLKEITEI VAPSGFEEPV LERIKQYYSE YADEVRRDNL GSLILVKRGS 
SERPKVLVAG HVDEVGFLVT GITPEGFITF TTLGGWFEQV LLAQRVVIRT KKGEVYGVIT
SKPPHLLTPE ERQKVVQFSQ MYIDVGATSK EEVEKLGVRI GDPVAPWSPF TRTAFEDRIM
AKALDDRVGA FIAMEVLKHL RLNGIDHPNT LYAAATVQEE VGLRGAETVG WVADHDVAIV
TEVDIAGDVP GIKPSEAPAK LGKGPSIIVY DRSMIPNPRF KEFVIEVAEE AKIPYQLSAV
SGGTDAGRLH LYKGGRPSIV IGVPTRHIHS HVSIVSLSDV ENAVRLVLEL VKRLDQETLK
RFVNI
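
The protein sequence corresponds to the gene sequence translated with translation table 11, in which codons have the standard amino acid assignments but alternative start codons such as GTG encode methionine at the first position. The following is a minimal sketch of that translation, not the database's own pipeline; `translate` and `START_CODONS` are illustrative names, and only ATG, GTG, and TTG are treated as initiators here, although table 11 lists further ones.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a reduced set of initiation codons, sufficient for this gene.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a CDS; a start codon at position 0 becomes M, translation stops at a stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence; GTG is read as the initiator methionine.
print(translate("GTGTCAGCGT ACAAATTAGA TGAAAAATCC"))  # MSAYKLDEKS
```

Note that GTG is read as valine when it occurs inside the reading frame, as in the final codons of this gene (`AGG TTC GTG AAT ATA TAG`, giving `RFVNI`).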