Gene Tpen_0177 details


Gene Information

Locus tag: Tpen_0177
Symbol:
ID: 4600879
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 152495
End bp: 153514
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 60%
IMG OID: 639772931
Product: cellulase
Protein accession: YP_919590
Protein GI: 119719095
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
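
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not translated; both assumptions are consistent with the listed values but are not stated in the record. The variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 152495
end_bp = 153514

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One codon per three bases.
# Assumption: the last codon is a stop codon and contributes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1020 bp
print(protein_length, "aa")  # 339 aa
```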


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTCGAAA CTAGCCTGTT GTCAAAGTTG GCTAACGCCC CGGCGCCTAG CGGCTTCGAG 
AATAGAGTTC GAGAGATAAT CGCGGAGGAG CTTGAAGAGC TGGGCTACGA GCCTGTAACG
GACTCCCTTG GAAACCTGTA CGTGGTGCTC GGAGAGGGTA GGCCGAGCCT GGTTTTAGCG
GCGCATATGG ACGAGGTAGG CTTTATAGTC ACGCACGTAA CGGAGGACGG GTTCTTGAGG
GTAGCCCCGC TAGGCGGGGT AGTCGCGGAG GGGCTTCCCG GTCAGGAGGT GGTCGTGCTG
ACGGATGAGG GGCTTGTCGA GGGGGTTATA GGGGCTACTC CTCCGCATCT ACGGGGGGCT
ACCCAGAAGG AGCTAACAGT GGAGGAGATT TTCATAGATA TAGGAGTCTT GTCCCGGGAG
GAGGCGCGCT CCAAGGGTGT GGACGTTGGT TCGCCTGTAA CCTTCGCGGG GAACTTCAAG
GAGAGAGGCG ATGCAGTGAT AAGCAAGGCG CTCGACGACC GCGTCGGGTG CTACGCGTTG
CTGGAGGCCC TGAGAAGCGG GGCTACTCCG AAGAAGGGTA GCGTCGTCGT AGCGTTCACA
GTGCAGGAGG AGGTCGGGCT GAGGGGATCC TCCGCGCTCG CGAAGGCTCT AGAGCCGAAT
TTCGCCGTAG CCGTCGAGGG AACCATTGCT AACGATACCC CGGGAACTCC TCCAGAGAAG
GTTGTCACCA GGCTGGGCAG AGGTCCCGCC GTACGCTTGA TGGATAAATC GATGATAGCA
AGCATGGAGC TTTACAAGCA CATCAAGGCG CTAGCGGAGT CGAAGTCCAT TCCGTACCAG
GTGCAGATAT CCCCCTATAG CGGGACGGAC GCTGGGAGCT TCGCCGTTCA CGGCGCCGCT
GTCAGCGCAG TCTCCGTGCC CGTAAGGTAC ATTCACTCGC CAGCCTCCCT GGCCTTGAAG
AAGGATGTAG ACGCCACAGT AGAGCTGTTG AAGGCTCTGA TCGAAGAGCC GTTCCCCTGA
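
The sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before the length or GC content can be checked. The following is a minimal Python sketch of that cleanup. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first line of the gene sequence, so its result describes that line and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence only; paste every line to check the full gene.
first_line = "GTGGTCGAAA CTAGCCTGTT GTCAAAGTTG GCTAACGCCC CGGCGCCTAG CGGCTTCGAG"

seq = clean_sequence(first_line)
print(len(seq))                    # number of bases on this line
print(round(gc_fraction(seq), 2))  # GC fraction of this line
```

Running the same two helpers on all lines of the gene sequence should give a length and GC percentage comparable to the Gene Length and GC content fields.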
 
Protein sequence
MVETSLLSKL ANAPAPSGFE NRVREIIAEE LEELGYEPVT DSLGNLYVVL GEGRPSLVLA 
AHMDEVGFIV THVTEDGFLR VAPLGGVVAE GLPGQEVVVL TDEGLVEGVI GATPPHLRGA
TQKELTVEEI FIDIGVLSRE EARSKGVDVG SPVTFAGNFK ERGDAVISKA LDDRVGCYAL
LEALRSGATP KKGSVVVAFT VQEEVGLRGS SALAKALEPN FAVAVEGTIA NDTPGTPPEK
VVTRLGRGPA VRLMDKSMIA SMELYKHIKA LAESKSIPYQ VQISPYSGTD AGSFAVHGAA
VSAVSVPVRY IHSPASLALK KDVDATVELL KALIEEPFP
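
The gene sequence begins with GTG, while the protein begins with M. Under translation table 11, GTG can serve as an alternative start codon, and the initiating residue is methionine. The Python sketch below translates only the first 30 bases to illustrate this. The codon table is deliberately partial (it holds just the codons that occur in those 30 bases, plus ATG), and the function name `translate_start` is hypothetical.

```python
# Partial codon table: only the codons found in the first 30 bases, plus ATG.
# A full translation would need all 64 codons.
CODONS = {
    "ATG": "M", "GTG": "V", "GTC": "V", "GAA": "E", "ACT": "T",
    "AGC": "S", "CTG": "L", "TTG": "L", "TCA": "S", "AAG": "K",
}

# Start codons treated as initiator methionine in this sketch.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_start(dna: str) -> str:
    """Translate a short coding fragment that begins at the start codon."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODONS[c] for c in codons]  # KeyError if a codon is not in the table
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # the first codon is read as methionine
    return "".join(residues)


# First three 10-base blocks of the gene sequence, with spaces removed.
fragment = "GTGGTCGAAACTAGCCTGTTGTCAAAGTTG"
print(translate_start(fragment))  # MVETSLLSKL
```

The result matches the first 10 residues of the protein sequence above.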