Gene Tpen_1269 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1269 
Symbol 
ID4600484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1206045 
End bp1209386 
Gene Length3342 bp 
Protein Length1113 aa 
Translation table11 
GC content65% 
IMG OID639774045 
Productglycoside hydrolase family protein 
Protein accessionYP_920670 
Protein GI119720175 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.172811 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTTGG GGACCGTGCG CCTCGCTGCT GCTACGGCCC TCCTGCTAGC CCTACTCCTA 
CTCGCCGCCG CCCGCGGCGA ACCCGTGACT GTGGACGGCT CAGGCTTCGA CTGGCCGTAC
GACTCCTGCC ACGCGCTCGA CCCCCACGGC GACCTCCTCG ACGCGACCCC CCACTGGTAC
GACAGCAGGG ACCTCCTAGC CTGGTACTAC GCTGTCGGCG ACCAGTACGT CTTCATCAGG
CTAGACCTCC TGGACCTAGC CTACGGCGCC GAGACGGCGA GCTACCCGGA CGGCGGTGTC
GCGGACGCCC TAAACATCTA CGTGATGCTC GGGTGGGAGA ACGCGCCGGG GTACCAAGAG
TGGGCCCCCG ACTACCTGCA GCTGAACGGC TACGGAGTCC ACATAAGCGA CTACCGCTGG
GTCGTAGCTA TCGCGGTGTA CGACTCCGCA CACTACAAGG TGTACAGGTA CGACTGGTCT
GTCCTCGCCG AGAACAGCGG GCTACAGGTA GCCTTCAACA GCCAGTGGGA CCTCGTCGAG
ATCGGCATAC CCAGGAGCAT GCTGGAGAGC TACGGCTGGA CGCCTACATC CAGGGTGTGG
GCCAAGATTG CAACCGCGCT CGTGAAGACG GGCGGCGCGA ACGTCCTGGC GGACGCCATG
CCGAACACGG TCAGCTTCGA CGGGAACGCC GGCCGCTACG AGTGGAGCGG CGCCGTGTTC
AGCGACCAGA AGTGCGGGAC CGCCAAGGTA GCCTTCGTGC ACCACGGGAA CCAGCACCTC
GCGGACAACA GAGCCCTCAA CAAGCCTAAC GCCGTCAACA GCTACGACTA CATACTGAGG
GTGCACGAAG AGCTCTCCGC GAAGGCAGGC AGGAGGATAC CCGTGTGCGT CCACATGTCG
GGAACCCTGG CGGCGAGCTA CGTGTGGTGG GACCCGAGCA TAGTGGCGCA CCTGAGGAGC
CTCGCCGCGA AGGGGCTAGC GTGCATAGTC GGGGGAACCT GGGCTGAGTA CATCACGGCG
TACTTCTACG ACAACTTCAA CGACCCCTCG TTCTACCTCG GAAAGCTCTA CGCCGAGGCA
CTGTTCGGCT ACACCCCGCT CACCGCGTGG ATACCCGAGA GGACGTGGGA CGACGAGAGA
ACCGGTATCG CGTACACAGT CAGCAAGCAC TACTACGCGG TGATACTCGA CGGCAACACG
CACCACGATG ACTGGAGCCC GAACACCAGC CCCTACAAGC CCCACCAGTA CGACCCGTCG
AAGACGGGGG GCAGGGAGCT CTACGTCTTC TTCATCGACT GGGAGATGCA GCAACTCCTC
CTCGCGAACA CGGACGGAGG GCTGAACATA AACCTGAGGA GGAAGCTCGC CTGGGCGGCC
ACCAACGCGG ACCAGCAGAT GCTGTTCCTC TACGCGGACG ACTGGGAGAA GGCCGCCGGG
ATATCCAGCC CGAGCTGGGA CCCCGGCAAC CCGTACCGCT ACAGGGACTC GCTGACCTGG
ATAGCCATGC ACCCGTGGAT ACAGGTCGTA ACGGTGGACG AGGTCGTCGG CTGGCTTAGG
AGCGGCTCGT GGACGCCGGT GAAGTACTAC TACTGCGGCT ACGACACCTA CTACTACCTC
AAGACGTGGG TCCCGGGCTA CCCGTACGAC TACAGAAGGG CCTACGACGG GTGGTACTGG
GGTACGAGCA GCGCGAAGAG TTTCGCCTGG TACGGTAGCG GCCAGCCGGG CTACTCGCTC
CCCGACACCA CGATGCCATT CGGCGACGTC TTCGGGTACA CCACCTACAA CGGGTCGCCC
GCAAACACCG TGATATACAG GCTGCTGGCC CCGGGGAAGG CTCTGGACAG CGCGCCTAGA
AACGAGCTGT GGAGGCTCGC GGTCATGGCG GCTAACGCGT TGCTCTACGA GACAGCCTGG
CACGAGGGCT CCGACTGCGT CGGGTGGGGC CTCAACCAGT GGAACCACTT GAGGCTCGTG
AACGTGCTCC TACTCGCGTC CAAGTGGCTC GACGACGCGC GCCTCGGGAA GATCGCGGGC
GCCTCCTACC TAGTCGGGGA CTTCGACTGG GACGGCAGGC CCGAGGCGGT GATCTACAAC
CGGGCGGTGT TCGCCTACAT AGACGACAAG GGAGGCGCGG CGCCCCTCAT CTTCGCCTAC
AACAGGACCA CCGACAGGGT CTACGTGTCC GTGGGCGCCC CGCTCGTGTA CTGGGGCGTA
CTGGGCGACG CCTGGTACGG CGACAACCAC GTGGGCTTCC TCGCGGACGA CTACTTCGAC
GCCACGGGTA AGAACTACTA CTCGTCCAGC TACCAGCTGG TCTCGGCTTC GAGGAGCGAC
GCCGAGGGGG GAGTCGTGGT CAAGCTCGCG GCTCCCGACA TGGACGGGGA CGGGCGCCCC
GACTTCTACA AGTACTTCGT GCTCCGCGAC ACCGCCAACT ACGTCGACGC GGTCTACGAG
CCCTCCGGGA AGGCTGGGAC CGTGTACTCC GCCGTCGGCC TCTCGGTAGA CCTGCTCGAC
AGCCTCTTCA ACGGCGACAG GGCCGCCAGG GTCGGCGACC CCTCCGGGGC CTCGACGTTC
GGCTACAGGA ACGGCTACAC GGGGGCCTAC GCGTACGTCA AGCCGTTGCA GGGCGCGTCG
TGGACGGGGC CCCAGGACCT CTCCAAGTAC ACCCTCCAGT ACGTTGCGAA GCTAGCAGTC
AGCGTGTCGC AGGGCCAGTC GAAGGTGCGC CTCTACCTGC CCGGCGACCC CGGCCAGTAC
GCGCCCCGGT GCCAGCCCTC GGCCTACGCG ACTATCGGCT TCCTCAGGTC CCCCGGGTCG
ACGGCGGTGC TCGTGGCTGT GTACGGCAAC TCCAGCGCAC CCGTGAACTC CTTCGAGGCT
AGGGTTGTCG ACGCGTCCGG CGCGGGCTCC TGGAGGCTCG ACGCCAGGCT CGCCGGCGGC
TCCCTCGGCT CCCCCTACGC CTCCTTCCTC TTGAACCTGA GCTCGTCCCA GCTCGCTCCC
GGGATGTACT ACGTGGAGCT CAACGTGAGC CTGGGGGCCT CGAGGATCGT CGAGAGGGCG
TACAGCGTCT ACGTGAGGAG GCTCGAACGC GGCTACAACC TGGTGTCCCT CCCGTTCTTC
TACAGCGCCG TCGTGAGCCC CTCGAAGGCC TCCGAGCTCG CCGAGACCGC GGGCACCAGC
CTGCTCGCCG TGTGGAGGTG GGACGTGCAG GCGCAGAGGT TCAGGGGCTA CGTACCCGGT
GTGAGCGGGC CCGAGGACGA CTTCCCGCTG GAGCCCGGGA GCGGCTACTT CGTCTACGCG
AAGTCTCCAG TCGTCGTGGT GTGGGTGGCC GGGAAATGCT GA
 
Protein sequence
MSLGTVRLAA ATALLLALLL LAAARGEPVT VDGSGFDWPY DSCHALDPHG DLLDATPHWY 
DSRDLLAWYY AVGDQYVFIR LDLLDLAYGA ETASYPDGGV ADALNIYVML GWENAPGYQE
WAPDYLQLNG YGVHISDYRW VVAIAVYDSA HYKVYRYDWS VLAENSGLQV AFNSQWDLVE
IGIPRSMLES YGWTPTSRVW AKIATALVKT GGANVLADAM PNTVSFDGNA GRYEWSGAVF
SDQKCGTAKV AFVHHGNQHL ADNRALNKPN AVNSYDYILR VHEELSAKAG RRIPVCVHMS
GTLAASYVWW DPSIVAHLRS LAAKGLACIV GGTWAEYITA YFYDNFNDPS FYLGKLYAEA
LFGYTPLTAW IPERTWDDER TGIAYTVSKH YYAVILDGNT HHDDWSPNTS PYKPHQYDPS
KTGGRELYVF FIDWEMQQLL LANTDGGLNI NLRRKLAWAA TNADQQMLFL YADDWEKAAG
ISSPSWDPGN PYRYRDSLTW IAMHPWIQVV TVDEVVGWLR SGSWTPVKYY YCGYDTYYYL
KTWVPGYPYD YRRAYDGWYW GTSSAKSFAW YGSGQPGYSL PDTTMPFGDV FGYTTYNGSP
ANTVIYRLLA PGKALDSAPR NELWRLAVMA ANALLYETAW HEGSDCVGWG LNQWNHLRLV
NVLLLASKWL DDARLGKIAG ASYLVGDFDW DGRPEAVIYN RAVFAYIDDK GGAAPLIFAY
NRTTDRVYVS VGAPLVYWGV LGDAWYGDNH VGFLADDYFD ATGKNYYSSS YQLVSASRSD
AEGGVVVKLA APDMDGDGRP DFYKYFVLRD TANYVDAVYE PSGKAGTVYS AVGLSVDLLD
SLFNGDRAAR VGDPSGASTF GYRNGYTGAY AYVKPLQGAS WTGPQDLSKY TLQYVAKLAV
SVSQGQSKVR LYLPGDPGQY APRCQPSAYA TIGFLRSPGS TAVLVAVYGN SSAPVNSFEA
RVVDASGAGS WRLDARLAGG SLGSPYASFL LNLSSSQLAP GMYYVELNVS LGASRIVERA
YSVYVRRLER GYNLVSLPFF YSAVVSPSKA SELAETAGTS LLAVWRWDVQ AQRFRGYVPG
VSGPEDDFPL EPGSGYFVYA KSPVVVVWVA GKC