Gene Tpen_1628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1628 
Symbol 
ID4601031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1573072 
End bp1576170 
Gene Length3099 bp 
Protein Length1032 aa 
Translation table11 
GC content66% 
IMG OID639774401 
Productglycoside hydrolase family protein 
Protein accessionYP_921026 
Protein GI119720531 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0383] Alpha-mannosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCTAC GCGACGCCGA GAGGTCGCTG TTCGACCTTC TGGTAGCGTC CGTTCAGAGG 
TACATCCCGC TCTCCGTCTG GAGGCTCGAC GGGAGGGAGA AGTTCCTACC GGTACGCGTG
CAGGCTGAGC CGGGTAGGAT CTACGAGTTC GTCGGGAAAG CTAGGGTCCC GTCGAGCGAG
CACAGGTGGT TCGCGAAGTT CGTCCTGAGC GGGAACGCTA CGCTCGAGGT CGACGGGAGG
CTGGTAGGCG GGATAGACGA AGCGCACACG TACTTCCCGC TGGACCCGGG GCTCCACGAG
CTGAGGATTA GGGCTTCGCC GAGGGGGATG TTCGGCTTCC ACGACTGGAG CCTGAGCTTC
GAGAAAGCCT TCCTCGCCGA GGTGTACTGG CGCGGCTTCT CGCTCGCGCT CAAGCTACTC
TCCCTCGTGT CCTACCTCAA GTCCCTTCCC CCCGACGACG CGGTGAGGAG GAGGGTCGAA
GGGGAGCTCT TCGACGCCTT GATGTCCGCT CCCGTCTCGC CGAGCGTGCT CCAGATAACG
GTGGCGTTGA GCCTCCTCTA TGAGGGCGGG AGGAGCTTCT CGAGGGGGGA CCTGCCCGAG
AGGTACGGGG ACTACGCGTG GCTCTCCGGT GTCTACGGCG CCGGGATCCT GGGCGGCAGG
CTCAGGGACG TGGGCGCGTC GAGCCTCGAC GACGTGAAGC GCGCCTGCGA CAGGCTGGAG
CCCCTCCTCT CACTCCTCGA GGAGCTGAGG AAAAGCCAGG AGGGCGCGGG GGAGCTCTTC
ATAGTTGGGC ACAGCCACAT AGACGCCGCG TGGCTCTGGC CCGTGCAGGA GACCAGGGAG
AAGGTCGCGA GGACGTTTGC GAACGTGGTC TCGCTCCTGA GGGAGTACGG GGGACCCGTG
TACGCGCAGA GCTCGGCGCT GTACTACGAG TGGGTCGAGG AGGACTACCC GGAGCTTTTC
CGCGAAATCA AGCGCCTCGT CGAGTCGGGT AGGTGGATAC CGGTTGGGGG CATGTGGGTT
GAGAGCGACG TCCAGCTCGT GGAGGGGGAG AGCCTGGCGA GGCAGTTCCT GTACGGCCAG
AGGTACTTCT ACTCCAGGTT CGGGAGGACA GCCCGGATCG GGTGGATACC GGACAGCTTC
GGCTTCCCCT ACTCGTTGCC CCAGCTACTC GTCAAGAGCG GGCTCGAGTG CTTCGTGACG
CACAAGGTGC TCTGGAACGA TACCAACGAG TTCCCCTACC ACTCCTTCCT CTGGAGGGGG
GTCGACGGCT CCACTATACC CGTCCAGATC CTCCTGAACA GCTACAACGA GATGCTCACC
CCGGGAAGCG TGAAGGCGTA CTGGTCCAGG TACAAGCAGA GGGGCGAGGT CCCCTTCCTG
CCGTACGCCT ACGGCTACGG GGACGGGGGA GGGGGGCCGA CCAGGGAGAT GCTCGAATCC
CTCGATGTCG TCAAGAGGCT CCCCGGGTTG CCGAGGCTCA GGCACCTGGA AGAGGAGGAG
TACCTGGCGA GGCTCAAGGC TTCCGAGGGC TCGATGCCCG TGTGGGACGG GGAGCTCTAC
GTGGAGTTCC ACAGGGGGAC CTACACCACG AACCTACAGG TAAAGGAGCT GATGTGGAGG
GCGGAGCTAG CCCTCCTGAC AGCCGAGGAG GTCGCCTCGC TCGCATCCTC CAGGACGGGG
GAGGGGCTGT CGGCGCGCCT CGAGAGGCTC TGGAAGACCC TCCTCCTGCA CCAGTTCCAC
GACATCCTGC CGGGGTCCTC GATGAAGGAG GTCTACGAGG ACGCGTACAG GGACCTCTCG
ATGGTCCTGC GGGAGGCCTC CTCCCTGGCC CGGGAGGCCC TCGAAGCCTA CCTGGGGGGC
GGCGAAGGCG GGGTCGCAGT CTTCAACCCC CTGCCGTGGC CCCGGAGGGC TATCGTCTCC
CTCCCCATGG GCATGGCGCC GCGCGGCGCG GAGTGCCAGG AGTCGGAGGA GGGGGTTCTG
GTCGAGGTCG AGCTACCGCC GGCCGGCTAC CGCGTCTACA GCGCGTCGGG CGGGTGCGTG
GATGAAGGGG AGCCGCTGAG AGTAGAGGAG GTCGACGGCG GGATACTGCT CTCCAGCGGC
GCAGTCGAGG TGGTGGTAGA CCGGGACGGC TCGCTGGGGT CCGTGAGGTT GCTCCCCGGC
GGAGAGGAGG TGCTGTCGCG GCCATCAAAC CAGCTCCGCG TGCACTTCGA CAGGCCGGGG
GTTTTCGACG CCTGGGAGCT TACGGACGAC TTCCTGTCCA GGTGGGAGGA GGCGAGGACG
CTCTCGGAGC CCAGGCTCGT TGAGCGCGGG CGGCTGAGGG CGTGCGTCGA GTACGTAAAG
GGCTTCGGCA AGTCCAGAGT TACGCAGAGG GTGTGCGTCT ACAGGTTCAG CCCCGTGGTG
GAGGTGAAGA CGAGGCTCGA GTGGTTCGAC AAGGGGTTCC TGGTGAAGGC CTGGATGAAG
CCAGCGTTCA AGCCCACCGC GGCTGTCTTC GAGACCCCCT ACGGCGTCGT GTACAGGAGC
CCGGCTTGGG CTTCGAGCAT CGACAGGGCG AAGTTCGAGG CGCCAGCGCT CAGGTGGGTC
GACGTCTCCG ACGGCGCTAG GGGCTTCGCG GTGATAGCCC CGGGGAGGCA CGGCTACTCC
GTCCGGGAGG ACTACGTTAG CCTCAGCCTC CTCAAGTCCC CCACGTTCCC GAACCCGTGG
AGCGACGTGG GGGAGTTCGA GACGACGTAC TACATCTACC CGCACCTAGG AGGCTACGAG
GAGGCACGCG TAGCCTTCGT GGCCGCCGAG CTCCTCAGGC AGCCCCTCGC CGTGGCTACG
GCGCCTGTAG AGCGCTCCGA GAGCTTCCTC TCCGTCAACC CCCCGGAGGC CCTCCTAGGG
GCGTTTAAGG CGGCGGAGGA CGGGGACGGC TACGTGATGA GGCTGTACAA CCCCCACCGC
GGAGAAGTAG AGGTGGAGGT AGAGGTGAAC GTCCCCTTTA GGGAGGCCGT GGAGGTAGAC
ATACCGGAGC TGAGGGTCCT GGGGAGGGTG GAGGTGTCGG GCGGCAAGCT ACGCTTAAGG
CTCAAGCCCT TCGAGGTAAA AACGGTAAAG CTCAGGTAG
 
Protein sequence
MKLRDAERSL FDLLVASVQR YIPLSVWRLD GREKFLPVRV QAEPGRIYEF VGKARVPSSE 
HRWFAKFVLS GNATLEVDGR LVGGIDEAHT YFPLDPGLHE LRIRASPRGM FGFHDWSLSF
EKAFLAEVYW RGFSLALKLL SLVSYLKSLP PDDAVRRRVE GELFDALMSA PVSPSVLQIT
VALSLLYEGG RSFSRGDLPE RYGDYAWLSG VYGAGILGGR LRDVGASSLD DVKRACDRLE
PLLSLLEELR KSQEGAGELF IVGHSHIDAA WLWPVQETRE KVARTFANVV SLLREYGGPV
YAQSSALYYE WVEEDYPELF REIKRLVESG RWIPVGGMWV ESDVQLVEGE SLARQFLYGQ
RYFYSRFGRT ARIGWIPDSF GFPYSLPQLL VKSGLECFVT HKVLWNDTNE FPYHSFLWRG
VDGSTIPVQI LLNSYNEMLT PGSVKAYWSR YKQRGEVPFL PYAYGYGDGG GGPTREMLES
LDVVKRLPGL PRLRHLEEEE YLARLKASEG SMPVWDGELY VEFHRGTYTT NLQVKELMWR
AELALLTAEE VASLASSRTG EGLSARLERL WKTLLLHQFH DILPGSSMKE VYEDAYRDLS
MVLREASSLA REALEAYLGG GEGGVAVFNP LPWPRRAIVS LPMGMAPRGA ECQESEEGVL
VEVELPPAGY RVYSASGGCV DEGEPLRVEE VDGGILLSSG AVEVVVDRDG SLGSVRLLPG
GEEVLSRPSN QLRVHFDRPG VFDAWELTDD FLSRWEEART LSEPRLVERG RLRACVEYVK
GFGKSRVTQR VCVYRFSPVV EVKTRLEWFD KGFLVKAWMK PAFKPTAAVF ETPYGVVYRS
PAWASSIDRA KFEAPALRWV DVSDGARGFA VIAPGRHGYS VREDYVSLSL LKSPTFPNPW
SDVGEFETTY YIYPHLGGYE EARVAFVAAE LLRQPLAVAT APVERSESFL SVNPPEALLG
AFKAAEDGDG YVMRLYNPHR GEVEVEVEVN VPFREAVEVD IPELRVLGRV EVSGGKLRLR
LKPFEVKTVK LR