Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1269 |
Symbol | |
ID | 4600484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1206045 |
End bp | 1209386 |
Gene Length | 3342 bp |
Protein Length | 1113 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639774045 |
Product | glycoside hydrolase family protein |
Protein accession | YP_920670 |
Protein GI | 119720175 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.172811 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTTGG GGACCGTGCG CCTCGCTGCT GCTACGGCCC TCCTGCTAGC CCTACTCCTA CTCGCCGCCG CCCGCGGCGA ACCCGTGACT GTGGACGGCT CAGGCTTCGA CTGGCCGTAC GACTCCTGCC ACGCGCTCGA CCCCCACGGC GACCTCCTCG ACGCGACCCC CCACTGGTAC GACAGCAGGG ACCTCCTAGC CTGGTACTAC GCTGTCGGCG ACCAGTACGT CTTCATCAGG CTAGACCTCC TGGACCTAGC CTACGGCGCC GAGACGGCGA GCTACCCGGA CGGCGGTGTC GCGGACGCCC TAAACATCTA CGTGATGCTC GGGTGGGAGA ACGCGCCGGG GTACCAAGAG TGGGCCCCCG ACTACCTGCA GCTGAACGGC TACGGAGTCC ACATAAGCGA CTACCGCTGG GTCGTAGCTA TCGCGGTGTA CGACTCCGCA CACTACAAGG TGTACAGGTA CGACTGGTCT GTCCTCGCCG AGAACAGCGG GCTACAGGTA GCCTTCAACA GCCAGTGGGA CCTCGTCGAG ATCGGCATAC CCAGGAGCAT GCTGGAGAGC TACGGCTGGA CGCCTACATC CAGGGTGTGG GCCAAGATTG CAACCGCGCT CGTGAAGACG GGCGGCGCGA ACGTCCTGGC GGACGCCATG CCGAACACGG TCAGCTTCGA CGGGAACGCC GGCCGCTACG AGTGGAGCGG CGCCGTGTTC AGCGACCAGA AGTGCGGGAC CGCCAAGGTA GCCTTCGTGC ACCACGGGAA CCAGCACCTC GCGGACAACA GAGCCCTCAA CAAGCCTAAC GCCGTCAACA GCTACGACTA CATACTGAGG GTGCACGAAG AGCTCTCCGC GAAGGCAGGC AGGAGGATAC CCGTGTGCGT CCACATGTCG GGAACCCTGG CGGCGAGCTA CGTGTGGTGG GACCCGAGCA TAGTGGCGCA CCTGAGGAGC CTCGCCGCGA AGGGGCTAGC GTGCATAGTC GGGGGAACCT GGGCTGAGTA CATCACGGCG TACTTCTACG ACAACTTCAA CGACCCCTCG TTCTACCTCG GAAAGCTCTA CGCCGAGGCA CTGTTCGGCT ACACCCCGCT CACCGCGTGG ATACCCGAGA GGACGTGGGA CGACGAGAGA ACCGGTATCG CGTACACAGT CAGCAAGCAC TACTACGCGG TGATACTCGA CGGCAACACG CACCACGATG ACTGGAGCCC GAACACCAGC CCCTACAAGC CCCACCAGTA CGACCCGTCG AAGACGGGGG GCAGGGAGCT CTACGTCTTC TTCATCGACT GGGAGATGCA GCAACTCCTC CTCGCGAACA CGGACGGAGG GCTGAACATA AACCTGAGGA GGAAGCTCGC CTGGGCGGCC ACCAACGCGG ACCAGCAGAT GCTGTTCCTC TACGCGGACG ACTGGGAGAA GGCCGCCGGG ATATCCAGCC CGAGCTGGGA CCCCGGCAAC CCGTACCGCT ACAGGGACTC GCTGACCTGG ATAGCCATGC ACCCGTGGAT ACAGGTCGTA ACGGTGGACG AGGTCGTCGG CTGGCTTAGG AGCGGCTCGT GGACGCCGGT GAAGTACTAC TACTGCGGCT ACGACACCTA CTACTACCTC AAGACGTGGG TCCCGGGCTA CCCGTACGAC TACAGAAGGG CCTACGACGG GTGGTACTGG GGTACGAGCA GCGCGAAGAG TTTCGCCTGG TACGGTAGCG GCCAGCCGGG CTACTCGCTC CCCGACACCA CGATGCCATT CGGCGACGTC TTCGGGTACA CCACCTACAA CGGGTCGCCC GCAAACACCG TGATATACAG GCTGCTGGCC CCGGGGAAGG CTCTGGACAG CGCGCCTAGA AACGAGCTGT GGAGGCTCGC GGTCATGGCG GCTAACGCGT TGCTCTACGA GACAGCCTGG CACGAGGGCT CCGACTGCGT CGGGTGGGGC CTCAACCAGT GGAACCACTT GAGGCTCGTG AACGTGCTCC TACTCGCGTC CAAGTGGCTC GACGACGCGC GCCTCGGGAA GATCGCGGGC GCCTCCTACC TAGTCGGGGA CTTCGACTGG GACGGCAGGC CCGAGGCGGT GATCTACAAC CGGGCGGTGT TCGCCTACAT AGACGACAAG GGAGGCGCGG CGCCCCTCAT CTTCGCCTAC AACAGGACCA CCGACAGGGT CTACGTGTCC GTGGGCGCCC CGCTCGTGTA CTGGGGCGTA CTGGGCGACG CCTGGTACGG CGACAACCAC GTGGGCTTCC TCGCGGACGA CTACTTCGAC GCCACGGGTA AGAACTACTA CTCGTCCAGC TACCAGCTGG TCTCGGCTTC GAGGAGCGAC GCCGAGGGGG GAGTCGTGGT CAAGCTCGCG GCTCCCGACA TGGACGGGGA CGGGCGCCCC GACTTCTACA AGTACTTCGT GCTCCGCGAC ACCGCCAACT ACGTCGACGC GGTCTACGAG CCCTCCGGGA AGGCTGGGAC CGTGTACTCC GCCGTCGGCC TCTCGGTAGA CCTGCTCGAC AGCCTCTTCA ACGGCGACAG GGCCGCCAGG GTCGGCGACC CCTCCGGGGC CTCGACGTTC GGCTACAGGA ACGGCTACAC GGGGGCCTAC GCGTACGTCA AGCCGTTGCA GGGCGCGTCG TGGACGGGGC CCCAGGACCT CTCCAAGTAC ACCCTCCAGT ACGTTGCGAA GCTAGCAGTC AGCGTGTCGC AGGGCCAGTC GAAGGTGCGC CTCTACCTGC CCGGCGACCC CGGCCAGTAC GCGCCCCGGT GCCAGCCCTC GGCCTACGCG ACTATCGGCT TCCTCAGGTC CCCCGGGTCG ACGGCGGTGC TCGTGGCTGT GTACGGCAAC TCCAGCGCAC CCGTGAACTC CTTCGAGGCT AGGGTTGTCG ACGCGTCCGG CGCGGGCTCC TGGAGGCTCG ACGCCAGGCT CGCCGGCGGC TCCCTCGGCT CCCCCTACGC CTCCTTCCTC TTGAACCTGA GCTCGTCCCA GCTCGCTCCC GGGATGTACT ACGTGGAGCT CAACGTGAGC CTGGGGGCCT CGAGGATCGT CGAGAGGGCG TACAGCGTCT ACGTGAGGAG GCTCGAACGC GGCTACAACC TGGTGTCCCT CCCGTTCTTC TACAGCGCCG TCGTGAGCCC CTCGAAGGCC TCCGAGCTCG CCGAGACCGC GGGCACCAGC CTGCTCGCCG TGTGGAGGTG GGACGTGCAG GCGCAGAGGT TCAGGGGCTA CGTACCCGGT GTGAGCGGGC CCGAGGACGA CTTCCCGCTG GAGCCCGGGA GCGGCTACTT CGTCTACGCG AAGTCTCCAG TCGTCGTGGT GTGGGTGGCC GGGAAATGCT GA
|
Protein sequence | MSLGTVRLAA ATALLLALLL LAAARGEPVT VDGSGFDWPY DSCHALDPHG DLLDATPHWY DSRDLLAWYY AVGDQYVFIR LDLLDLAYGA ETASYPDGGV ADALNIYVML GWENAPGYQE WAPDYLQLNG YGVHISDYRW VVAIAVYDSA HYKVYRYDWS VLAENSGLQV AFNSQWDLVE IGIPRSMLES YGWTPTSRVW AKIATALVKT GGANVLADAM PNTVSFDGNA GRYEWSGAVF SDQKCGTAKV AFVHHGNQHL ADNRALNKPN AVNSYDYILR VHEELSAKAG RRIPVCVHMS GTLAASYVWW DPSIVAHLRS LAAKGLACIV GGTWAEYITA YFYDNFNDPS FYLGKLYAEA LFGYTPLTAW IPERTWDDER TGIAYTVSKH YYAVILDGNT HHDDWSPNTS PYKPHQYDPS KTGGRELYVF FIDWEMQQLL LANTDGGLNI NLRRKLAWAA TNADQQMLFL YADDWEKAAG ISSPSWDPGN PYRYRDSLTW IAMHPWIQVV TVDEVVGWLR SGSWTPVKYY YCGYDTYYYL KTWVPGYPYD YRRAYDGWYW GTSSAKSFAW YGSGQPGYSL PDTTMPFGDV FGYTTYNGSP ANTVIYRLLA PGKALDSAPR NELWRLAVMA ANALLYETAW HEGSDCVGWG LNQWNHLRLV NVLLLASKWL DDARLGKIAG ASYLVGDFDW DGRPEAVIYN RAVFAYIDDK GGAAPLIFAY NRTTDRVYVS VGAPLVYWGV LGDAWYGDNH VGFLADDYFD ATGKNYYSSS YQLVSASRSD AEGGVVVKLA APDMDGDGRP DFYKYFVLRD TANYVDAVYE PSGKAGTVYS AVGLSVDLLD SLFNGDRAAR VGDPSGASTF GYRNGYTGAY AYVKPLQGAS WTGPQDLSKY TLQYVAKLAV SVSQGQSKVR LYLPGDPGQY APRCQPSAYA TIGFLRSPGS TAVLVAVYGN SSAPVNSFEA RVVDASGAGS WRLDARLAGG SLGSPYASFL LNLSSSQLAP GMYYVELNVS LGASRIVERA YSVYVRRLER GYNLVSLPFF YSAVVSPSKA SELAETAGTS LLAVWRWDVQ AQRFRGYVPG VSGPEDDFPL EPGSGYFVYA KSPVVVVWVA GKC
|
| |