Gene Information |
Locus tag | Tpen_1628 |
Symbol | |
ID | 4601031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1573072 |
End bp | 1576170 |
Gene Length | 3099 bp |
Protein Length | 1032 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639774401 |
Product | glycoside hydrolase family protein |
Protein accession | YP_921026 |
Protein GI | 119720531 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0383] Alpha-mannosidase |
TIGRFAM ID | |
|
|
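The length fields in this record follow from its coordinates. Assuming Start bp and End bp are 1-based inclusive positions (an assumption, though it agrees with the values listed), the gene length is End - Start + 1, and for an unspliced CDS the protein length is the codon count minus one stop codon. A minimal Python sketch of that arithmetic, with hypothetical function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1573072, 1576170)
print(length_bp)                  # 3099
print(protein_length(length_bp))  # 1032
```

Both results match the Gene Length and Protein Length fields above.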
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTAC GCGACGCCGA GAGGTCGCTG TTCGACCTTC TGGTAGCGTC CGTTCAGAGG TACATCCCGC TCTCCGTCTG GAGGCTCGAC GGGAGGGAGA AGTTCCTACC GGTACGCGTG CAGGCTGAGC CGGGTAGGAT CTACGAGTTC GTCGGGAAAG CTAGGGTCCC GTCGAGCGAG CACAGGTGGT TCGCGAAGTT CGTCCTGAGC GGGAACGCTA CGCTCGAGGT CGACGGGAGG CTGGTAGGCG GGATAGACGA AGCGCACACG TACTTCCCGC TGGACCCGGG GCTCCACGAG CTGAGGATTA GGGCTTCGCC GAGGGGGATG TTCGGCTTCC ACGACTGGAG CCTGAGCTTC GAGAAAGCCT TCCTCGCCGA GGTGTACTGG CGCGGCTTCT CGCTCGCGCT CAAGCTACTC TCCCTCGTGT CCTACCTCAA GTCCCTTCCC CCCGACGACG CGGTGAGGAG GAGGGTCGAA GGGGAGCTCT TCGACGCCTT GATGTCCGCT CCCGTCTCGC CGAGCGTGCT CCAGATAACG GTGGCGTTGA GCCTCCTCTA TGAGGGCGGG AGGAGCTTCT CGAGGGGGGA CCTGCCCGAG AGGTACGGGG ACTACGCGTG GCTCTCCGGT GTCTACGGCG CCGGGATCCT GGGCGGCAGG CTCAGGGACG TGGGCGCGTC GAGCCTCGAC GACGTGAAGC GCGCCTGCGA CAGGCTGGAG CCCCTCCTCT CACTCCTCGA GGAGCTGAGG AAAAGCCAGG AGGGCGCGGG GGAGCTCTTC ATAGTTGGGC ACAGCCACAT AGACGCCGCG TGGCTCTGGC CCGTGCAGGA GACCAGGGAG AAGGTCGCGA GGACGTTTGC GAACGTGGTC TCGCTCCTGA GGGAGTACGG GGGACCCGTG TACGCGCAGA GCTCGGCGCT GTACTACGAG TGGGTCGAGG AGGACTACCC GGAGCTTTTC CGCGAAATCA AGCGCCTCGT CGAGTCGGGT AGGTGGATAC CGGTTGGGGG CATGTGGGTT GAGAGCGACG TCCAGCTCGT GGAGGGGGAG AGCCTGGCGA GGCAGTTCCT GTACGGCCAG AGGTACTTCT ACTCCAGGTT CGGGAGGACA GCCCGGATCG GGTGGATACC GGACAGCTTC GGCTTCCCCT ACTCGTTGCC CCAGCTACTC GTCAAGAGCG GGCTCGAGTG CTTCGTGACG CACAAGGTGC TCTGGAACGA TACCAACGAG TTCCCCTACC ACTCCTTCCT CTGGAGGGGG GTCGACGGCT CCACTATACC CGTCCAGATC CTCCTGAACA GCTACAACGA GATGCTCACC CCGGGAAGCG TGAAGGCGTA CTGGTCCAGG TACAAGCAGA GGGGCGAGGT CCCCTTCCTG CCGTACGCCT ACGGCTACGG GGACGGGGGA GGGGGGCCGA CCAGGGAGAT GCTCGAATCC CTCGATGTCG TCAAGAGGCT CCCCGGGTTG CCGAGGCTCA GGCACCTGGA AGAGGAGGAG TACCTGGCGA GGCTCAAGGC TTCCGAGGGC TCGATGCCCG TGTGGGACGG GGAGCTCTAC GTGGAGTTCC ACAGGGGGAC CTACACCACG AACCTACAGG TAAAGGAGCT GATGTGGAGG GCGGAGCTAG CCCTCCTGAC AGCCGAGGAG GTCGCCTCGC TCGCATCCTC CAGGACGGGG GAGGGGCTGT CGGCGCGCCT CGAGAGGCTC TGGAAGACCC TCCTCCTGCA CCAGTTCCAC GACATCCTGC CGGGGTCCTC GATGAAGGAG GTCTACGAGG ACGCGTACAG GGACCTCTCG 
ATGGTCCTGC GGGAGGCCTC CTCCCTGGCC CGGGAGGCCC TCGAAGCCTA CCTGGGGGGC GGCGAAGGCG GGGTCGCAGT CTTCAACCCC CTGCCGTGGC CCCGGAGGGC TATCGTCTCC CTCCCCATGG GCATGGCGCC GCGCGGCGCG GAGTGCCAGG AGTCGGAGGA GGGGGTTCTG GTCGAGGTCG AGCTACCGCC GGCCGGCTAC CGCGTCTACA GCGCGTCGGG CGGGTGCGTG GATGAAGGGG AGCCGCTGAG AGTAGAGGAG GTCGACGGCG GGATACTGCT CTCCAGCGGC GCAGTCGAGG TGGTGGTAGA CCGGGACGGC TCGCTGGGGT CCGTGAGGTT GCTCCCCGGC GGAGAGGAGG TGCTGTCGCG GCCATCAAAC CAGCTCCGCG TGCACTTCGA CAGGCCGGGG GTTTTCGACG CCTGGGAGCT TACGGACGAC TTCCTGTCCA GGTGGGAGGA GGCGAGGACG CTCTCGGAGC CCAGGCTCGT TGAGCGCGGG CGGCTGAGGG CGTGCGTCGA GTACGTAAAG GGCTTCGGCA AGTCCAGAGT TACGCAGAGG GTGTGCGTCT ACAGGTTCAG CCCCGTGGTG GAGGTGAAGA CGAGGCTCGA GTGGTTCGAC AAGGGGTTCC TGGTGAAGGC CTGGATGAAG CCAGCGTTCA AGCCCACCGC GGCTGTCTTC GAGACCCCCT ACGGCGTCGT GTACAGGAGC CCGGCTTGGG CTTCGAGCAT CGACAGGGCG AAGTTCGAGG CGCCAGCGCT CAGGTGGGTC GACGTCTCCG ACGGCGCTAG GGGCTTCGCG GTGATAGCCC CGGGGAGGCA CGGCTACTCC GTCCGGGAGG ACTACGTTAG CCTCAGCCTC CTCAAGTCCC CCACGTTCCC GAACCCGTGG AGCGACGTGG GGGAGTTCGA GACGACGTAC TACATCTACC CGCACCTAGG AGGCTACGAG GAGGCACGCG TAGCCTTCGT GGCCGCCGAG CTCCTCAGGC AGCCCCTCGC CGTGGCTACG GCGCCTGTAG AGCGCTCCGA GAGCTTCCTC TCCGTCAACC CCCCGGAGGC CCTCCTAGGG GCGTTTAAGG CGGCGGAGGA CGGGGACGGC TACGTGATGA GGCTGTACAA CCCCCACCGC GGAGAAGTAG AGGTGGAGGT AGAGGTGAAC GTCCCCTTTA GGGAGGCCGT GGAGGTAGAC ATACCGGAGC TGAGGGTCCT GGGGAGGGTG GAGGTGTCGG GCGGCAAGCT ACGCTTAAGG CTCAAGCCCT TCGAGGTAAA AACGGTAAAG CTCAGGTAG
|
Protein sequence | MKLRDAERSL FDLLVASVQR YIPLSVWRLD GREKFLPVRV QAEPGRIYEF VGKARVPSSE HRWFAKFVLS GNATLEVDGR LVGGIDEAHT YFPLDPGLHE LRIRASPRGM FGFHDWSLSF EKAFLAEVYW RGFSLALKLL SLVSYLKSLP PDDAVRRRVE GELFDALMSA PVSPSVLQIT VALSLLYEGG RSFSRGDLPE RYGDYAWLSG VYGAGILGGR LRDVGASSLD DVKRACDRLE PLLSLLEELR KSQEGAGELF IVGHSHIDAA WLWPVQETRE KVARTFANVV SLLREYGGPV YAQSSALYYE WVEEDYPELF REIKRLVESG RWIPVGGMWV ESDVQLVEGE SLARQFLYGQ RYFYSRFGRT ARIGWIPDSF GFPYSLPQLL VKSGLECFVT HKVLWNDTNE FPYHSFLWRG VDGSTIPVQI LLNSYNEMLT PGSVKAYWSR YKQRGEVPFL PYAYGYGDGG GGPTREMLES LDVVKRLPGL PRLRHLEEEE YLARLKASEG SMPVWDGELY VEFHRGTYTT NLQVKELMWR AELALLTAEE VASLASSRTG EGLSARLERL WKTLLLHQFH DILPGSSMKE VYEDAYRDLS MVLREASSLA REALEAYLGG GEGGVAVFNP LPWPRRAIVS LPMGMAPRGA ECQESEEGVL VEVELPPAGY RVYSASGGCV DEGEPLRVEE VDGGILLSSG AVEVVVDRDG SLGSVRLLPG GEEVLSRPSN QLRVHFDRPG VFDAWELTDD FLSRWEEART LSEPRLVERG RLRACVEYVK GFGKSRVTQR VCVYRFSPVV EVKTRLEWFD KGFLVKAWMK PAFKPTAAVF ETPYGVVYRS PAWASSIDRA KFEAPALRWV DVSDGARGFA VIAPGRHGYS VREDYVSLSL LKSPTFPNPW SDVGEFETTY YIYPHLGGYE EARVAFVAAE LLRQPLAVAT APVERSESFL SVNPPEALLG AFKAAEDGDG YVMRLYNPHR GEVEVEVEVN VPFREAVEVD IPELRVLGRV EVSGGKLRLR LKPFEVKTVK LR
|
| |
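The sequences above are displayed in space-separated blocks of 10 characters. The following is a minimal sketch of how that display formatting might be stripped before computing a GC fraction. The function names are hypothetical, and the input is only the first three blocks of the gene sequence, not the full record, so the result is not expected to equal the GC content field.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace used to display a sequence in 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Toy input: the first three blocks of the gene sequence shown above.
fragment = clean_sequence("ATGAAGCTAC GCGACGCCGA GAGGTCGCTG")
print(len(fragment))  # 30
print(round(gc_fraction(fragment), 2))
```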