Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0932 |
Symbol | |
ID | 4600755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 881213 |
End bp | 884776 |
Gene Length | 3564 bp |
Protein Length | 1187 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639773711 |
Product | glycoside hydrolase family protein |
Protein accession | YP_920336 |
Protein GI | 119719841 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1449] Alpha-amylase/alpha-mannosidase |
TIGRFAM ID | [TIGR03593] membrane protein insertase, YidC/Oxa1 family, N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.26895 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAG CCTTGTTGGC GGCTTTACTC CTGGCACAGC TACTACCATT GCTCCTACCC CTAGCATACG CGGAGAACGC TCCGGTATTC GAGGACAGGG GGGCAACCGT CTACGTCGAC ACGGGCGTCG CGAAGATACT GATAAGTAAG ACGGGGGCCA GGCCTATAGG CTGGCTTGTC GACGGCGTGG AGCTTGCGGC TAATGCTGGT AGTAGGGGTG CCATGGGTAA TAGCTACCCA CTCTACGACT GGTTGCCTAC GCAGTCCTGG CCAGGGCTAA TAGTAACCGC GAAGTTCAAC GTCGAGACTT CGAGGCCATC GCAGGACGTC CTCGTAGTGA GGTTTACAGC CGTCATACCC GTGACTGACG TCAGCGGCGT AACTTCGAAC GTCGCCGTGA CCCGCGAGTA CGTGTTCAAG CAAGGCTCGA GGACGTTTAA CCTCACGGTA ACCCTCGTGA ACATGGGGAG CTCTCCCATG AAGATAGAGG TGAACTGGGG AGGAGCCCTC GTAGGCTACG CCTTCGCCGT TACCGGCGTG GTAGGGCAGA AGGGCGACAA CGACTGGCAA CTCTGGGTAG ACGGGGACTC GCTCTACACC CGCTCACCCG ATAACAGGGA GAGCTGGGTT AGGAAGCAGA GCCCCCTCAT AAGGCTCGTG GGGATATACG ACCCCGACGA GAAAGGGATC ATCGTCGCCA GGGTGTACAA CAGGACGGAG TCCATCTGGT TCGAAGTCGG AGGGTGGGGC ACCGAGGTTA GAGTAGAGCA TCCCACCCTT GTTCTCAACC CCAACGTCCC GGTAAGGTTC ACGTACACCG TGTACGGGGG CTCGCCCGAC TACCTGGACA AGGAGGGCTT CGCGGATCTG AAGGACAGGC TCATGGGCAA GCAGCCGCAA CAGCAAGCCC CTCCAGCGCC GCAGGCTACG AGGACGCTAC CCCCGATGCC CAAGCCTACA GGCAAGATAA GCTACAGCCT CACCCAGGAC TACGCGCTGG TCGACACCGG AGTAGCGAAG ATAAAGATCT CCCTTAGAGG GGCCAGGCCC GTCGAGTGGC AAGCCGGGGG AGTGGAGCTT GCGGCTAATG CTGGTAGTAG GGGTGCTATG GGTAATAGCT ACCCACTCTA CGACTGGTTG CCTACGCAGT CCTGGCCAGG GCTAATAGTA ACCGCGAAGT TCTCGGCGTC CGTTGCCTTC TCAAACGACA CGGTCCTCGT CCTCAAAATG GAGGCAGACG TCCCGTTCAC CAACCCGAGC GGCGACCAGA CCGTTCTACA CGTAACGAAG ACCTTCACCT TCTACGCAGG GCTATACGGC TTCGACGAAA CCACCCAGAT AACCAACAAG GGCAACGTAA AGGCTGAGAG CAAGGTAAAC TGGGACAGGA CGATAGGCTA CACCTTCGCT GTTACCGGCG TGATAGGGCA GAAAGGTGAG AATGACTGGC AGGCCTGGAA GGACGGCGCG GGGCTCCACC TGCTTTCGCC GAAGGAGAAG CTCCACTTCA CGTGGAAGTC GGCGCAGGAC CTCGAATGGA TAGGGATGAT AGACCCGGAG GAGGGTGCCT GCATAATCGC GTACCTGGGA ACCCCGGGCG CCGGGATAGT GTGGCTTGAA GCGTCGACCT GGGGAACCGA GGTAAGAGCC GAGTACCCGC CCTTCACGCT GAACCCCGGC GACAGCGTGA CCTTCGTCTA CAGGGTCTTC GGCGGGCCCG TAGACGCGCT GGCAAGCTAC GGCTACAAGG ACGTGTACCA GCAGCTAGGC GCGGCGGCGG CTCAACAGGC CCTCTACCTA GTCGTGTCGA CGGACAGCCT ACTCTACTCC CCAGGCCAGA AAGCCAAGTT CAACGTAACC GCGTCCTACA GGAGGGGCAC CATAAAGGGA ACGCTGACCG TCTCCCAGGG AGGCACCACC CTCTACGAGA GCGAGGTAAC GCTCTCACAG CAACCCCAAA GCGTGTTCGT AGAGGTTACG GTTCCGACGA AGCCCGGTAT CTACTCGTAC ACAGTGGCGC TGCGAACACA GGAGTTTACA GCTACCCGCG AGGTGAGGAT AGGCGTCGTG GACCCGGCTT CGTGGCGCTC ACCGCTCAAG CTCGTATTCG TTTGGCACAG CCACCAGGGC ATAAACGCGT GGCCCAACGG GTCGTTCCAC GGCCCCTGGG CCTTCAAGCA CACATACGAG GACGAGTTCA AGCCCTACTA CGAGGGCGGA GCCTACCTCG TGCAACCAGT GATTCTGTCC AAGTATCCGG GCGTCAAGAT GGTGTACCAC CTTAGCCCGA GCCTGCTTTG GCAGTGGGAG TACGGGCTCA AGTACGGCTA CTTTGACTCC TTCTCGCAGA AGTTCATCTC GCCAGAGAGC CCCGAGATGG CGCGTGTCAG GAAGGCCCTA GAGCTCTACA AGCAGCTGGC TAGCAGGGGG CAGATAGAGA TATTCAGCGA CTTCTTCAAC CACCCGATCC CAGGCTACGT CGTCGACACC TACCCGTGGG GAGCCGAAGC GATGAAGATA GAGCTCGAGT GGGGCTTCAA CGTGACTAAG AGGGTTCTCG GCGTAGACGC TAAGGGCGCC TGGATCCCCG AGATGTTCTT CAGCGAGAAG CTAGTACCCA TACTGGTCCA GCAGGGAGTC AAGTACATCG TCCTAGACTA CGAGACGCAC TACAAGGGCT CCGACGGGGA GAAGAGGGGG ATATACACGC CCTACCTATA CCAAGGCTCC CAGGGCCAGA TAATCGTGCT CTTCAGGGAC ACCCAGCTGA GCAACTACAT AAGCTTCGAG AACAGGTTCT CGACTCCTGA AGACGCCGAC GCTGCGGCCA GGAACTTCGT CCTAATGGTG GCTTCGAGGA GGTTCTCCGA CCCCACGGCA AGCGTGGTGG TCATCTCCGC CGACGGCGAA AACTGGATGA TCTTCAGCCC GACGGTCGCT ACTACGGGCG TATTCTTCGA GAGGCTGTGC GCGTACCTAG AGAGCGTGAA GCAGTACATA GTGACAGCCA CGGTCGCGGA GGTAGTGGAG GGCGCGCAGA GCTTCCCGAA GCTGACGAGA ATACCGACGG CGAGCTGGGC TGGGGGTAGC GATGTCTGGA CCAACAAGCC GGAGCAGAAG CAGCAGTGGG ACTGGATAAA CAAGGCGGCG TCCGTGCTGG CGAACATCAA GTCGAAGTAC GGCGAAAACA GCGCCGTCTA TAAAGCCGCG CTATTCGCGT TCTTCATGTC CCTCAACAGC GACGTCATAC ACAGGGACTT CACGTTCCCA GCGCACACCA AGGCCTGGGT AGACACGGTG CAGGCGCTCT ACGACTCCGG CGAAGCCGTA GCCTCGAAGC TGAACAGGAT AGGGTTAAAC CTCGAAGCAA CCAGAGCCCA AGCCGTAACC CCGCAACCCG GGACACAGCA GCCACCTCAG CAACAACCCA CGCAACCAGG AGGCCAGCAG GCCCCCAGCG AGATCCAACC CCAGTGGCTC GGGCTAGCAG CGCTCATAGC AGCGGTTGTT GCGGTAGCGG CGGTCTACGC GTACAGGAAA CGCGCCGGGA AGAGTAAGCA GTAA
|
Protein sequence | MRKALLAALL LAQLLPLLLP LAYAENAPVF EDRGATVYVD TGVAKILISK TGARPIGWLV DGVELAANAG SRGAMGNSYP LYDWLPTQSW PGLIVTAKFN VETSRPSQDV LVVRFTAVIP VTDVSGVTSN VAVTREYVFK QGSRTFNLTV TLVNMGSSPM KIEVNWGGAL VGYAFAVTGV VGQKGDNDWQ LWVDGDSLYT RSPDNRESWV RKQSPLIRLV GIYDPDEKGI IVARVYNRTE SIWFEVGGWG TEVRVEHPTL VLNPNVPVRF TYTVYGGSPD YLDKEGFADL KDRLMGKQPQ QQAPPAPQAT RTLPPMPKPT GKISYSLTQD YALVDTGVAK IKISLRGARP VEWQAGGVEL AANAGSRGAM GNSYPLYDWL PTQSWPGLIV TAKFSASVAF SNDTVLVLKM EADVPFTNPS GDQTVLHVTK TFTFYAGLYG FDETTQITNK GNVKAESKVN WDRTIGYTFA VTGVIGQKGE NDWQAWKDGA GLHLLSPKEK LHFTWKSAQD LEWIGMIDPE EGACIIAYLG TPGAGIVWLE ASTWGTEVRA EYPPFTLNPG DSVTFVYRVF GGPVDALASY GYKDVYQQLG AAAAQQALYL VVSTDSLLYS PGQKAKFNVT ASYRRGTIKG TLTVSQGGTT LYESEVTLSQ QPQSVFVEVT VPTKPGIYSY TVALRTQEFT ATREVRIGVV DPASWRSPLK LVFVWHSHQG INAWPNGSFH GPWAFKHTYE DEFKPYYEGG AYLVQPVILS KYPGVKMVYH LSPSLLWQWE YGLKYGYFDS FSQKFISPES PEMARVRKAL ELYKQLASRG QIEIFSDFFN HPIPGYVVDT YPWGAEAMKI ELEWGFNVTK RVLGVDAKGA WIPEMFFSEK LVPILVQQGV KYIVLDYETH YKGSDGEKRG IYTPYLYQGS QGQIIVLFRD TQLSNYISFE NRFSTPEDAD AAARNFVLMV ASRRFSDPTA SVVVISADGE NWMIFSPTVA TTGVFFERLC AYLESVKQYI VTATVAEVVE GAQSFPKLTR IPTASWAGGS DVWTNKPEQK QQWDWINKAA SVLANIKSKY GENSAVYKAA LFAFFMSLNS DVIHRDFTFP AHTKAWVDTV QALYDSGEAV ASKLNRIGLN LEATRAQAVT PQPGTQQPPQ QQPTQPGGQQ APSEIQPQWL GLAALIAAVV AVAAVYAYRK RAGKSKQ
|
| |