Gene Tpen_0932 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0932 
Symbol 
ID4600755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp881213 
End bp884776 
Gene Length3564 bp 
Protein Length1187 aa 
Translation table11 
GC content60% 
IMG OID639773711 
Productglycoside hydrolase family protein 
Protein accessionYP_920336 
Protein GI119719841 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1449] Alpha-amylase/alpha-mannosidase 
TIGRFAM ID[TIGR03593] membrane protein insertase, YidC/Oxa1 family, N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.26895 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAG CCTTGTTGGC GGCTTTACTC CTGGCACAGC TACTACCATT GCTCCTACCC 
CTAGCATACG CGGAGAACGC TCCGGTATTC GAGGACAGGG GGGCAACCGT CTACGTCGAC
ACGGGCGTCG CGAAGATACT GATAAGTAAG ACGGGGGCCA GGCCTATAGG CTGGCTTGTC
GACGGCGTGG AGCTTGCGGC TAATGCTGGT AGTAGGGGTG CCATGGGTAA TAGCTACCCA
CTCTACGACT GGTTGCCTAC GCAGTCCTGG CCAGGGCTAA TAGTAACCGC GAAGTTCAAC
GTCGAGACTT CGAGGCCATC GCAGGACGTC CTCGTAGTGA GGTTTACAGC CGTCATACCC
GTGACTGACG TCAGCGGCGT AACTTCGAAC GTCGCCGTGA CCCGCGAGTA CGTGTTCAAG
CAAGGCTCGA GGACGTTTAA CCTCACGGTA ACCCTCGTGA ACATGGGGAG CTCTCCCATG
AAGATAGAGG TGAACTGGGG AGGAGCCCTC GTAGGCTACG CCTTCGCCGT TACCGGCGTG
GTAGGGCAGA AGGGCGACAA CGACTGGCAA CTCTGGGTAG ACGGGGACTC GCTCTACACC
CGCTCACCCG ATAACAGGGA GAGCTGGGTT AGGAAGCAGA GCCCCCTCAT AAGGCTCGTG
GGGATATACG ACCCCGACGA GAAAGGGATC ATCGTCGCCA GGGTGTACAA CAGGACGGAG
TCCATCTGGT TCGAAGTCGG AGGGTGGGGC ACCGAGGTTA GAGTAGAGCA TCCCACCCTT
GTTCTCAACC CCAACGTCCC GGTAAGGTTC ACGTACACCG TGTACGGGGG CTCGCCCGAC
TACCTGGACA AGGAGGGCTT CGCGGATCTG AAGGACAGGC TCATGGGCAA GCAGCCGCAA
CAGCAAGCCC CTCCAGCGCC GCAGGCTACG AGGACGCTAC CCCCGATGCC CAAGCCTACA
GGCAAGATAA GCTACAGCCT CACCCAGGAC TACGCGCTGG TCGACACCGG AGTAGCGAAG
ATAAAGATCT CCCTTAGAGG GGCCAGGCCC GTCGAGTGGC AAGCCGGGGG AGTGGAGCTT
GCGGCTAATG CTGGTAGTAG GGGTGCTATG GGTAATAGCT ACCCACTCTA CGACTGGTTG
CCTACGCAGT CCTGGCCAGG GCTAATAGTA ACCGCGAAGT TCTCGGCGTC CGTTGCCTTC
TCAAACGACA CGGTCCTCGT CCTCAAAATG GAGGCAGACG TCCCGTTCAC CAACCCGAGC
GGCGACCAGA CCGTTCTACA CGTAACGAAG ACCTTCACCT TCTACGCAGG GCTATACGGC
TTCGACGAAA CCACCCAGAT AACCAACAAG GGCAACGTAA AGGCTGAGAG CAAGGTAAAC
TGGGACAGGA CGATAGGCTA CACCTTCGCT GTTACCGGCG TGATAGGGCA GAAAGGTGAG
AATGACTGGC AGGCCTGGAA GGACGGCGCG GGGCTCCACC TGCTTTCGCC GAAGGAGAAG
CTCCACTTCA CGTGGAAGTC GGCGCAGGAC CTCGAATGGA TAGGGATGAT AGACCCGGAG
GAGGGTGCCT GCATAATCGC GTACCTGGGA ACCCCGGGCG CCGGGATAGT GTGGCTTGAA
GCGTCGACCT GGGGAACCGA GGTAAGAGCC GAGTACCCGC CCTTCACGCT GAACCCCGGC
GACAGCGTGA CCTTCGTCTA CAGGGTCTTC GGCGGGCCCG TAGACGCGCT GGCAAGCTAC
GGCTACAAGG ACGTGTACCA GCAGCTAGGC GCGGCGGCGG CTCAACAGGC CCTCTACCTA
GTCGTGTCGA CGGACAGCCT ACTCTACTCC CCAGGCCAGA AAGCCAAGTT CAACGTAACC
GCGTCCTACA GGAGGGGCAC CATAAAGGGA ACGCTGACCG TCTCCCAGGG AGGCACCACC
CTCTACGAGA GCGAGGTAAC GCTCTCACAG CAACCCCAAA GCGTGTTCGT AGAGGTTACG
GTTCCGACGA AGCCCGGTAT CTACTCGTAC ACAGTGGCGC TGCGAACACA GGAGTTTACA
GCTACCCGCG AGGTGAGGAT AGGCGTCGTG GACCCGGCTT CGTGGCGCTC ACCGCTCAAG
CTCGTATTCG TTTGGCACAG CCACCAGGGC ATAAACGCGT GGCCCAACGG GTCGTTCCAC
GGCCCCTGGG CCTTCAAGCA CACATACGAG GACGAGTTCA AGCCCTACTA CGAGGGCGGA
GCCTACCTCG TGCAACCAGT GATTCTGTCC AAGTATCCGG GCGTCAAGAT GGTGTACCAC
CTTAGCCCGA GCCTGCTTTG GCAGTGGGAG TACGGGCTCA AGTACGGCTA CTTTGACTCC
TTCTCGCAGA AGTTCATCTC GCCAGAGAGC CCCGAGATGG CGCGTGTCAG GAAGGCCCTA
GAGCTCTACA AGCAGCTGGC TAGCAGGGGG CAGATAGAGA TATTCAGCGA CTTCTTCAAC
CACCCGATCC CAGGCTACGT CGTCGACACC TACCCGTGGG GAGCCGAAGC GATGAAGATA
GAGCTCGAGT GGGGCTTCAA CGTGACTAAG AGGGTTCTCG GCGTAGACGC TAAGGGCGCC
TGGATCCCCG AGATGTTCTT CAGCGAGAAG CTAGTACCCA TACTGGTCCA GCAGGGAGTC
AAGTACATCG TCCTAGACTA CGAGACGCAC TACAAGGGCT CCGACGGGGA GAAGAGGGGG
ATATACACGC CCTACCTATA CCAAGGCTCC CAGGGCCAGA TAATCGTGCT CTTCAGGGAC
ACCCAGCTGA GCAACTACAT AAGCTTCGAG AACAGGTTCT CGACTCCTGA AGACGCCGAC
GCTGCGGCCA GGAACTTCGT CCTAATGGTG GCTTCGAGGA GGTTCTCCGA CCCCACGGCA
AGCGTGGTGG TCATCTCCGC CGACGGCGAA AACTGGATGA TCTTCAGCCC GACGGTCGCT
ACTACGGGCG TATTCTTCGA GAGGCTGTGC GCGTACCTAG AGAGCGTGAA GCAGTACATA
GTGACAGCCA CGGTCGCGGA GGTAGTGGAG GGCGCGCAGA GCTTCCCGAA GCTGACGAGA
ATACCGACGG CGAGCTGGGC TGGGGGTAGC GATGTCTGGA CCAACAAGCC GGAGCAGAAG
CAGCAGTGGG ACTGGATAAA CAAGGCGGCG TCCGTGCTGG CGAACATCAA GTCGAAGTAC
GGCGAAAACA GCGCCGTCTA TAAAGCCGCG CTATTCGCGT TCTTCATGTC CCTCAACAGC
GACGTCATAC ACAGGGACTT CACGTTCCCA GCGCACACCA AGGCCTGGGT AGACACGGTG
CAGGCGCTCT ACGACTCCGG CGAAGCCGTA GCCTCGAAGC TGAACAGGAT AGGGTTAAAC
CTCGAAGCAA CCAGAGCCCA AGCCGTAACC CCGCAACCCG GGACACAGCA GCCACCTCAG
CAACAACCCA CGCAACCAGG AGGCCAGCAG GCCCCCAGCG AGATCCAACC CCAGTGGCTC
GGGCTAGCAG CGCTCATAGC AGCGGTTGTT GCGGTAGCGG CGGTCTACGC GTACAGGAAA
CGCGCCGGGA AGAGTAAGCA GTAA
 
Protein sequence
MRKALLAALL LAQLLPLLLP LAYAENAPVF EDRGATVYVD TGVAKILISK TGARPIGWLV 
DGVELAANAG SRGAMGNSYP LYDWLPTQSW PGLIVTAKFN VETSRPSQDV LVVRFTAVIP
VTDVSGVTSN VAVTREYVFK QGSRTFNLTV TLVNMGSSPM KIEVNWGGAL VGYAFAVTGV
VGQKGDNDWQ LWVDGDSLYT RSPDNRESWV RKQSPLIRLV GIYDPDEKGI IVARVYNRTE
SIWFEVGGWG TEVRVEHPTL VLNPNVPVRF TYTVYGGSPD YLDKEGFADL KDRLMGKQPQ
QQAPPAPQAT RTLPPMPKPT GKISYSLTQD YALVDTGVAK IKISLRGARP VEWQAGGVEL
AANAGSRGAM GNSYPLYDWL PTQSWPGLIV TAKFSASVAF SNDTVLVLKM EADVPFTNPS
GDQTVLHVTK TFTFYAGLYG FDETTQITNK GNVKAESKVN WDRTIGYTFA VTGVIGQKGE
NDWQAWKDGA GLHLLSPKEK LHFTWKSAQD LEWIGMIDPE EGACIIAYLG TPGAGIVWLE
ASTWGTEVRA EYPPFTLNPG DSVTFVYRVF GGPVDALASY GYKDVYQQLG AAAAQQALYL
VVSTDSLLYS PGQKAKFNVT ASYRRGTIKG TLTVSQGGTT LYESEVTLSQ QPQSVFVEVT
VPTKPGIYSY TVALRTQEFT ATREVRIGVV DPASWRSPLK LVFVWHSHQG INAWPNGSFH
GPWAFKHTYE DEFKPYYEGG AYLVQPVILS KYPGVKMVYH LSPSLLWQWE YGLKYGYFDS
FSQKFISPES PEMARVRKAL ELYKQLASRG QIEIFSDFFN HPIPGYVVDT YPWGAEAMKI
ELEWGFNVTK RVLGVDAKGA WIPEMFFSEK LVPILVQQGV KYIVLDYETH YKGSDGEKRG
IYTPYLYQGS QGQIIVLFRD TQLSNYISFE NRFSTPEDAD AAARNFVLMV ASRRFSDPTA
SVVVISADGE NWMIFSPTVA TTGVFFERLC AYLESVKQYI VTATVAEVVE GAQSFPKLTR
IPTASWAGGS DVWTNKPEQK QQWDWINKAA SVLANIKSKY GENSAVYKAA LFAFFMSLNS
DVIHRDFTFP AHTKAWVDTV QALYDSGEAV ASKLNRIGLN LEATRAQAVT PQPGTQQPPQ
QQPTQPGGQQ APSEIQPQWL GLAALIAAVV AVAAVYAYRK RAGKSKQ