Gene Tpen_1316 details


Gene Information

Locus tag: Tpen_1316
Symbol:
ID: 4601996
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1261982
End bp: 1264918
Gene Length: 2937 bp
Protein Length: 978 aa
Translation table: 11
GC content: 72%
IMG OID: 639774091
Product: hypothetical protein
Protein accession: YP_920716
Protein GI: 119720221
COG category: [R] General function prediction only
COG ID: [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs)
TIGRFAM ID: [TIGR02577] CRISPR-associated protein, Crm2 family
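
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS includes its stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length(1261982, 1264918)
print(length)                  # 2937
print(protein_length(length))  # 978
```

Both results match the Gene Length and Protein Length fields reported in the record.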


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.800539
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGGGGCG TGTACTCGAG CATCGGGGAG CTACTGGAGG TGAAGACCGC CGCCCTGCTG 
TACGAGCCGC CGTGGAAGGC CTTCGCGCAG CTGAAGAAGG CGCCGCTACT GCCCTGGGCG
AAGCGGGGGG CGGGGCACTG GGAGGCGGAC GCCCTCGCGG TGGCGGAGAG GCTGGGGCTC
GCGGAGGTCC TCGAGAGGAG GCTCGGGGAG GTGAAGGAGC TCGCCCGCCT GGCGCTCGGC
GCCGAGCCGC TCGTGCTCGA CGGGGCAGGC GTGCAGCCGC CGGAGAGGCT CGTGCTCCTG
AACGTGTTCG ACCCGGACGT GTGCCTCGAC CCGGGCAACG GCTGCGACCT CAGGGCGTAC
GCGCGGACAG TGGAGTCCGC CGCCCCGGCC TTCGCGGAGA GCCTCGCGAA CGCTCTCTCA
GGCCTGGGGG ACGCGGCCCT GAGGTACCAC GCGCTCTGGT TCCTCCTGGA GCCGCTCTGG
TTCCGCGCGT GCGGCGCGCC CGTCGTATCG CCGGCCGACC CGAGGTTCCC GGTGTACACG
GTCTTCGACC ACGTCTACGC GGCGGCCACG CTCGCGAACG CCTTCAGGGG CGGCTCCTTC
AGGGGCTACG TGGTCGTCGT GGACTACGCG GGGGTCCAGG AGTACATCTC CAAGGCCAGG
AAGGCCCTCG ACCTCTGGGC CTCCTCGTGG GTAGCCTCGC TCGTCACGTG GGCCACCGTC
CAGCCGTTCG TCGAGCTCCT GGGGCCAGAC GTCGTCGTGA CGCCCTCGCT GAGGGGGAAC
TGGTTCTACG CGGCGTGGCT CCTCGGAAGG CTCAGGGGCA CCGGGGCCTA CGGGGCGGCC
AGGGAGGCGG CGAGGCTCGC GTACGGCTAC TCGGGCTTCC CGAGGCACCC GATAATGCCC
ACGCGCGCTG TGCTCTTCCT CCCGGAGGTT CCCGGGGCCC TCGAGGACGC CCTGCGCGAC
GAGGCCTCGC TTAGGGGCTT CATAGAGAGG AAGGCCTCCG AGGCCTGGTC CAAGGCGGTC
GGAGCCGTCC TCTCCGAGGG GAGGCTGGGC GCGCTGCTCG GCGAGACCCT CAGGGCGAAC
GGGGTCGAGG CAGACGCCTC GGAGCTGGAG GAGTACGTCG AGAGCGTCGC CTCGACGCTA
CCGCTCCCGC AGAGGCTCAT AGTGTTCAGG CACGGGGAGT CCTACGCTAG GTACAGGGAG
TGGCTGGGCA GGGTGTTCGG CGGGGAGCCC GCGGCGGAGG CGGGCGGCGC CAGGGTATCG
CTCGACGAGC GGGCGCTCTA CTTCCACTAC CTCATGGACG AGTACATCCC GTCGGAGGAG
GCCCTGTGGA AGCTGAGCAG GGTGGACCCG GACGTCGAGG CCCAGTCGAG CCGATGGTGC
GGCGCGGCGG CCGAGGTCTT CGGCAGGCTC GGCTTCTGCA GCGTGTGCGG CGAGAGGCCG
GCCGTGGTCG GGGCGCCGTC CAGCAGGTAC GGGGGGCTCG GCCCCGAGTC TAGGAGGGTC
GTGACGGAGG GCGAGAGGCT CTGCCCGAGG TGCCTCGCGA AGAGGCTGGT AGCCCTCAAC
CCCTTCGCCG CGCTGAGGGC CGTCGGGGTC CCGGCGGCGA GGACGTTCTG GTCCGTCCCC
ACGACGAACG AGCTCGCGAA CGCGGAGGTA GCGGAGGAGC ACCTGGACAG GGTCCTCGAC
GTGGTCGAGA GGCACCGGGA GGCCGTGGCG AAGGTCCTCG CGGGCTTCCA GTGGGACCAC
GAGTACTACT CGCGGAGGCT CGCGCGGAGG GCCTACAGGA GGGCCGGCAG GGAGGTCGCG
CTCGTCGCGC TCAGGCTGCT CGCAGCCCTC ATCGAGGCGG CGACGGAGCA GGAGTACAGG
GAGAGGTTCG CGAAAGCCCT GGACGCGCTC GGGGATGGGG AGGCGCGCAG GGAGATCGAG
GAGCTGTTCA GGTCCATAGC CGGGAAGAGG AGGACGCGGC TCGCCGTGGT GAAGGGGGAC
GGGGACTACG CGGGGTCCAG GCTGCTCCGG GCGAGGCTGC GGCTCGGCGC GGGGGAGTAC
GCGGGCAGGG TCGCGGCGCA GGCCGGGCTC GGGGGCGGGG CGGCGGAGCT ACTCGCGAGC
GTAGCCGGGA GGCTCGGCTC GACCGTCACG CTCTCACCCC TCTACACCGC CTCGGCGTCC
AGGTCGCTCA GCCACGGCGC CGTCCTAGAC GCACGCGCGG TCGAGGGGCT GGGGGGCTTC
GTGGTCTACG CGGGCGGGGA CGACCTGCTC GCGCTGACGC CTGCGACGGC CGGCGGGAGG
TACGTGGCGC TGGAAGCCGC CCTCAGGACC CGCAGGGGCT ACTGGGGGGA GCGCGGGAGG
GGGCCGAGGG GCTTCCACGC GTCGCCCGGA ATCCCGCTCG TCTCGCCCGC CCCCAGGAGC
TACGGAAGGT CGTACGCGGT GCTCTCGATC CACTACAGGG ACCCGCTCAG CGCGGCCGTG
GCGAGGTCGT CCGAGATGCT CGAAGAGGCG AAGGCTTGCA CCGCGGTGTA CGGGCCCGCA
ACCGTCGAGA GGGACGCCGT CCGGCTAGAG GACCTCAGGT CCGGCTCCTC CGCGATGCTA
CCCTTCAGGG CGGGGCCCGG CGCCGGGCTG GCCGGGAGCC CCGTTTGGAA GGCCGCGCTA
CTGGCCTCCA TGGAGCTACG CGGGGAGGTC TCGTCGAGCC TCTTCTACGA CTTCTTCTCG
TGGCGCTTCG ACGAGCTCGT GGAGAGGCTC GCCGCCGAGT CGAGGACGCG CGAGGCGTGG
CTCGCCCTGC GCTACCTGGT CTCCAGGAAC ACCAGGAGGA ACCCCGACGA GGTGGCGGGG
AGGGTGCTGG GCGAGGAGCT TGTAGCGGCT AGGCTCAGGT CGGCGGACGG GAGGGAGGAG
AGCCTCGCAA CCCTGGTGCT CCGGGCCGCC AAGGCGCTCA AGAGGATGGA GGGGTGA
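
The record reports a GC content for this gene. As a rough sketch of how such a figure might be derived from a sequence printed in space-separated blocks of ten bases, the hypothetical helper below strips whitespace and counts G and C. It is illustrated on the first block of the gene sequence only, not on the full gene.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases; spaces and line breaks are ignored."""
    dna = "".join(sequence.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First ten bases of the gene sequence above.
first_block = "GTGAGGGGCG"
print(f"{gc_content(first_block):.0%}")  # 80%
```

Pasting the whole gene sequence into `gc_content` as a multi-line string would give the gene-wide value for comparison with the GC content field.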
 
Protein sequence
MRGVYSSIGE LLEVKTAALL YEPPWKAFAQ LKKAPLLPWA KRGAGHWEAD ALAVAERLGL 
AEVLERRLGE VKELARLALG AEPLVLDGAG VQPPERLVLL NVFDPDVCLD PGNGCDLRAY
ARTVESAAPA FAESLANALS GLGDAALRYH ALWFLLEPLW FRACGAPVVS PADPRFPVYT
VFDHVYAAAT LANAFRGGSF RGYVVVVDYA GVQEYISKAR KALDLWASSW VASLVTWATV
QPFVELLGPD VVVTPSLRGN WFYAAWLLGR LRGTGAYGAA REAARLAYGY SGFPRHPIMP
TRAVLFLPEV PGALEDALRD EASLRGFIER KASEAWSKAV GAVLSEGRLG ALLGETLRAN
GVEADASELE EYVESVASTL PLPQRLIVFR HGESYARYRE WLGRVFGGEP AAEAGGARVS
LDERALYFHY LMDEYIPSEE ALWKLSRVDP DVEAQSSRWC GAAAEVFGRL GFCSVCGERP
AVVGAPSSRY GGLGPESRRV VTEGERLCPR CLAKRLVALN PFAALRAVGV PAARTFWSVP
TTNELANAEV AEEHLDRVLD VVERHREAVA KVLAGFQWDH EYYSRRLARR AYRRAGREVA
LVALRLLAAL IEAATEQEYR ERFAKALDAL GDGEARREIE ELFRSIAGKR RTRLAVVKGD
GDYAGSRLLR ARLRLGAGEY AGRVAAQAGL GGGAAELLAS VAGRLGSTVT LSPLYTASAS
RSLSHGAVLD ARAVEGLGGF VVYAGGDDLL ALTPATAGGR YVALEAALRT RRGYWGERGR
GPRGFHASPG IPLVSPAPRS YGRSYAVLSI HYRDPLSAAV ARSSEMLEEA KACTAVYGPA
TVERDAVRLE DLRSGSSAML PFRAGPGAGL AGSPVWKAAL LASMELRGEV SSSLFYDFFS
WRFDELVERL AAESRTREAW LALRYLVSRN TRRNPDEVAG RVLGEELVAA RLRSADGREE
SLATLVLRAA KALKRMEG
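
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below shows one way this could be reproduced, under stated assumptions: it uses the standard codon assignments (which translation table 11 shares for elongation), and it treats an initial ATG, GTG, or TTG as methionine, which is a simplification of the table 11 start-codon rules. All names are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only these alternative start codons are handled here.
ASSUMED_START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in ASSUMED_START_CODONS:
            protein.append("M")  # a start codon is read as methionine
            continue
        residue = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate_cds("GTGAGGGGCG TGTACTCGAG CATCGGGGAG"))  # MRGVYSSIGE
```

The output matches the first ten residues of the protein sequence; the leading GTG is read as M because it is the start codon.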