Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1316 |
Symbol | |
ID | 4601996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1261982 |
End bp | 1264918 |
Gene Length | 2937 bp |
Protein Length | 978 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639774091 |
Product | hypothetical protein |
Protein accession | YP_920716 |
Protein GI | 119720221 |
COG category | [R] General function prediction only |
COG ID | [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) |
TIGRFAM ID | [TIGR02577] CRISPR-associated protein, Crm2 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.800539 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGGGGCG TGTACTCGAG CATCGGGGAG CTACTGGAGG TGAAGACCGC CGCCCTGCTG TACGAGCCGC CGTGGAAGGC CTTCGCGCAG CTGAAGAAGG CGCCGCTACT GCCCTGGGCG AAGCGGGGGG CGGGGCACTG GGAGGCGGAC GCCCTCGCGG TGGCGGAGAG GCTGGGGCTC GCGGAGGTCC TCGAGAGGAG GCTCGGGGAG GTGAAGGAGC TCGCCCGCCT GGCGCTCGGC GCCGAGCCGC TCGTGCTCGA CGGGGCAGGC GTGCAGCCGC CGGAGAGGCT CGTGCTCCTG AACGTGTTCG ACCCGGACGT GTGCCTCGAC CCGGGCAACG GCTGCGACCT CAGGGCGTAC GCGCGGACAG TGGAGTCCGC CGCCCCGGCC TTCGCGGAGA GCCTCGCGAA CGCTCTCTCA GGCCTGGGGG ACGCGGCCCT GAGGTACCAC GCGCTCTGGT TCCTCCTGGA GCCGCTCTGG TTCCGCGCGT GCGGCGCGCC CGTCGTATCG CCGGCCGACC CGAGGTTCCC GGTGTACACG GTCTTCGACC ACGTCTACGC GGCGGCCACG CTCGCGAACG CCTTCAGGGG CGGCTCCTTC AGGGGCTACG TGGTCGTCGT GGACTACGCG GGGGTCCAGG AGTACATCTC CAAGGCCAGG AAGGCCCTCG ACCTCTGGGC CTCCTCGTGG GTAGCCTCGC TCGTCACGTG GGCCACCGTC CAGCCGTTCG TCGAGCTCCT GGGGCCAGAC GTCGTCGTGA CGCCCTCGCT GAGGGGGAAC TGGTTCTACG CGGCGTGGCT CCTCGGAAGG CTCAGGGGCA CCGGGGCCTA CGGGGCGGCC AGGGAGGCGG CGAGGCTCGC GTACGGCTAC TCGGGCTTCC CGAGGCACCC GATAATGCCC ACGCGCGCTG TGCTCTTCCT CCCGGAGGTT CCCGGGGCCC TCGAGGACGC CCTGCGCGAC GAGGCCTCGC TTAGGGGCTT CATAGAGAGG AAGGCCTCCG AGGCCTGGTC CAAGGCGGTC GGAGCCGTCC TCTCCGAGGG GAGGCTGGGC GCGCTGCTCG GCGAGACCCT CAGGGCGAAC GGGGTCGAGG CAGACGCCTC GGAGCTGGAG GAGTACGTCG AGAGCGTCGC CTCGACGCTA CCGCTCCCGC AGAGGCTCAT AGTGTTCAGG CACGGGGAGT CCTACGCTAG GTACAGGGAG TGGCTGGGCA GGGTGTTCGG CGGGGAGCCC GCGGCGGAGG CGGGCGGCGC CAGGGTATCG CTCGACGAGC GGGCGCTCTA CTTCCACTAC CTCATGGACG AGTACATCCC GTCGGAGGAG GCCCTGTGGA AGCTGAGCAG GGTGGACCCG GACGTCGAGG CCCAGTCGAG CCGATGGTGC GGCGCGGCGG CCGAGGTCTT CGGCAGGCTC GGCTTCTGCA GCGTGTGCGG CGAGAGGCCG GCCGTGGTCG GGGCGCCGTC CAGCAGGTAC GGGGGGCTCG GCCCCGAGTC TAGGAGGGTC GTGACGGAGG GCGAGAGGCT CTGCCCGAGG TGCCTCGCGA AGAGGCTGGT AGCCCTCAAC CCCTTCGCCG CGCTGAGGGC CGTCGGGGTC CCGGCGGCGA GGACGTTCTG GTCCGTCCCC ACGACGAACG AGCTCGCGAA CGCGGAGGTA GCGGAGGAGC ACCTGGACAG GGTCCTCGAC GTGGTCGAGA GGCACCGGGA GGCCGTGGCG AAGGTCCTCG CGGGCTTCCA GTGGGACCAC GAGTACTACT CGCGGAGGCT CGCGCGGAGG GCCTACAGGA GGGCCGGCAG GGAGGTCGCG CTCGTCGCGC TCAGGCTGCT CGCAGCCCTC ATCGAGGCGG CGACGGAGCA GGAGTACAGG GAGAGGTTCG CGAAAGCCCT GGACGCGCTC GGGGATGGGG AGGCGCGCAG GGAGATCGAG GAGCTGTTCA GGTCCATAGC CGGGAAGAGG AGGACGCGGC TCGCCGTGGT GAAGGGGGAC GGGGACTACG CGGGGTCCAG GCTGCTCCGG GCGAGGCTGC GGCTCGGCGC GGGGGAGTAC GCGGGCAGGG TCGCGGCGCA GGCCGGGCTC GGGGGCGGGG CGGCGGAGCT ACTCGCGAGC GTAGCCGGGA GGCTCGGCTC GACCGTCACG CTCTCACCCC TCTACACCGC CTCGGCGTCC AGGTCGCTCA GCCACGGCGC CGTCCTAGAC GCACGCGCGG TCGAGGGGCT GGGGGGCTTC GTGGTCTACG CGGGCGGGGA CGACCTGCTC GCGCTGACGC CTGCGACGGC CGGCGGGAGG TACGTGGCGC TGGAAGCCGC CCTCAGGACC CGCAGGGGCT ACTGGGGGGA GCGCGGGAGG GGGCCGAGGG GCTTCCACGC GTCGCCCGGA ATCCCGCTCG TCTCGCCCGC CCCCAGGAGC TACGGAAGGT CGTACGCGGT GCTCTCGATC CACTACAGGG ACCCGCTCAG CGCGGCCGTG GCGAGGTCGT CCGAGATGCT CGAAGAGGCG AAGGCTTGCA CCGCGGTGTA CGGGCCCGCA ACCGTCGAGA GGGACGCCGT CCGGCTAGAG GACCTCAGGT CCGGCTCCTC CGCGATGCTA CCCTTCAGGG CGGGGCCCGG CGCCGGGCTG GCCGGGAGCC CCGTTTGGAA GGCCGCGCTA CTGGCCTCCA TGGAGCTACG CGGGGAGGTC TCGTCGAGCC TCTTCTACGA CTTCTTCTCG TGGCGCTTCG ACGAGCTCGT GGAGAGGCTC GCCGCCGAGT CGAGGACGCG CGAGGCGTGG CTCGCCCTGC GCTACCTGGT CTCCAGGAAC ACCAGGAGGA ACCCCGACGA GGTGGCGGGG AGGGTGCTGG GCGAGGAGCT TGTAGCGGCT AGGCTCAGGT CGGCGGACGG GAGGGAGGAG AGCCTCGCAA CCCTGGTGCT CCGGGCCGCC AAGGCGCTCA AGAGGATGGA GGGGTGA
|
Protein sequence | MRGVYSSIGE LLEVKTAALL YEPPWKAFAQ LKKAPLLPWA KRGAGHWEAD ALAVAERLGL AEVLERRLGE VKELARLALG AEPLVLDGAG VQPPERLVLL NVFDPDVCLD PGNGCDLRAY ARTVESAAPA FAESLANALS GLGDAALRYH ALWFLLEPLW FRACGAPVVS PADPRFPVYT VFDHVYAAAT LANAFRGGSF RGYVVVVDYA GVQEYISKAR KALDLWASSW VASLVTWATV QPFVELLGPD VVVTPSLRGN WFYAAWLLGR LRGTGAYGAA REAARLAYGY SGFPRHPIMP TRAVLFLPEV PGALEDALRD EASLRGFIER KASEAWSKAV GAVLSEGRLG ALLGETLRAN GVEADASELE EYVESVASTL PLPQRLIVFR HGESYARYRE WLGRVFGGEP AAEAGGARVS LDERALYFHY LMDEYIPSEE ALWKLSRVDP DVEAQSSRWC GAAAEVFGRL GFCSVCGERP AVVGAPSSRY GGLGPESRRV VTEGERLCPR CLAKRLVALN PFAALRAVGV PAARTFWSVP TTNELANAEV AEEHLDRVLD VVERHREAVA KVLAGFQWDH EYYSRRLARR AYRRAGREVA LVALRLLAAL IEAATEQEYR ERFAKALDAL GDGEARREIE ELFRSIAGKR RTRLAVVKGD GDYAGSRLLR ARLRLGAGEY AGRVAAQAGL GGGAAELLAS VAGRLGSTVT LSPLYTASAS RSLSHGAVLD ARAVEGLGGF VVYAGGDDLL ALTPATAGGR YVALEAALRT RRGYWGERGR GPRGFHASPG IPLVSPAPRS YGRSYAVLSI HYRDPLSAAV ARSSEMLEEA KACTAVYGPA TVERDAVRLE DLRSGSSAML PFRAGPGAGL AGSPVWKAAL LASMELRGEV SSSLFYDFFS WRFDELVERL AAESRTREAW LALRYLVSRN TRRNPDEVAG RVLGEELVAA RLRSADGREE SLATLVLRAA KALKRMEG
|
| |