Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1729 |
Symbol | |
ID | 4601754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1672406 |
End bp | 1674325 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639774502 |
Product | hypothetical protein |
Protein accession | YP_921127 |
Protein GI | 119720632 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.159419 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGAAAG CGGCTCTCGC GGTCCTCGTA CTGCTCGTAG CGGCTACGCT CGTACCCCCG CAGCGCCAGG CGAGCGTCGA GGCGAAGCCG CCCGTAGACT GGCTGGAGCT AGCGAGGGCG GCGTGGGGCT ACTTCTCCCC CGGCTTCGGG CTGAGCCAGA GGGGGATCAA CTACGCGACG CCCTCCTGGC ACTACGTGAC GGACTGGGAC GTCGGCAGCT ACCTCTCGGC GATAGTCGAC GCGGCGTGGC TCGGGCTGAT ATCCAGGGAT GAGGCTATCA GCAGGGCCGA AAAAGTGCTC GCCTTCCTCT CCACGAGGCC GCTACACCCC TCCGGCGTGC CGTACTCGGC GTACAGCTCG GACACGGGCA TGCCGGCGGA GAATGCGGGG CCCTCGAACC CCAGCGACGC CGGGAGGTTG CTGATAGCCC TCTACAGGCT GAAGAAGAGC TTCCCGGAGC TCGCGCCGAC CGTGGACTAC GTCGTCGAGA GGAACGGCTT CTCGGCCTTC GCGGGCTCGG TGCCGGACAG CGGCTTCTAC TCGTACTACT ACGCCTACGG CTTCCACCTC TGGGGCTTCA ACACCCCCCA GGTCATGAAG GCCCTCTCGA TGCTCGGGAG GCTACCCTAC ACGAGGACTG TCGACGCCTA CGGGGTCCGG CTACCCTACG TGGAGGTAAC GATGGAGCCG ATACTCCTCA CGATCTTCGA GCTAGACCCC CCGCCCGAGT TCTACGAGTG GGCGTACAAG GTCTACAAGG CGCAGGAAAA CCGCTACCTC GCCACGGGTA AGCCCACCGC GTTCACGGAG GGTCAGGTCA ACGCGCCCCC GTACTACATC TACGAGTGGA TAGTCGACAT ATACACGGGC GAGACGTGGA CCGTGTGGAG CGGCTCCCTC GGAAAGCTCA GCATGACGCC CGTAGTATAC GCCAAGGCGG CTCTAGGCAT GCACGCCATC TGGAACACCA ACTACACAGC CTTCCTAGCG GAGTACGTGA TGAAGGCTAA AACGCCCAAC TGCTTCTACG AGGGAGTCGA CGAAAACGGC AACGTCGTCT ACGCGATAAC CGACAAGACG AACGCCATGA TAGTGAGCGC CGCGAGGTAC GCACTGCAGA GAGCAAGCAA GCCCTCGGTA ACGGCGGGAG CGGTCCCAGC CCTCTACCCA GGCGAAAACG CGACGATAAC GCTCAACGTA ACCCACCAGC TACCCCTACC CATCACCCTA AGCGCGGAAG CCCCACCCGG GATAACAGCT GAGGTCGAGC CCAGCACCGG GAAAGCCAAC CTAACGGCGA GGCTAAAGGT ATCGGCGCGG CAAGGGCTAG CCCCCGGCAA CTACACCGTC ACAGTAAAAG TCTCGACGAT AGCGCACAAC GAGACGCTAA CCCTCACCGT GACAGTCAAG CCCCCCGGCT ACACCCTCAG AGTAAGAGTA GTAGACGCCT GCGGAGACCC AGTACCCGGC GCAACGCTAC TACTAAACGG GCTCAAAGCA GGAGAAACCG ACGCCAAGGG AGAAGCCGAG GTAAAACACG TGGAAGGAGA AGCAACGCTC ACCGCGATCT ACGCAGGGCT AGAAGTAGCC GGGCCACTAA AAATCAGCGT AAACTCCGAC ACCAACGCGA CGCTCAAGGC AAACCTCCGA AAAATAGCAG TCGCCTTCAC AACCCCCGAC GGAAAACCCG CAACCGGGAT ACTCGTAGTA GCACTGCTAG GGCGAACAAC CCTCTCAACC GCAAAGACGA ACTCCACGGG GCACGCACTA CTACCGAGAA TACCCCCCGC AAACATAACC CTACAAGCCT ACACACCCGA CGGAAAGCTA CTACTAGGAG AATGGACCGT GAACGCCGCG CAAGGCGAAG GAGTAGTAGA CCCAGAAATC CCCCCAACCA CCAGACACCT AGAGGCATAG
|
Protein sequence | MKKAALAVLV LLVAATLVPP QRQASVEAKP PVDWLELARA AWGYFSPGFG LSQRGINYAT PSWHYVTDWD VGSYLSAIVD AAWLGLISRD EAISRAEKVL AFLSTRPLHP SGVPYSAYSS DTGMPAENAG PSNPSDAGRL LIALYRLKKS FPELAPTVDY VVERNGFSAF AGSVPDSGFY SYYYAYGFHL WGFNTPQVMK ALSMLGRLPY TRTVDAYGVR LPYVEVTMEP ILLTIFELDP PPEFYEWAYK VYKAQENRYL ATGKPTAFTE GQVNAPPYYI YEWIVDIYTG ETWTVWSGSL GKLSMTPVVY AKAALGMHAI WNTNYTAFLA EYVMKAKTPN CFYEGVDENG NVVYAITDKT NAMIVSAARY ALQRASKPSV TAGAVPALYP GENATITLNV THQLPLPITL SAEAPPGITA EVEPSTGKAN LTARLKVSAR QGLAPGNYTV TVKVSTIAHN ETLTLTVTVK PPGYTLRVRV VDACGDPVPG ATLLLNGLKA GETDAKGEAE VKHVEGEATL TAIYAGLEVA GPLKISVNSD TNATLKANLR KIAVAFTTPD GKPATGILVV ALLGRTTLST AKTNSTGHAL LPRIPPANIT LQAYTPDGKL LLGEWTVNAA QGEGVVDPEI PPTTRHLEA
|
| |