Gene Information |
Locus tag | Tpen_0347 |
Symbol | |
ID | 4601457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 318487 |
End bp | 320415 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639773107 |
Product | hypothetical protein |
Protein accession | YP_919759 |
Protein GI | 119719264 |
COG category | |
COG ID | |
TIGRFAM ID | |
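
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 318487
END_BP = 320415
GENE_LENGTH_BP = 1929
PROTEIN_LENGTH_AA = 642


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the last codon is an untranslated stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1929
print(expected_protein_length(GENE_LENGTH_BP))  # 642
```

Both results agree with the Gene Length and Protein Length rows, so the record is internally consistent under these assumptions.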
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.379451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAGGTA TGAAGAAAAT ACTCGGCGTA GTGGTGTTAG CACTTGTCTT GCCAGTAATA CTTGGGGCTG TCCTCGGAGA ACCATCCTCC ACGCCCCAGT TTTACGCACG TATAGACATC ACGCTTAGTA CCGAGGAATT CTTGACGCTC GACTTTGGGA GTAAGAGTAT CATCAGCATA GCGGGGGCAG CAGAGCCTCC GGGCTTCAGT GTAGACAGGG TAGTCGTGGT CTTCCAGGGC GGAGCGCCGA GCGGGCTTAT TCCCGTCAAG TACGACAGTA TAAGCGAGTC AATGGGGAGA ATAACCTCTA TCTCTTCCCG TAGCGGAGAG GTAGTAGTAG CCTCCAATGG GTTTAACGGG ACAGTCCCCG TGAGGGTCAT AACGTACTTT AGGAAAACTT CCTGGAAACC GATTAAGGGC AACAACATAA CCGTGGACAC GTCAGAGTTT ACGGGCCTCA ATCTACCCAG CGTGCGCCTC AAGGTAACAC TGGACAACTA CGCCCCCTAC AGCGTAGCGG GAGTACTAGG ACCTTCCGGG GAAAACCTGT TAGACGTAGA CTTGCAGGAG AAGCTGGGAC CCAGCGTTAT AAAGTTCGAC CCGAAGCATG TAGAGGTAGA CGTATCGAAG GTAGGCTTCG GCGTTTACAC CGTTAAGCTT GCCCAAGGAG AGGAGAATAA GCTTCCCAAC GCGATGCTCG TAGTGGAGGA CACCTACATC GAAACGAGCG TTCCAGCTAA GTCCTCCAAA GTGTTTAACC TTAGAGGGCG TACAGGCTGG AACCCGCTAG GCTTCATAGT CGTAGTGTAC TCCGTAGCCC CCGGACCCCT CTCTGCAAAC GTTCGAGTAG AGTCTGAAAT GACGAACTAT GTGTTTAGCA GGGCGGAAGA ATTCGATATA AGAGGGGCGT CCCTGCTCAT ACCTCCTCTG CTTATGCACT ACTGGATAAA GGGATACATA GCGTTCGGCC AGGCGGTCAA GGTGGTTAAC AACGAGAACC GCGACATCCA GGTGCTGCTC GTACCAGTGT ACTACAAGGA GGTAGGAACG TGGACGCCTA GAGGATTAAT AGCCACAATC TCGAAGGCCG ATATAGGCAA TGCGTACTCC GCGTTCCTGG TCGTGCAGGT ACCGTCTATA GCCAGGATAA CCTCCATCGA AACTCCAAGC GGGCAGGTAT TGCAAGGCAA GGAGAACTAC ACTGGGGCGT GGCTCGGCAC CTGGAGAACC GCCGTGATCG AGCCCGGCGA AGCCGCCGTC ATGGTCAAGA ACGGCGACGC CGTTGAAGAC GGGACATACA AGGTCAACAT TGAGTGGAGG CCGCTCAGAG TGAAGTTCGT GGACTCCAAG GGGTACCCAA TCTCCGGAGT AGAAGCCACG CTTAAAGGCT CCGTAAGCGC CTCTGCAGTC AGCGGAGCGG ACGGTGTCGC CACCCTGAAT GTTTATGCGC CGGGAGTCTA CACTCTGACC GGCGTATACA AGGGGTCAAA TATAGCGTCC ATGGTACTCG GTACGCTCAT AGACACTGAC TTGGAGATAA AATGCCCCGT GTACAACCTG AACGTCAAGG TCGTGAATGC TCTCGGAGCG CCGATCACTG GCGCAACGGT AACGGTTTCG AACAACGGTG GCTTTACGCA GTCCATGGAG ACGGACGCGA GCGGGAAGGC ACTCTTCCAG CAACTCCCCG GGGCACAGTA TACAATCGAA GTGAACTACA AGAGGATATC CACTAAGTCT ACGCTCACCC TGACCCAAGA CCAAGATATA ACGATAAACA CGGGAGTACT CTTCGAAATC 
CCGTTACTTG GCCCCATAAC AGTGATGGAG ACGCTGACTC TCGGAGCCGC AGCACTGCTA ACCTCCGCCC TATACTTCGG TGTCAGGAAG GGCAGAGAGG AGGAAGTGGC AGAGATAGAG ATAGACTAA
|
Protein sequence | MRGMKKILGV VVLALVLPVI LGAVLGEPSS TPQFYARIDI TLSTEEFLTL DFGSKSIISI AGAAEPPGFS VDRVVVVFQG GAPSGLIPVK YDSISESMGR ITSISSRSGE VVVASNGFNG TVPVRVITYF RKTSWKPIKG NNITVDTSEF TGLNLPSVRL KVTLDNYAPY SVAGVLGPSG ENLLDVDLQE KLGPSVIKFD PKHVEVDVSK VGFGVYTVKL AQGEENKLPN AMLVVEDTYI ETSVPAKSSK VFNLRGRTGW NPLGFIVVVY SVAPGPLSAN VRVESEMTNY VFSRAEEFDI RGASLLIPPL LMHYWIKGYI AFGQAVKVVN NENRDIQVLL VPVYYKEVGT WTPRGLIATI SKADIGNAYS AFLVVQVPSI ARITSIETPS GQVLQGKENY TGAWLGTWRT AVIEPGEAAV MVKNGDAVED GTYKVNIEWR PLRVKFVDSK GYPISGVEAT LKGSVSASAV SGADGVATLN VYAPGVYTLT GVYKGSNIAS MVLGTLIDTD LEIKCPVYNL NVKVVNALGA PITGATVTVS NNGGFTQSME TDASGKALFQ QLPGAQYTIE VNYKRISTKS TLTLTQDQDI TINTGVLFEI PLLGPITVME TLTLGAAALL TSALYFGVRK GREEEVAEIE ID
|
| |
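
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. This is a minimal sketch, not the database's own method: it assumes the standard codon-to-amino-acid assignments, which NCBI translation table 11 shares with table 1 (the two differ in which start codons are permitted), and it uses only the first 30 bases and the last 9 bases copied from the Gene sequence row. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Fragments copied from the Gene sequence row (10-base blocks as displayed).
gene_start = "ATGCGAGGTA TGAAGAAAAT ACTCGGCGTA"
gene_end = "ATAGACTAA"

print(translate(gene_start))  # MRGMKKILGV
print(translate(gene_end))    # ID*
```

The first ten residues match the start of the Protein sequence row (MRGMKKILGV), and the final codons give the terminal residues ID followed by a stop.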