Gene Information |
Locus tag | Tpen_1866 |
Symbol | |
ID | 4600331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008696 |
Strand | + |
Start bp | 15432 |
End bp | 17306 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639772464 |
Product | hypothetical protein |
Protein accession | YP_919124 |
Protein GI | 119709784 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCATGA AAAGGTACAT ACCCCTGTTA CTGCTACTGG TAACGGTAGC CTTGGGCATC GCCACCGTAA AGGCGGCTCC CGTCACGACT TACACGGTCT TCGTCAACTA CACCATGTAC ACGGGCACCG CTCCAGCCGC GGGTAGCACC TACTGGTTCA ACTTCACAAT CACTCCAAGC CTCGACGTCG TCATCGTGAC GGTAGAGGAC TTGGGAGGCG CTAGCCAGAG GACGGTCTAC GTGGACAACG TGGCTGTAAC CCTGCCCTAC AAGCTGAAGG CTGGAGAGAC CCACAGGGTA GCCGTCAAGG TCTACTTCCC GAGCGCGCTG ACTGTAGCGT GGTTCGGCAA GACGATACTC GCCTACAACA GGACTGAGGA CGTAAACCTG CAGGTAAAGT ACGTGGGCTA CGGCTTCGAC TCCGTGTCCT ACGGGAGCTA CGTGGGTAGC GTGGACAACG TGGCAACGCT GACCGTCAAG CTGGACACGC CTTACACCAT GGGGGTCACA CACACGTTCA GCTCTGTGTC CACGGTCGTC TGGCTCAGGT TCCACCCGGC GATGCCGATT AAGGGCTACA CGGGCACCGC CTACTACACC ACGACCTTCA ACGCCTACAC GATTGGAAGC CAGCCCGCGG GCACCGATAG CTACACCATC TCCAACAACC CGGTAACCGT CACGGTTGAC TGTCCACCGC CAGTAGGGCT CTACGAGCTT GTCTGGTACG TGAGCTGGGG AGGCTTGACG GAGAAGATGC TCGGGGCGCC GGGCACCCCC ACCCCCAACA AGAACTACGC TAGGCTGGTC GGCGCCACGT TTACTTGGAA GTTCACCCCC AACGCCACGT ACTTCCCCAC GACCAAGGTT GACGAGTACC TCTTGGTGAA CGGTAGCAAG GCTTCGAGCC TCGCGCTGAG CAAGAAGGGG CTCTACAACG CCACCTTCGT CAACATCGGA AAGATGAACA CCACGTTCTA CGGAGTGCTG GCATACCTGC CCCCCGTGGT CAGGGCGGGT ACCATGTACG TCTACGCCGC GGAGCTTAAG GTCAACGTGC TCTCTCAGGT AGGCACGGTG GGAGCCCCGA GCCCCGGCTT CACTCTCAGC TTAGCCGGCG CGAACGCTCC CGGCTACTTC GCGCTTCAGG TCAAGGTGCC CGAGGGAGTC GTCACGGTCG AAAAGGCTGA GGGAGTGGCG TGGAACAGGA CTGTGCTCCC CGGCATAGCC AAGGACGTCA AGACGAGCCT CGACGCCGAT AAGCTGGTCT ACACCGTGAA CTACACGTAC ACGCTGTACT ACGCGCCTAT AGTGGGAGGC GTCAACTTCC TAGGCGCGTG GGGCATAGAG GAGACTCCGA GCCTTACGCC GACGAGCCCG ACTATAGCCA CTCTGACTGC CAGCAAGCCA GTGGTGCTCA ACGCTACTAG AAACGTGATA AGGGTCTTGG ACTCCGCGGG CAACGACGTC AGCTTCGTCG GGAAAGCCAT AGCGATAAAG AGTAGCGGCA CCTACACCGT CAAGCTTGAG ACCGTGATAA AGGTCGTCAA CCTCTACCAA GGCAAGAAGA TACCCGCCAC GGTCAGGCTC TACGACGCTA AGGGCTTGCG CCTAGCGGAG AAGACCGGGG AGGAGGTAAC GTTCACGGTT GAGCCCGGAC TGCTCTACAC GGTCGAATCG GACAACGGTA ACGAGGTGCT GACTCAGCGC GTCACGCCTA CACAGGACGT CGATGTAACA ATGGAGTTCA CGAAGCCTCC CGCCGTGGTA ATTCCGTGGG AGTGGGTGTG GCTAGCGCTG 
GCCATAGTGT TCCTAGTGGT TCTGATATAC TTCGCTAAGA GGCTCAAGGA GGGTCTAGAG ATAGTAGTGG GGTAA
|
Protein sequence | MGMKRYIPLL LLLVTVALGI ATVKAAPVTT YTVFVNYTMY TGTAPAAGST YWFNFTITPS LDVVIVTVED LGGASQRTVY VDNVAVTLPY KLKAGETHRV AVKVYFPSAL TVAWFGKTIL AYNRTEDVNL QVKYVGYGFD SVSYGSYVGS VDNVATLTVK LDTPYTMGVT HTFSSVSTVV WLRFHPAMPI KGYTGTAYYT TTFNAYTIGS QPAGTDSYTI SNNPVTVTVD CPPPVGLYEL VWYVSWGGLT EKMLGAPGTP TPNKNYARLV GATFTWKFTP NATYFPTTKV DEYLLVNGSK ASSLALSKKG LYNATFVNIG KMNTTFYGVL AYLPPVVRAG TMYVYAAELK VNVLSQVGTV GAPSPGFTLS LAGANAPGYF ALQVKVPEGV VTVEKAEGVA WNRTVLPGIA KDVKTSLDAD KLVYTVNYTY TLYYAPIVGG VNFLGAWGIE ETPSLTPTSP TIATLTASKP VVLNATRNVI RVLDSAGNDV SFVGKAIAIK SSGTYTVKLE TVIKVVNLYQ GKKIPATVRL YDAKGLRLAE KTGEEVTFTV EPGLLYTVES DNGNEVLTQR VTPTQDVDVT MEFTKPPAVV IPWEWVWLAL AIVFLVVLIY FAKRLKEGLE IVVG
|
| |
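The numeric fields in the Gene Information table are related by simple arithmetic, which the sketch below illustrates. It assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon (both consistent with the values in this record, but not stated by it). The function names `gene_length_bp`, `protein_length_aa`, and `gc_percent` are hypothetical helpers introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp < 3 or cds_length_bp % 3 != 0:
        raise ValueError("CDS length must be a positive multiple of 3")
    # Total codons minus the terminal stop codon, which is not translated.
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Values taken from the record: Start bp 15432, End bp 17306.
length = gene_length_bp(15432, 17306)
print(length)                     # 1875
print(protein_length_aa(length))  # 624
```

Passing the Gene sequence field to `gc_percent` works without first removing the spaces between the 10-base blocks, because the function strips all whitespace before counting.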