Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1840 |
Symbol | |
ID | 4600341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008696 |
Strand | - |
Start bp | 47 |
End bp | 1714 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639772438 |
Product | hypothetical protein |
Protein accession | YP_919098 |
Protein GI | 119709758 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCTGAGG AGAGGAAGAA GCCTGTAGAG GTCAAGGAGT ACGAGGCTTC CGAGTACCTC AAACAGCTGG GCTACTCCGA GGAGGAGCTT AGGCGGATGG GCGTTCTGCC CGCGGAGGAG GTGCCGGAGC CCGTCGAGGT CGAAATCGAG GAGATACCCG AGGCGGAAAT CGAGGAGGCG CCCGAGGTGG AGGTGGAGAT AGAGGAGTAC CCGGGCGAGG AGGAGGTTGC AAAGGCTAAG GAGGAGCACG AGAGACAGGT CGCCGAGGTC ATGGACAAGA TTAGGGACGC CGTCCGCGGC ATGATGAAGA CGCGCGCTTG GGTCATGGTT CACGGTAAGC CCTTGGAGAC ACTGCCCCCG CTTAAAAAGG AGGACATAGC CGCGGTGGCG AAGACTCCGG AGGGCGACGT CTCGATAATC ACGAAGGAGG GTAAGGAGTA CGTAGTCGTA GCCGGCGAAA AGGTATACGA GGTGGAGCTA CCCCAAGAGG ACAAGCTCGC GGTTCAGAGG CTAACGGCTG AGCTCAGCAA GAGGTACGGA GCCGCCGCCG TCACGGAGGC TGTGGACAGG ATGCTGAAAG AGTACGCGCC GAGGCTGTAC CCCGCGCTCA AGGAGAAGGC TCCCGCCGCG GTCGCCGCGG TCGGCAAGGC TAAGGAGGCT GTTAGCGGTG CCACGAAGGG CTGGAAGCTC GCGCTCTACC GGGCGTTGTA CCCGGGCGTG CTGAGCACGT ACGACTACTT CTCGCGGATG ACGGAGGCAA CGCTGAACAT GGTGGCGGCT CTGGGCTTGA CGGGGCTCCT AGCCTACCTG ATTACGTGGG CTAGGGGCTT CGCCGCGTCG GGGGCGGGCG GGCTTTGGGG GTTGATGGAG GAGAAGGACG TGGTCGTCGC GCTGTTCAAC TTCCTGCTGG ACTTGGCTAG CTACACGCTG ATACCCGCCT TGCTAGCCGT CTTCGTATTC CTCTTGGTGT ACTTGGGCGG GGCAATCACG GGGGCGGTTC GCCTGATAAG GGTGGGGGGA GAGTCTAGGA GCGTCGTTGT ACCCCTGTTC AGCTTCGTCT CCGCCGTCCT AAACGCGATG TTCAGGCAGA TGCTCGTAGG CTACTTGGTG GGGGCTGTGG CGGAGTACGT GGCTATAGTA ATCGTGGCGC TCTTGTCTAG CGTGGTTCTG CTGTGGCTCG GACTGCTAGC CGTCTCCGCT TTGACGGGCT TGTTCTTGGG CGGCATACAG GGAGCCTTGG TCTTCGCGCT CCTAGTATTC CTCCCCGTGG CTAGCGTGCT CGCGGGGGCG CTGACGCTCG TACTGCTGGG CAGGAGTAGC AGGGCTATGC TGAGGCTCAC CGATTTGCCC ACGCTGTTGC TAGCCACGGT GGTAGCAGTT AGCTACAAGT TCGCCTTCAT ACAGCCCCTG CTCTGGCTGG TCTTGGCCGC CGTGGTCGTC ATCCTAGGCA TAATAGCGGC GTCTAGACCG GTCGAGAGGC TCATGGTCTT GCTGAGGGGA GTCGCGGTGA TAGCCGGGGC GATATTCGCG GTTCACTTGG GCTACTCGGG GATAGAGGAC TTCGTGGTGC CCGCCTTCCG CCTGTACTGC CAGATTCTAG GGCTCCCCGA GGGGGTAGTC AACACCGCGG TGGACTGGGC TAGGAAGATT CTCTACGTAA TACCGTAA
|
Protein sequence | MAEERKKPVE VKEYEASEYL KQLGYSEEEL RRMGVLPAEE VPEPVEVEIE EIPEAEIEEA PEVEVEIEEY PGEEEVAKAK EEHERQVAEV MDKIRDAVRG MMKTRAWVMV HGKPLETLPP LKKEDIAAVA KTPEGDVSII TKEGKEYVVV AGEKVYEVEL PQEDKLAVQR LTAELSKRYG AAAVTEAVDR MLKEYAPRLY PALKEKAPAA VAAVGKAKEA VSGATKGWKL ALYRALYPGV LSTYDYFSRM TEATLNMVAA LGLTGLLAYL ITWARGFAAS GAGGLWGLME EKDVVVALFN FLLDLASYTL IPALLAVFVF LLVYLGGAIT GAVRLIRVGG ESRSVVVPLF SFVSAVLNAM FRQMLVGYLV GAVAEYVAIV IVALLSSVVL LWLGLLAVSA LTGLFLGGIQ GALVFALLVF LPVASVLAGA LTLVLLGRSS RAMLRLTDLP TLLLATVVAV SYKFAFIQPL LWLVLAAVVV ILGIIAASRP VERLMVLLRG VAVIAGAIFA VHLGYSGIED FVVPAFRLYC QILGLPEGVV NTAVDWARKI LYVIP
|
| |