Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0149 |
Symbol | |
ID | 4600641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 124705 |
End bp | 127392 |
Gene Length | 2688 bp |
Protein Length | 895 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639772903 |
Product | solute binding protein-like protein |
Protein accession | YP_919562 |
Protein GI | 119719067 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.450727 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACCA AACTTTCATT ACTACTCGTA GCACTGCTCG TAGCCGCATT CCTGGCTCCG CTTCTAAGCG CACAACCACA GCCACTCCAG GGTCCATGGG TAGACGAGGT ATCCTTCCTC AAGGAAGGCG ACCCAGCGAA AGTAATGGAC ATGATTGCCA AGGGAGACGT GCAGGTTTAC TTCAGCGATG TCCGCGTCGA CGCGGCGATA GCTCAGAGGC TAAAAACAGA CCCCAACATA AACTACAAGT ACGCCTTCGG GCTCTACTTC GAGCTAACGT TCAACCCCGT GGGACCGGAG TTCCCGAAGA CCGGGAAGCT TAACCCCTTC AGCGTTCCGA GGATCAGGGA GGCTATGAAC TACATTGTCG ACAGGAACTA CATCGTGAAC GAGATAATGC TGGGCTTCGC TGTACCCAAG TGGCTACCCC TGATCTCCCA GTTCCCCGAG TACGCTAAGC TGGCGGACAC CGTTAAGTTG CTCGAGGCAC AGTACAGCTA CAACTTCGAA AAGGGCAAGG AGATAATATT CGAGGAGATG GCGAAGCTGG GCGCAACGTA CAAGGACGGG AAGTGGTACT ACAAGGGAGA ACCTGTTGTC ATAAAGTTCC TGATAAGAAC CGAGGACCAG AGGAGGCGGA TAGGCGACTA CGTTGCAGAC CAGCTGGAGA AGCTCGGCTT CACCGTTGAA CGAATGTACA AGACTGCGCG CGAAGCCAGC CCGATATGGA TACGCGGAAA CCCGGCAGAC GGGCAGTGGC ACATATACAC CGGAGGCTGG ATAACTACGG CTGTGAGCAG AGACGATAGT AGCGTCTGGG GATACTTCTA CACTCCGCTC GGAAGACCTG AGCCGCTCTG GCAGGCTTAC AAGCCGGACC CCGTCTTCAT GGACATAGCT ACTAGACTGT GGAATGGGCA GTTCAAGACC ATAGAGGAAA GACAGGAGCT TATGGCTAAG GCGGCTGTCC TTGCTCTCAA GGACTCCGTT AGAGTCTGGC TGGTAGACCA GATAGTACCC TACATCTACA GCAAGAAGGT GGACCTTGCG GCCGACCTTA GCGGAGGCTT CTCATCTCCG ATCTCTCTCA GAACCTTGCG CTACGTCGAC AAGGCGGGAG GTAGCGTGAA AGCTGCCATG AGAGAAGTAC TAGTAGAACC TTGGAACCCG GTTGCGGGGA CGAACTGGGT CTACGATGCA GTACTTATCG GGGCAACACA GGGTTATGCG TTCCTCATAA GCCCGTACAC AGGCCTACCG CTACCCGAGA GGGCTGTCAG CGCGGAGGTC TATGCGCTGA AGGGCACCCC CACTACTAGC TCGAGCGACT GGTGCAAGCT CACGTTTGTC GACAAGGTCG AAGTACCCGC GGACGCGTGG TACTCTTACG ACGTGAAGAA CAACAAGGTC GTGACGGCCG GAGAGGCCGG TGTAAAGAAC GCGCAGTTCA AGATAGTGGT TAGCTACGGC GACGTCATAG GCAAGATAAA GTATCACGAC GGGACAACGA TGAGCCTCGC GGACTGGGTT ATAGGCTGGC CTCTGACGTT TGCCAGGGTT GACCCGAGCA GCCCGCTTTA CGACGAGTCA GCGGTTCCGA GCTTCCAGGC CTGGAGATCC TACTTCATTG CTTGGCGCTT CGTGAGCACT TCCCCGCTCG TGATAGAGTA TTACGTGAAC TACACGAGCC CCGACGTCGA GCTGATAGTA TCCGGGTTCG CCGGGTGGCC TAACTTCCCG TGGCACGCCC TCGCCATAGG TATAAGGGCC GAGGAGAAAG GCTTGTTGGC GTTCAGCGCT GACAAGGCAG ACAAGATGAA GGTTGAGTGG ATGAACTACA TCGGCGGACC GAGCCTCGAC ATCCTCTCGA AGATGCTCGA CGAAGCCCTC GCGGAGGGCT ACATCCCGTT CAAGGACTTC ATGAGCAAGT ACGTGAGCGT CGACGAGGCT AAAGCTAGGT ACAACGCCCT TAAGAGCTGG TACGCGGCAC ACAAGCACTT CTGGGTGGGC GACGGTCCCT ACTACCTCGA CACCGCGGAC TATAATGCTC ACGTAGCTGT TCTCAAGGCT AACAGGAACT ATCCTGACAA GTCCGACAGG TGGGCCTGGC TCGCAACTCC GCCGATACCG GAGGTCGCCG TGAGACCGCC GGACAACGTC GTGCCAGGGC TCGAGGCCGT GTTCACGATA CAGGTTACCT ACAAGGGCCA GCCGTACCCC AACAAGCGCA TGGACTTCGT GAAGTTCCTG GTGCTGGACC CAGCCGGCAA CGTACTCGCG AAGGGCTCTG CAACGCCCAC CGCCGAGGGT GTCTGGCAGG CTAAGCTGAG CCCGGAGGAC ACGGGCAAGC TGACGCCCGG CCCCTACAGG ATAATGGTGA TAGCGCTGAG CAAGGACGTA GCGAGCCCGG CTATAAAGGA GTCTCCGTTC ACTGTCATCC CGCAGATTGC CTACTTCCAG ACCATGGTCG CCGGGATTAG AGGGCAGCTC GAGTCTAGGA TTGCTGGCGT CGAGACTGGT GTAAGCGAGG TTAGAGGCAA GATATCGGAC ATCGCTAACA GTGTTTCGTC GCTACAAGGC ACCGTCAATA CCGTGCTTGC CCTTTCAGTG CTAAGCCTGA TAATAGCTGT CGTGGCAGTC TTCCTCGCTT TAAGGAAACC AAAAGCAGAA ACCGCCCAAA AACAATAA
|
Protein sequence | MNTKLSLLLV ALLVAAFLAP LLSAQPQPLQ GPWVDEVSFL KEGDPAKVMD MIAKGDVQVY FSDVRVDAAI AQRLKTDPNI NYKYAFGLYF ELTFNPVGPE FPKTGKLNPF SVPRIREAMN YIVDRNYIVN EIMLGFAVPK WLPLISQFPE YAKLADTVKL LEAQYSYNFE KGKEIIFEEM AKLGATYKDG KWYYKGEPVV IKFLIRTEDQ RRRIGDYVAD QLEKLGFTVE RMYKTAREAS PIWIRGNPAD GQWHIYTGGW ITTAVSRDDS SVWGYFYTPL GRPEPLWQAY KPDPVFMDIA TRLWNGQFKT IEERQELMAK AAVLALKDSV RVWLVDQIVP YIYSKKVDLA ADLSGGFSSP ISLRTLRYVD KAGGSVKAAM REVLVEPWNP VAGTNWVYDA VLIGATQGYA FLISPYTGLP LPERAVSAEV YALKGTPTTS SSDWCKLTFV DKVEVPADAW YSYDVKNNKV VTAGEAGVKN AQFKIVVSYG DVIGKIKYHD GTTMSLADWV IGWPLTFARV DPSSPLYDES AVPSFQAWRS YFIAWRFVST SPLVIEYYVN YTSPDVELIV SGFAGWPNFP WHALAIGIRA EEKGLLAFSA DKADKMKVEW MNYIGGPSLD ILSKMLDEAL AEGYIPFKDF MSKYVSVDEA KARYNALKSW YAAHKHFWVG DGPYYLDTAD YNAHVAVLKA NRNYPDKSDR WAWLATPPIP EVAVRPPDNV VPGLEAVFTI QVTYKGQPYP NKRMDFVKFL VLDPAGNVLA KGSATPTAEG VWQAKLSPED TGKLTPGPYR IMVIALSKDV ASPAIKESPF TVIPQIAYFQ TMVAGIRGQL ESRIAGVETG VSEVRGKISD IANSVSSLQG TVNTVLALSV LSLIIAVVAV FLALRKPKAE TAQKQ
|
| |