Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0949 |
Symbol | |
ID | 4601289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 900868 |
End bp | 902106 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639773727 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_920352 |
Protein GI | 119719857 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.286124 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCTCGACC CCTACAGGGT TAGGAGGGAT TTCCCCATAC TCTCGAAGAA GGTTCACGGG AAGCAGCTTG TCTACTTCGA TAACGCGGCT ACGAGCCAGA GGCCCCTCCA GGTGGTGGAG GCTGTCGAGT CGTTTTACAA GTCGCTGAAC GCGAACATAG GTAGGAGTGT GCACGAGCTT GGGCTCAGGG CTACCGAGGC TTACGAGGAT TCGAGGAAGA TTGTCGCGGC GTTTATAGGG GCGAAGCCCG AGGAGCTCGT CTTCACGAAG AACACTACCG AGTCCATAAA CCTAGCCGCC TACTCGTTGC TGGTGTCGGG GCTTATATCG AGGGGCGACG CCATAGTCGT TACGAGGATG GAGCACCACA GCAACCTGTT GCCCTGGGTG CGGGTAGCGA AGCTCGCGGG GGCCGAGCTG AGGGTAGTCG ACGTGGACGA CGAGGGCAGG CTTAGGCTCG ACGAGTACGA GAAGGCTCTG TCGAGGGGCG CTAAGATAGT CGCGGTGACC CACGTGAGCA ATGTGACGGG CGTCGTGAAC CCAGTGAAGG AGCTGTGCAG GCTAGCGCAC GACGCTGGCG CCTTCTGCCT CGTAGACGGC GCTCAGAGCA CTCCGCATAT GCGGGTAGAC GTGAAGGATA TCGGGGCGGA CTTCTTCGCG TTCTCGGGGC ACAAGATGCT CGGACCCATG GGTACGGGCG GGCTCTACAT AAGGGGGGAT CTCGCCGAGA GGCTTGAACC CCCGTTCCCG GGCGGCGGCG CCATATCCCT GGTGAGCTGC GCGGAGGACT CGTGTAGCGC CGAGTGGCTC CACCCGCCCC ACAAGTTTGA GGCTGGAACT CCCAACGTTG CCGGCGCCGT GGGGCTCGCG AAGGCGGTGG AGTACTTGAG GAGCGTGGGC ATGGAGGACG TGGAGGAGCA CGAGAAGAGG CTTACCGCGA AGCTACTCGA GGTGCTGGAG GGTGTCGGAG CGAGGATTTA CGGCCCCAGG GACATGAAGG ATAGGCTCGG CGTGGTAAGC TTCAACCTAG ACGGGTTGAC GCCGCACGAG GTAGCCTCAA TGCTCGACTT GGAGGGCATC GCGGTGAGGA GCGGGCACCA CTGCGCCCTG CCGCTTGTGA AGAGGCTCGG GAGCCCGATG GGCACCGTGA GGGCTAGCTT ATACCTCTAC AACACGCCCG AGGAGGTGGA GTACTTCGGC GCAGTACTCG AGAAAATCAA AAAGCTAGCT ACGGGCTAG
|
Protein sequence | MLDPYRVRRD FPILSKKVHG KQLVYFDNAA TSQRPLQVVE AVESFYKSLN ANIGRSVHEL GLRATEAYED SRKIVAAFIG AKPEELVFTK NTTESINLAA YSLLVSGLIS RGDAIVVTRM EHHSNLLPWV RVAKLAGAEL RVVDVDDEGR LRLDEYEKAL SRGAKIVAVT HVSNVTGVVN PVKELCRLAH DAGAFCLVDG AQSTPHMRVD VKDIGADFFA FSGHKMLGPM GTGGLYIRGD LAERLEPPFP GGGAISLVSC AEDSCSAEWL HPPHKFEAGT PNVAGAVGLA KAVEYLRSVG MEDVEEHEKR LTAKLLEVLE GVGARIYGPR DMKDRLGVVS FNLDGLTPHE VASMLDLEGI AVRSGHHCAL PLVKRLGSPM GTVRASLYLY NTPEEVEYFG AVLEKIKKLA TG
|
| |