Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0457 |
Symbol | |
ID | 4601860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 415828 |
End bp | 417387 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639773224 |
Product | radical SAM domain-containing protein |
Protein accession | YP_919869 |
Protein GI | 119719374 |
COG category | [C] Energy production and conversion |
COG ID | [COG1032] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.211287 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCAGAGG ATGTAGAGGT TGTCCGGGCG CGAATGATTG CCGAGGACAA TGTGGTCAAG AAGAAAGCCG GGCGGGGTAC CATTAGGGTT GCACTGCTCT ACCCCTCTCT ACCGACCGTT GCGCTGGACT CCCTCTCGTA CCAGATGCTG TACTACTGGT TGAACTCTTT GGATGATGTC TACGCGGAGC AGTTCATGCT CGACTTCGAT GGGACTCCTT TAACGCGTAG CATCGAGACT GGCACCCCGC TACGAGACTT CGACTACGTG GTTATCTCGG TGCATTACGA GCTCGACTTC GTTAACATAC TTAGAGTTCT CCTTGAAGCC GGGGTAGAGG TTTACAGCGA GAGGCGCGAG AAACCGGTTG TGATAGCCGG AGGACCACCG GTTATAGCGA ACCCCGTGCC GCTTTCGCCT TTCGTGGACG TTCTCGCAGT GGGCGAGATC GAGCCTTTAA TGCCCGTACT GGTAGATGGC ATGGCTGGGT ACAGGGGGGA CAAGAGAGCG TTCCTAGAGA ACTTGGGCGC CGAGAAGGGG TTCTACGTCC CTCTGCTCCA CGGCGGGGAG GAGGTTGTCT TCAATTACGC AAGGGAGTTG TCCAAGGAGT TTTCGCCGAG CGTCCAGCTA CAGCTTTTAG ACCCGCGTGC AAAGTGGAGG AGGAGAACCG CCGTCGAGAC GAGCAGAGGA TGCTTCAGGA TGTGCGCGTT CTGCCTCGAG GGACACATAT TCCGAGGAGT TAGGGAGAGG CCCTTCGAGC AGGTAAGAAG AATAGTCGAA GAGGGCTCCG CGGGGAATAG AACCAGGCAC GTAAAGCTCG TAAGCCTCTC GTTTTTCGAC CACTCGGAGG CAGACCGCAT ACTCGAGTGG CTGGTGAACG AGGGGTACTC GTTTTCGATC CCCTCTCTCA GGGCCGACAC CCTGAACGAG AGAAGGCTCG AGTATATCAG GCTTGGCGGC CAGAAAACGC TCACGGTAGC TCCTGAAACA CTGTCTCCAA GCCTTGCGAT CGCTATCAGG AAGCACATAG GCTACAGCCT CATCCGCGAG CTTTCCCTCT CGGCACGGAA GCTCGGTTAC ACAGGGCTAA AGGTATACCT AATGGTAGGC ATCCCGGGCG AACGCGAGGA AGACCTGAGG CTGACCGCCG AGAAGCTTAG ACAACTCGCG TCGGAGACGG GGTTTAAGGG GGAGAGAGCC CTCAAAGTCA CTGTCAGCCC TCTCGTGCCG AAGCCGCATA CACCATTGCA GTACGCGCCG TTCGTAGGAA TTCAGGAGGC TCGACGCCGC ATAGAGATCA TAAGAAGAGA GCTTCGAGGC CTCGCGGATG TACGAGAGTA CGACCCCAGG CTAGCTTACA TACAGACAAT AATTGCTAGG GGAGACTCGA TGCTCTCGCA GGTTCTCCTC TACTGGGCCC TGGAGGGGGG AGGTCTTGGA GGGTGGAGGA AGGCTCTTAG AGTTACAGGC GTGGACGTAC AGAGATATCT CGCACCAAGC CCCGAGGAGG AGCCGCCGTG GGGGTTCATA AGACTGCCAG GCGTGAGAGG CACTCGGTAG
|
Protein sequence | MPEDVEVVRA RMIAEDNVVK KKAGRGTIRV ALLYPSLPTV ALDSLSYQML YYWLNSLDDV YAEQFMLDFD GTPLTRSIET GTPLRDFDYV VISVHYELDF VNILRVLLEA GVEVYSERRE KPVVIAGGPP VIANPVPLSP FVDVLAVGEI EPLMPVLVDG MAGYRGDKRA FLENLGAEKG FYVPLLHGGE EVVFNYAREL SKEFSPSVQL QLLDPRAKWR RRTAVETSRG CFRMCAFCLE GHIFRGVRER PFEQVRRIVE EGSAGNRTRH VKLVSLSFFD HSEADRILEW LVNEGYSFSI PSLRADTLNE RRLEYIRLGG QKTLTVAPET LSPSLAIAIR KHIGYSLIRE LSLSARKLGY TGLKVYLMVG IPGEREEDLR LTAEKLRQLA SETGFKGERA LKVTVSPLVP KPHTPLQYAP FVGIQEARRR IEIIRRELRG LADVREYDPR LAYIQTIIAR GDSMLSQVLL YWALEGGGLG GWRKALRVTG VDVQRYLAPS PEEEPPWGFI RLPGVRGTR
|
| |