Gene Information |
Locus tag | Tpen_1228 |
Symbol | |
ID | 4601725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1166174 |
End bp | 1167721 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639774004 |
Product | radical SAM domain-containing protein |
Protein accession | YP_920629 |
Protein GI | 119720134 |
COG category | [C] Energy production and conversion |
COG ID | [COG1032] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
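The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that `Start bp` and `End bp` are 1-based inclusive coordinates and that the gene length includes the stop codon. Neither convention is stated in the record, but both agree with the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not contribute a residue.
    return codons - 1 if includes_stop else codons


# Values taken from the record for Tpen_1228.
length = gene_length(1166174, 1167721)
print(length)                  # 1548
print(protein_length(length))  # 515
```

Both results match the `Gene Length` (1548 bp) and `Protein Length` (515 aa) fields.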
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.641175 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTTTTCG AGGTAGTTAT AACGTCCGAT AGGACAATGA TAACGGACCA CCACGGCAAG GAGTTCATAG GCTTCATGGC CACTGGGCCC GCTATAGGGG TCCCGGAGAG GCTCTGGATG TGGGCGTGCT GCCCGAAGCC CAAGGTCGAC AGGCTCGGCA GGCCGCGCGT AGCGCCCTAC GGGCTCAGGA AGATAGAGGC GAAGCTCCAG GAGGCTGGCT TCAACGCCGC CATCGTCGAC CCAGACCACT TGGACAAGCA CTTGGACACT ATGAAGGTGC TCCTAGTGGG GCACCACGAC TACTTTGCTT ACGGCCCGCC GAGCAGCGAG TGGTGGGTTA TCACGGGCAG GGAGCCTGTT AACAGGAGGA GCTTCAGGAG GCTCATGGAG TCCCCCGCTG TGCGCAAGGC GAAGGAGAAA GGCGTGAAGA TAATCGCCGG GGGGCCCGCG GCGTGGCAGT GGCTCTGGGA GCTGGAGAGC TGGAAGAAGT GGGGCGTGGA CACCGTCGTC GACGGGGAGG GTGAGGGGGT CGTCGTGGAC CTAGTCGAGA AGGTTTACAG GGGGGAGCCG CTCCCAGAGT ACGTCTACGT GAGCCCCCGC GACGCTCCAA GCATAGAGGA GATACCGGTG ATCAGGGGCG CCAGCGTCAA CGGGCTGGTA GAGATAATGA GGGGTTGCCC CAGGGGCTGC AGGTTCTGCT CCGTGACGCT GAGACCCCTG AGGTTCATGC CCATAGAGAA GGTTGTGGCG GAGGTCAGGG TCAACGTGAG GGCTGGGCTG AGGAACGTCC TGCTACACAG CGAGGACGTC CTGCTCTACG GCGCCGACGG CGTAAAGCCG AGGCCCGAGC CCGTCCTAAA GCTCCACGCC GAGGTGCTCA AAGAGGCACC CGGCAGCGTC GCGTGGTCCC ACGCAAGCCT ATCCGCCGTG AAGTACGCCG AGGATAACTA CAGGCTGGTA TCGCGATTAA TGGAGATGCT GAGCGAGAGG CAGGAGATAC TTGGGGTGGA GGTCGGGATA GAGACGGGTA GCGCGAGATT GGCGAGGGAG GTCATGCCGG CGAAGGCGCT ACCCTACAGG GCGGAGGAGT GGGTGGAGGT CGTGAAGGAC GCCTTCGCGA TAATGCACGA CAACAGGGTT GTCCCAGCGG CTACGCTTAT ACTCGGGCTA CCCGGCGAGA CCCCGGACGA CGTCGTCAAG ACCGCGGAGC TCGTCGACGA CTTGAAGCCC TACAGGAGCC TCATAGTGCC CATGCTCTTC GTCCCCATGG GGAAGCTGAA GAACATGGAG AAGTTCAGGA GGGAGATGAT AACCAGGGAG CACGTAGAGG TCATGAAGGC TTGCTTGAGG CACGACCTCT ACTGGGCCCG GGAGATAATG GGCAAGTTCT ACCTCAAGGG GGCGCACATG GCGCCTTTAA GGTTCTTCCT CGAGGCCTTC ATATCCTACG TTGAGCGTAG AGCGTCGAGG ATTGACGAGG AGATCAAGCA ACTCTTCGAA GAAAAGCAAG CCCTAGAGAG GCGGCGGGAA AGCGTCGTCC GCGCCTAG
|
Protein sequence | MVFEVVITSD RTMITDHHGK EFIGFMATGP AIGVPERLWM WACCPKPKVD RLGRPRVAPY GLRKIEAKLQ EAGFNAAIVD PDHLDKHLDT MKVLLVGHHD YFAYGPPSSE WWVITGREPV NRRSFRRLME SPAVRKAKEK GVKIIAGGPA AWQWLWELES WKKWGVDTVV DGEGEGVVVD LVEKVYRGEP LPEYVYVSPR DAPSIEEIPV IRGASVNGLV EIMRGCPRGC RFCSVTLRPL RFMPIEKVVA EVRVNVRAGL RNVLLHSEDV LLYGADGVKP RPEPVLKLHA EVLKEAPGSV AWSHASLSAV KYAEDNYRLV SRLMEMLSER QEILGVEVGI ETGSARLARE VMPAKALPYR AEEWVEVVKD AFAIMHDNRV VPAATLILGL PGETPDDVVK TAELVDDLKP YRSLIVPMLF VPMGKLKNME KFRREMITRE HVEVMKACLR HDLYWAREIM GKFYLKGAHM APLRFFLEAF ISYVERRASR IDEEIKQLFE EKQALERRRE SVVRA
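The gene sequence is listed in coding orientation (it begins with GTG and ends with TAG) even though the gene lies on the minus strand, so it can be translated directly. The following is a minimal sketch of such a translation, not the database's own pipeline. It assumes NCBI translation table 11, whose codon assignments are the same as the standard code and which accepts alternative initiation codons such as GTG that are read as methionine at the first position. The helper names `clean`, `translate_cds`, and `gc_fraction` are hypothetical.

```python
# Codon table in NCBI "TCAG" order; table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the 10-base grouping spaces used in the record and upper-case."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding-orientation CDS, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        # An alternative start codon at position 0 is translated as Met.
        if pos == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence from the record.
print(translate_cds("GTGGTTTTCG AGGTAGTTAT AACGTCCGAT"))  # MVFEVVITSD
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare against the `GC content` field.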