Gene Information |
Locus tag | Tpen_1180 |
Symbol | |
ID | 4602046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1121003 |
End bp | 1122259 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639773956 |
Product | radical SAM domain-containing protein |
Protein accession | YP_920581 |
Protein GI | 119720086 |
COG category | [R] General function prediction only |
COG ID | [COG0535] Predicted Fe-S oxidoreductases |
TIGRFAM ID | |
|
|
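The coordinate and length fields above are related by simple arithmetic, which can be used as a consistency check on the record. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming one stop codon at the end."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1121003, 1122259)
    print(length, protein_length(length))
```

For this record the check gives 1257 bp and 418 aa, which agrees with the Gene Length and Protein Length fields.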
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.832103 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGTAC GTATGAGCAG GGTTCTGCCC TGGTACGTCC CGCCCACGCT CTTCGTGCGG CAGGTTTTAT CCGCGGCGAC CGTCCGGAGT GCTACGGGTT GGTCTCCCGG CTCGCGGTGG GTGAAGTCGC TGTCCTCCCT GCTGAGAAGC GCCAACGGGG CTAAAGCCAT GGGGTGTTTC GGCTACGCCC CGCACCCCGT CTACGAGGTG ACTGCCGCGT GCAACTTGAG GTGTGCGCAC TGCCACGCGT CCGCCGGGAG GCCTTACCCC GGGGAGCTCG ACACCGAGGG CGCGAAGAGG GTTATAGAGA GCCTCACGAC CGTGAAGGAC TTCAGAACCC TCGTCTTCAC GGGCGGGGAG CCCCTCGTGA GGAAGGACAT ATGGGAGCTG ACCAGGCACG CGGTAGACCT GGGCTTCGGG GTCGTATTCG CAACGAACGG GGTTCTCGTC AGCGAGAGTG TCGCCGCCGA GATGCGGAGG CTGGGCGTCC TGGGCGCAGC CGTGAGCCTG GACTCTTCAA GGCCGCTTGT ACACGACAAG CTGAGAGGCG TCCCGGGCGC CTGGAGGGGA GCGGTGAGGG GGATCAAGAA CCTCTTGAAA GAGGGCCTAT ACGTGCAGGT AAACATAACC GCGAACAGGC TTAACGTCGA CGAAATAGAG GATGTAGTGA GACTCGCAGA CTCGCTGGGC TCCCACGTGA TCTTTCTCTA CACGTTCGTC TCCGTGGGGA GAGGATCGTC CAACGACTGG CTCTCCTTAA CCCCGGAGGA ATTCGTAAAG CTCTCCAGGA GGATCCTAAA GGTTCAGGGA GAGGTGCAGA GCCTAATCAT ACCAGTCGCC ATGCCGTGGT ACTTCCCCCT CCTCCTGCAG GAAGCCCGGC TCAAACCCGA GGTGGCGTCG AGGTGGGTAT CCGGGTGCAT AGCGGCTAGG GGGATGTTCT ACGTAAAGCC CAACGGGGAC GCCTGGCCCT GCGCCTTCAT ACCCGTCTCA GGGGGAAACG TCGCCCGGCA ACCAGCCATC GAGGTGTGGG AGGGAGACCT CTTCAAGGCG ATACGGAACA GGGAGAACCT CGAGGAGCCC TGCAGGAGTT GCAGGTTCAG GGAGGTCTGC GGAGGGTGCA GGTCGAGAGC CTACCTAGCA ACCGGGAGAC TGACAGCCCC AGACCCGCTG TGCCCCCTCG TGCGCCGAAG GCTAAGCACC TCAACGGCAC CGAGCGGTCC ACTACAGCCC GCCGCCAAGA GCGGCGAGCC TAACTAG
|
Protein sequence | MGVRMSRVLP WYVPPTLFVR QVLSAATVRS ATGWSPGSRW VKSLSSLLRS ANGAKAMGCF GYAPHPVYEV TAACNLRCAH CHASAGRPYP GELDTEGAKR VIESLTTVKD FRTLVFTGGE PLVRKDIWEL TRHAVDLGFG VVFATNGVLV SESVAAEMRR LGVLGAAVSL DSSRPLVHDK LRGVPGAWRG AVRGIKNLLK EGLYVQVNIT ANRLNVDEIE DVVRLADSLG SHVIFLYTFV SVGRGSSNDW LSLTPEEFVK LSRRILKVQG EVQSLIIPVA MPWYFPLLLQ EARLKPEVAS RWVSGCIAAR GMFYVKPNGD AWPCAFIPVS GGNVARQPAI EVWEGDLFKA IRNRENLEEP CRSCRFREVC GGCRSRAYLA TGRLTAPDPL CPLVRRRLST STAPSGPLQP AAKSGEPN
|
| |
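The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method. It assumes the sequence is pasted as plain text with the space-separated 10-base groups shown above. The codon table is built from the standard NCBI amino acid string, and translation table 11 assigns the same amino acids to codons as the standard table (it differs only in permitted start codons). All names are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. the 10-base grouping) and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Codons with ambiguous bases are reported as 'X' (an assumption).
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First two 10-base groups of the gene sequence above.
    print(translate("ATGGGTGTAC GTATGAGCAG"))
```

Applied to the full gene sequence, the output of `translate` can be compared with the Protein sequence field (after removing its spaces), and `gc_content` with the GC content field.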