Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1317 |
Symbol | |
ID | 4601997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1264915 |
End bp | 1265802 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639774092 |
Product | CRISPR-associated RAMP Cmr6 family protein |
Protein accession | YP_920717 |
Protein GI | 119720222 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1604] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) |
TIGRFAM ID | [TIGR01898] CRISPR-associated RAMP protein, Cmr6 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00379803 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGACCTGC TGGGCCTGGT TTACGGATAC GTCGACGAGC TCCTGAAGAA CCCGGAGGCG GAGAAGGAGG AGCTGAGGAG GAGGCTCATC GAGAGAATAG TCGAGAACTA CTCGCGGGGA CGCGAACTGG ACGAGAAGCT GAGGCTGTCC AGGGAGAGGC TCGAGAGCCT AGCCGCGGGG CTCGCGCTGG CGGGCTTCCA CGTCTTCGTG GTTGACGCGA AGCTCTCCAC GATGGGAGCC GTCGGGGTGT CGCACGGGGT TTTGAGAAGC GTGTTCGAGG TGGGGGTTAG CTGGGACCAC GTGCTGGACC TGCCGTTCAT ACCGGGCTCC AGCGTCAAGG GCGCCGTGAG GGCGGTCGCA GAGCGCGCCT CGAGGGACGA CGCGGACGTC CTGTTCGGGA GGGGCGGGGA CTCGGGGTGG GCCGGGCTAC TGCTCTTCTT CGACGCCTAC CCCGTCGAGG CCGGGGACAG GCTCCTCGAG CCAGACATCG TGACGCCGCA CTACAGCAGG GGCGGGAGGC CCGTGAGGTT CGAGTACGAG GTCGAGCCCG TGCCCGTGGC GCACGTCTCG ATCGCGCCTG GGGCCGTCTT CAGGTTCGTC GTCGCGGTCG AGCCGGGAGG GGGCGTCCTC CACGAGGAGG TGGACCGCGC CCTCGCGAAC ATCTGCGGGA GGCTCGGCGT CCAGGGCGGG GGGCCGGCGT CGCTCCTGCT AGGCCTCTTG GAGTACGCGC TCGCCAGCGG CGTTGGGGCG AGGACGAGCC AGGGCTACGG GAGGTTCGAG GTCGTCTCGC GCTCCGCGGT GATCAACGGG TCGGAGCACA GGACGCCCCT CCGGGTGAAG CCCGCGAGGG CTAGGAGGGG CAGGTCCACT CGTCCCGGTG CGGGGTGA
|
Protein sequence | MDLLGLVYGY VDELLKNPEA EKEELRRRLI ERIVENYSRG RELDEKLRLS RERLESLAAG LALAGFHVFV VDAKLSTMGA VGVSHGVLRS VFEVGVSWDH VLDLPFIPGS SVKGAVRAVA ERASRDDADV LFGRGGDSGW AGLLLFFDAY PVEAGDRLLE PDIVTPHYSR GGRPVRFEYE VEPVPVAHVS IAPGAVFRFV VAVEPGGGVL HEEVDRALAN ICGRLGVQGG GPASLLLGLL EYALASGVGA RTSQGYGRFE VVSRSAVING SEHRTPLRVK PARARRGRST RPGAG
|
| |