Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1364 |
Symbol | |
ID | 4601189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1320555 |
End bp | 1321550 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639774139 |
Product | CRISPR-associated RAMP Csm3 family protein |
Protein accession | YP_920764 |
Protein GI | 119720269 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1337] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) |
TIGRFAM ID | [TIGR02582] CRISPR-associated RAMP protein, Csm3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.152011 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCGC TTCAAAGGGA GCTACGGGGC ATAGTCAGGC TAGGCTTGAA GATGGAGACA GTCACGGGCC TCCTGATCAG GATGCCCGTC CACGCCCAGG TATACAGGAT CGGGGGCGCA GACCTCTACC CGATCGTGAC CAAGCGCAAG TACAGCCTCG GCGGCGCGGA AGTCGAAGTG GAGGTTCCGC TGGTCCCCGG CTCCAGCCTG AAGGGGAGGA TGCGCGGGCT CTTGGAGCTC TCGCTCGGCA AGCGCCTTTA CACCTCGGAC GAGAGGATAT GGCTCCATGC GAGAATCCTC TGGGGTACCC GTGCTCATCC GATGTCCACA GACGAGCTAA TTCGGGACAT AAGCGGTAGG TGCGAAGTAG ACGAAGTATT TGGATCTCCC GCCGTGGGCT TCGACCAGCT CGTCGAAATG CTTGCCAGAG ACTTAGCTTC GAAGCAGGGC AAAGAGGAAC CCGATGACGC CGACTACGAA GAGGCGGGGA AGGTTGCCCG AAGCCTGGCG CTGAACCTTG CGCTAACGAG GCTGCTGGTC GACGACATGA CCCCGGACAA GGACTACGTT GGCGTGCTCA GCGAGAACGG CAGGAGGATC CTGGCGATCT CGGACTTCCT CGAAGAGAAA GGCGAGAACA GGATCGACAG GATAACCTCC GCGGCTGACC CGAGGCAGGT TGCCCGCGTC AAGCCGGGGG TGGTGTTCCA GGGAGCGTTG AGGCTACTCG TTTTCGACAT CGACAAGGGG TACGTGAAGA GGAACCTCGA GCTAGTCGCC AAGGGGCTCA GGCTGGTGGA GGAGACGTAC CTGGGCGCCT CCGGTAGCAG GGGGTACGGC AGGGTGAAGT TCAAGGACAT CTCGGTGGAG GTAATGAAGA CCTTCGGCGC GGAGAAGCCC GAGTACAGGA AGCTAGCGGA CCTAGCTAGC GTCGAAGCTC TGCTGGGCAA GGTGGACGAG ATAGCAGGAG AAGTGGAGAA GACGCTCTTC GGGTAG
|
Protein sequence | MSALQRELRG IVRLGLKMET VTGLLIRMPV HAQVYRIGGA DLYPIVTKRK YSLGGAEVEV EVPLVPGSSL KGRMRGLLEL SLGKRLYTSD ERIWLHARIL WGTRAHPMST DELIRDISGR CEVDEVFGSP AVGFDQLVEM LARDLASKQG KEEPDDADYE EAGKVARSLA LNLALTRLLV DDMTPDKDYV GVLSENGRRI LAISDFLEEK GENRIDRITS AADPRQVARV KPGVVFQGAL RLLVFDIDKG YVKRNLELVA KGLRLVEETY LGASGSRGYG RVKFKDISVE VMKTFGAEKP EYRKLADLAS VEALLGKVDE IAGEVEKTLF G
|
| |