Gene Tpen_1288 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1288 
Symbol 
ID4600596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1229619 
End bp1230542 
Gene Length924 bp 
Protein Length307 aa 
Translation table11 
GC content67% 
IMG OID639774064 
ProductCRISPR-associated Csm3 family protein 
Protein accessionYP_920689 
Protein GI119720194 
COG category[L] Replication, recombination and repair 
COG ID[COG1337] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) 
TIGRFAM ID[TIGR02581] CRISPR-associated RAMP protein, SSO1426 family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0685165 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGCCAGG TCCCGTGGTC CTCGCACAGG GTCCTCCTCA GGGAGGCAGT CTTCAGGGGC 
TACCTCGTCG CCGAGTCTCC GCTGAGGGTG GGCGCTGGCA GGGAGGCGCC GCTGGGCTCC
CCGGTGGACC TCTCGGTCCT CAGGGTTAGG CTTGGCGGTA AAAGCGTCCC CTACATACCC
GGCAGTAGCC TGAAGGGTGT GTTCAGGAGC TTCTCCCAGT CCCTCGCAGT GGCGAAGGGG
CTAAGCGTGT GCAGCGGGCT GAGCGGGGAG ACGTGCATGG ACTACGAGGA CCCGTCGCTG
GGCGGCGAGA AGTTGCTGAG CTACTTGCAG GGGCTGATGA GGGAGGGGCA CAGCCTGGAA
GCCGTCAGGC TGTTCCACGA GAAGGCCTGC CTGATGTGCA AGGTCTTCGG CGCCCCCTCG
TTCTCCGGGC ACGTCGAGTT CAGCGACGCC TACCCCGTCG ACGAGAAGGG GGGCGTCGTC
GACGTCTCCA CCGGGGTGAG GACGGGCATA GCGATAAACA GGAGGACGGG CGCCGTCTAC
GAGAGGGCCC TCTACCAGGT CGAGTACGTC GAGCCGGGCG CGAGGTTCAG GTTCGAGGCT
AGGACTACGA ACCTCCCCAA CTACGCGCTC GGGCTCCTCT CCGCCGTCAT CAGGATGATG
AACGAGGGCT GGGTCAGGGT GGGCGGCTTC AAGACGAGGG GCTTCGGAGA GGTGCGCGTG
GAGGGCCTGG AGTTCGCGGC TAGGGGCGCG ACTGTCCGCG GCTCGGTGCT GGTGAAGCTC
GACGACTACG ACTCGGACGT CGACCTCTCG GGCCTCGCCG AGGCCAGGGA CGGGTGGCTA
AGGGCCTCCG GGGACTCCGC CTGGAAGGCG CTGGCTAAGC TCGAGGAGGT GTGGTCCAAT
GCAAGCTTCG GGAAGCGCGG TTAG
 
Protein sequence
MSQVPWSSHR VLLREAVFRG YLVAESPLRV GAGREAPLGS PVDLSVLRVR LGGKSVPYIP 
GSSLKGVFRS FSQSLAVAKG LSVCSGLSGE TCMDYEDPSL GGEKLLSYLQ GLMREGHSLE
AVRLFHEKAC LMCKVFGAPS FSGHVEFSDA YPVDEKGGVV DVSTGVRTGI AINRRTGAVY
ERALYQVEYV EPGARFRFEA RTTNLPNYAL GLLSAVIRMM NEGWVRVGGF KTRGFGEVRV
EGLEFAARGA TVRGSVLVKL DDYDSDVDLS GLAEARDGWL RASGDSAWKA LAKLEEVWSN
ASFGKRG