Gene Tpen_1364 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1364 
Symbol 
ID4601189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1320555 
End bp1321550 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content61% 
IMG OID639774139 
ProductCRISPR-associated RAMP Csm3 family protein 
Protein accessionYP_920764 
Protein GI119720269 
COG category[L] Replication, recombination and repair 
COG ID[COG1337] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) 
TIGRFAM ID[TIGR02582] CRISPR-associated RAMP protein, Csm3 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.152011 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGCGC TTCAAAGGGA GCTACGGGGC ATAGTCAGGC TAGGCTTGAA GATGGAGACA 
GTCACGGGCC TCCTGATCAG GATGCCCGTC CACGCCCAGG TATACAGGAT CGGGGGCGCA
GACCTCTACC CGATCGTGAC CAAGCGCAAG TACAGCCTCG GCGGCGCGGA AGTCGAAGTG
GAGGTTCCGC TGGTCCCCGG CTCCAGCCTG AAGGGGAGGA TGCGCGGGCT CTTGGAGCTC
TCGCTCGGCA AGCGCCTTTA CACCTCGGAC GAGAGGATAT GGCTCCATGC GAGAATCCTC
TGGGGTACCC GTGCTCATCC GATGTCCACA GACGAGCTAA TTCGGGACAT AAGCGGTAGG
TGCGAAGTAG ACGAAGTATT TGGATCTCCC GCCGTGGGCT TCGACCAGCT CGTCGAAATG
CTTGCCAGAG ACTTAGCTTC GAAGCAGGGC AAAGAGGAAC CCGATGACGC CGACTACGAA
GAGGCGGGGA AGGTTGCCCG AAGCCTGGCG CTGAACCTTG CGCTAACGAG GCTGCTGGTC
GACGACATGA CCCCGGACAA GGACTACGTT GGCGTGCTCA GCGAGAACGG CAGGAGGATC
CTGGCGATCT CGGACTTCCT CGAAGAGAAA GGCGAGAACA GGATCGACAG GATAACCTCC
GCGGCTGACC CGAGGCAGGT TGCCCGCGTC AAGCCGGGGG TGGTGTTCCA GGGAGCGTTG
AGGCTACTCG TTTTCGACAT CGACAAGGGG TACGTGAAGA GGAACCTCGA GCTAGTCGCC
AAGGGGCTCA GGCTGGTGGA GGAGACGTAC CTGGGCGCCT CCGGTAGCAG GGGGTACGGC
AGGGTGAAGT TCAAGGACAT CTCGGTGGAG GTAATGAAGA CCTTCGGCGC GGAGAAGCCC
GAGTACAGGA AGCTAGCGGA CCTAGCTAGC GTCGAAGCTC TGCTGGGCAA GGTGGACGAG
ATAGCAGGAG AAGTGGAGAA GACGCTCTTC GGGTAG
 
Protein sequence
MSALQRELRG IVRLGLKMET VTGLLIRMPV HAQVYRIGGA DLYPIVTKRK YSLGGAEVEV 
EVPLVPGSSL KGRMRGLLEL SLGKRLYTSD ERIWLHARIL WGTRAHPMST DELIRDISGR
CEVDEVFGSP AVGFDQLVEM LARDLASKQG KEEPDDADYE EAGKVARSLA LNLALTRLLV
DDMTPDKDYV GVLSENGRRI LAISDFLEEK GENRIDRITS AADPRQVARV KPGVVFQGAL
RLLVFDIDKG YVKRNLELVA KGLRLVEETY LGASGSRGYG RVKFKDISVE VMKTFGAEKP
EYRKLADLAS VEALLGKVDE IAGEVEKTLF G