Gene Tpen_1321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1321 
Symbol 
ID4602001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1273626 
End bp1274609 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content60% 
IMG OID639774096 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_920721 
Protein GI119720226 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00529939 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGTTGC TCGTGCTACG CGGCGTAGAC GAGGTGACCG TGAGTAGCAG GAGCACGGTG 
GTGATCAAGT CCGGGAACAG GGTGTTCGAG CGGGCTCTCA GGGACGTCGA CGCAGTCCTC
GTCGTGGGCT CTGGGATCAA GATATCGTCC TCCCTCCCGC CCGTGTTGGC GCTCCACGGC
ATACCTCTCT CGATACTCGC GAAGGGGCAC GTCGCTGTGC TACTGAACCC TGTCGGGACG
AAGTATAACA ACTACAGGGC CCTCCAGTAC ACGTTGCCGA AGAACAAGGC GCTCGCAATC
GCGCTCGAAT ACCTCAAGTC CAGGGTGCGC GGAATGGCGA GCATAATCAG GAACCGCGGG
GGAAGGCTCC CCGCGCTCCC CGAGCCTCCC GACCCAGCGC TCTACGAAGA CCCGGCGAGG
CTCGAATCGG ACATAAGATC CTGGGAGGCC GCCGCCTCGA ACACCCTCTG GGACGAGGTC
TTCAAGCTAC TGGACCCATC CGCGGCCAGG GAGCTCAGAG AGAGATACGG CTTCGCGGGG
AGGAAGCCCG GGCACCCGGA CCCCCTCAAC AAGGCTATCT CCGCCATGTA CGCAGTCCTC
TACACGCTCT CGACGAAAGC ACTCGTAGCC GCCGGGCTAG ACCCCACCTA CGGCTTCCTG
CACAGAACCC AGTACAGCGT GCCGCTAGCG TTCGACTACG CCGAAGCCTT CAAGCCCTTA
GCAGTGGAAG CCGCGCTGGA CCTCGTAAAC GAGGAAGGCC TCCCAACGCT GAGCGAAGAC
GGCGACCTCG ACAAAGACTC CCTCAACAAG GCCATGAAGA GGCTCTACAG GTACCTCTCA
GCCAAGCACA GGGAAACCGG GAAAACACCC TACCAGCAGA TACACCTGAA GGCATTCTGC
CTCGCAAAAC ACCTGGAAGG CAAGTGTAGC AGAGAAAAAC TCGCCTTCAC GTGGGACAAA
AAGCGCTACA TAATACACGA GTAA
 
Protein sequence
MRLLVLRGVD EVTVSSRSTV VIKSGNRVFE RALRDVDAVL VVGSGIKISS SLPPVLALHG 
IPLSILAKGH VAVLLNPVGT KYNNYRALQY TLPKNKALAI ALEYLKSRVR GMASIIRNRG
GRLPALPEPP DPALYEDPAR LESDIRSWEA AASNTLWDEV FKLLDPSAAR ELRERYGFAG
RKPGHPDPLN KAISAMYAVL YTLSTKALVA AGLDPTYGFL HRTQYSVPLA FDYAEAFKPL
AVEAALDLVN EEGLPTLSED GDLDKDSLNK AMKRLYRYLS AKHRETGKTP YQQIHLKAFC
LAKHLEGKCS REKLAFTWDK KRYIIHE