Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1354 |
Symbol | |
ID | 4602191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1307106 |
End bp | 1308806 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639774129 |
Product | CRISPR-associated helicase Cas3 |
Protein accession | YP_920754 |
Protein GI | 119720259 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.245938 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCTTGC AGGAGTACTG GGGGGTAGTG GAGAGGGTCG TCTCGTCGAG GAACAGGAGA CCGGTGCGCT ACTCGTACTT CGAGGACGTC TGGAGGGCCG TAGACGAGGG GCACAGGCTG GTCGTCGTAA GGGCACCCAC TGGTTGCGGC AAGACGGAAG CGGCCACCGC GCCGTTCATC AGCGACGCGG CTAAAGGCTC GAGGCGCTGG GTCTCGCTCG TCTACGCGCT ACCCACCCGC TCCCTCGCCT CGGCGATGCT GAGGAGACTG TCCCGCTCGC TGGCCGCGGC GGGGGCGGAG TGGACCACCG CGACGCTGAG CTACGGGGGC CTCTGGGAGG CCAGGCCGTA CCTCGAGGGG GACGTGGCAG TCACGACGTA CGACACGCTC CTCCACCAGT TCTACGGCGT CGTCTCCCCG GGATACCACC TCCTCCTGCC CGCGGCGAAA GTCTCTGGCT CGCTCGTGGT CCTCGACGAG ACCCACCTGC TCCAGGACGC CCACTGGTAC GCCCCGAGCC TTCTCCCGGC CCACGTCGCG TCCCTCGTAT CCCTGGGGGC CCAGGTGCTC GTGGTGGGGG CCACGGTGCC GGAGGTCCTG CTCGAGGAGC TACGGAGGGA GTACAGGCTT GTGAGCCGCG GGGAGGAGCC GGCAGTGGTA GACGCGGCGG ACGAGCCGGC GAGGGGCAGG CTGGACGTCG AGCTCAGGGG CGGCGGGATG CCCGTGGAGG GGCTGTGCAG CCTCCTCGAG GGCGCCCCGA GGCCCGCGCT CGTCGTGGTG AACAAGGTCG AGAAGGCCGT CGAGGCCTAC AGGGCGCTCC GCTCCTGCCT CGGGGGGAGC GTTGCGCTAC TGCACTCGAG GCTGAGGGGC GGCGTCAGGG CGCGCGTTGA GGGGCTGTTC GAGGGGGACG GCGCCCCCGG GGACCTAGTC CTGGTCGCAA CCCAGGTCGT GGAGGCGGGG CTGGACCTCG ACGTGCGCTT CCTCGTGACC GAGGTCTCGC CTGTCGACTC GCTGATACAG AGGCTGGGCA GGTGCGCGCG GAGGAGCGAC GGCCACGCCG TCGTCTTCCT GGACGAGGAG GCGGCGCGGA ACGTCTACCC GAGGGAGCTC GTCGAGAGAA CGCTGGGGGT CGTCGACGCC CAGTCGCTGG CGGAGAGCGT TAGGAGGCTA AGCGTTGCCC GCGAGCTCGT AGACGGGGTG TACGCGGCGG AGGTCGTCGA GAGGCTTAGG AAGGGGTGGG AGCGCGCCCT GAGCGAGGTG AAGGGCTGGG CCTTGAGGTT CCCGCGAAGC CTCCTCCACA AGGAAGCGCA CAGAGAGCCG GGGCCCCTAC TCAGGCTCGG CTACGAGGTC GCCTGCTACC TGCCGGGGAG CGCTGGCGAG TACGAGGCGT TGCTGGGCGG CGGGGAGACC GCAGTATCCC TGGAAAGGCT CAGGGACTAC ACCGTGAGGC TGTCCGTGGA GGGGCGCGGA GAGGCCCCGG CGGCCGTCGT CCACGAGGTC GGCGGCCGCG AAGTCGTCGT CGCGCTCGAG TACAAGCGGG TCGACGGAGG GCTAGCCCTG AGGGGGAGGC GCATGGAGCC CCGCGCTTTC CCCCGCGCCG TCGAGTCCGG CGAGCTCTTC CTGCTGAACC CGGCGTTCTA CCTGAGCGAG GGCGGGGACG AGCTCGGGGT GGTTCGGCCG TGGAGGTCCC GGAGTGCGTA G
|
Protein sequence | MSLQEYWGVV ERVVSSRNRR PVRYSYFEDV WRAVDEGHRL VVVRAPTGCG KTEAATAPFI SDAAKGSRRW VSLVYALPTR SLASAMLRRL SRSLAAAGAE WTTATLSYGG LWEARPYLEG DVAVTTYDTL LHQFYGVVSP GYHLLLPAAK VSGSLVVLDE THLLQDAHWY APSLLPAHVA SLVSLGAQVL VVGATVPEVL LEELRREYRL VSRGEEPAVV DAADEPARGR LDVELRGGGM PVEGLCSLLE GAPRPALVVV NKVEKAVEAY RALRSCLGGS VALLHSRLRG GVRARVEGLF EGDGAPGDLV LVATQVVEAG LDLDVRFLVT EVSPVDSLIQ RLGRCARRSD GHAVVFLDEE AARNVYPREL VERTLGVVDA QSLAESVRRL SVARELVDGV YAAEVVERLR KGWERALSEV KGWALRFPRS LLHKEAHREP GPLLRLGYEV ACYLPGSAGE YEALLGGGET AVSLERLRDY TVRLSVEGRG EAPAAVVHEV GGREVVVALE YKRVDGGLAL RGRRMEPRAF PRAVESGELF LLNPAFYLSE GGDELGVVRP WRSRSA
|
| |