Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_2679 |
Symbol | |
ID | 8604022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 3128254 |
End bp | 3131151 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | CRISPR-associated helicase Cas3 |
Protein accession | YP_003300269 |
Protein GI | 269126899 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000878559 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATCGAAG GGGAGGAGCC TGGCCGGCTG GGAGAGGGCC TGTCGGCTGT CGCCCGGGCC GTGTGGGCCA AGCATGACCG CGCGGCTGAA GGATGGCTGC CGCTGTGGCG GCACATGGCT GACAGCGGCG CGGTAGCGGG ACTGCTGTGG GATGAGTGGA TTCCTCACCA AGTACGGCGA CTGATCTCCG CGGAGTTGCC GGGAGGTGCC GAGGACGCTC GGCGGCTGGT GGTGTGGCTG GCGACTACGC ATGACATCGG GAAGGCCACG CCGGCGTTCG CGTGTCAGGT GGAGGCACTC GCTGCCGCGA TGCGAGATGC CGGGTTGGGC ATGCCCGGCC ACCGGGAGAT GATCGATCGG GGGCTTGCCC CGCATGGGCT GGCGGGGCAG TTCCTGCTAC AGGAATGGCT GATAGAGCGG CACGGGTGGA ACCGATCCGC GGCGCTGCAG CTCGCGGTCA TTGCCGGTGG GCATCATGGT GTTCCGCCGG CCTACCAGAA CGTTCATGAT CTGAATGCCC GCCGCTACCT GCTTCGCACG CCCGATCGCG AGTCGGCGTG GCGGAGCGTG CAGGAGGAGC TGCTGGACGC TTGTGCGGCG GCTTGCGGCG TCCGTGATCG GCTGGACGAC TGGCGGCGGG TGAAGCTGTC ACAGCCGGCA CAGGTGCTGC TGACGGGATT GGTGATCGTC GCTGACTGGA TCGCCAGCAA CCCCGATCTT TTTCCCTACT TCCCTGAGGC CGCGCACATA AGTGACCGCG AGCGCATCGA GGCGGCCTGG CGCGGTCTGG CGCTGCCGGC TCCTTGGCGG GCGGTGGAGC CGGATGAGGA CACAGATGAG CTGTTCGCCG CACGGTTCGC GCTGCCCCTG GGAGCGCGTG TCCGGCCGGT GCAGGAGCAA GCGGTGAAGC TGGCGCGCGG CATGGCCGCA CCGGGGCTGA TGATCATCGA GGCGCCGATG GGGGAAGGGA AGACCGAGGC CGCGCTGGCG GTCGCCGAGA TCTTCGCGGC GCGTTCCGGG GCGGGTGGCT GCTATGTGGC GCTGCCCACC ATGGCGACCG GCAATGCGAT GTTCCCCCGT CTGCTGGAGT GGCTGCGGCG GCTCCCGGCC GGCGGCTCAG GCGAAACGCA CTCTGTGTAT CTGGCGCATT CGAAGGCGGC GCTGAACGAG AAGTACGTCG AGCTCATGCG CACCGGCCGG CGGACGGTGA GCGCGGTGGA ACTGGACGGC CCCGGTGACG AGCGGCGACA TCACGGTGAT CGACGTGTCC ACTCCGCCGA GCTCGTCGCC CACCACTGGC TTCGCGGCAG GAAGAAGGCG ATGCTCGCCT CCTTCGGGGT GGGGACCATC GATCAGCTGC TGTTCATGGG GCTCAAGAGC CGGCACGTGG CGCTGCGCCA CCTTGCGCTG GCGGGCAAGG TCGTCATCAT CGACGAGGCC CATGCCTACG ACACCTACAT GAACACCTAC CTGGACCGGG TGCTGTCCTG GCTGGGCGCC TACCGCGTGC CGGTGATCGT GCTGTCGGCG ACGCTTCCGG CCGGGCGGAG ACGGGAACTC CTTGAGGCTT ACGCGGGGAC GAGCGGGTCC GGCCTCACGG AGGTGGAGAG GGCTGAGGGC TACCCGCTGC TGACCGCCGC GGCTCCTGGA CGGTCTCCGG TATTGCAGCC CGTCGCCCCG TCCGGTCGTT GCACCGCTGT TCATATGGAA CGGCTGGAGG ACGACCTTGA CGTCCTGGCG GACCGGCTGG AAGAGGAGTT GTCCGACGGC GGCTGCGCAC TGGTGGTGCG CAACACCGTT GATCGCGTCC TGCAGACCGC TGCGCACTTG CGGAAGCGGT TCGGCAACGT GACCGTCGCT CATGCGCGTT TCCTCGACTT GGATCGGATG GCCAAGGACG CCGAGCTGTT GAAGCTGTTC GGTCCGCCCG GCAAGGACAC CAAACGCCCC AAAGAGGCTC ACATCGTGGT GGCGAGCCAG GTGGCCGAGC AGTCCCTCGA CATCGACTTT GATCTGCTGG TGACCGATCT GTGCCCGATC GACCTGCTGT TGCAGCGCAT GGGACGGGTG CACCGGCACC TGCGGGGCGA GGGGCAGTCG GAGCGTCCGC CGCGGCTTCG CACCGCCCGG TGCCTGGTCA CCGGTGCGGA TTGGGCGGCC GTTCCGGTCG AGCCGGTCAG CGGTTCCCGG CGGATTTACC GGCCTTACCC GTTGTTGCGC GCTGCGGCCG TTCTGGAGCC TTACCTCAGC GGCAGCGCAG AAACCGGACA TATTGTGCGC CTTCCGGAGG ACATCAGCCC GCTGGTGCAA AAGGCCTATG GACACGATCC CGCGGGGCCG GCCGAGTGGC AGAAGGCGCT CGAAGAGGCC CGGCGCGATC AGGAGCTGCA CCAGGAAAAG CAGCGCAAGG AGGCAGAGTC CTTCCAGCTC GGCAAGGTCG GAAAGGACGG CCGTGCCCTG ATCGGATGGC TCGACGCCGG AGTCGGCGAC GCCGACGACA CCCGGGCGGG CAGGGCCCAG GTCCGCGACG GCGAGGAGTC CTTGGAAGTC CTGGTGGTGC AGCGCCTGGC CGACGGGACC GTGCGGACGG TGCCGTGGCT CACCGAAGGA CGCGGTGGCC TGGAGCTGCC CACCGAGACC GCGCCCGAGC CGCGTTTGGC GCGGATAGTG GCGGCCTGCG GGCTGCGGCT TCCCTTCCAG TTCTCGACCC CCGAAGTCCT TGACCGCGCG ATCGAGGAAC TGGAGGCCGA ATGCATCCCG GCCTGGCAGT CCAAAGAGTG CCATTGGCTG GCCGGTGAAC TGATCCTCTT TCTGGACGAG AACTGTCGGA CCCGTCTGGC AGGCTACGAC CTGCATTACA CGCCCACGGA TGGCCTGGAG GTGACTCGTG CCGAGTGA
|
Protein sequence | MIEGEEPGRL GEGLSAVARA VWAKHDRAAE GWLPLWRHMA DSGAVAGLLW DEWIPHQVRR LISAELPGGA EDARRLVVWL ATTHDIGKAT PAFACQVEAL AAAMRDAGLG MPGHREMIDR GLAPHGLAGQ FLLQEWLIER HGWNRSAALQ LAVIAGGHHG VPPAYQNVHD LNARRYLLRT PDRESAWRSV QEELLDACAA ACGVRDRLDD WRRVKLSQPA QVLLTGLVIV ADWIASNPDL FPYFPEAAHI SDRERIEAAW RGLALPAPWR AVEPDEDTDE LFAARFALPL GARVRPVQEQ AVKLARGMAA PGLMIIEAPM GEGKTEAALA VAEIFAARSG AGGCYVALPT MATGNAMFPR LLEWLRRLPA GGSGETHSVY LAHSKAALNE KYVELMRTGR RTVSAVELDG PGDERRHHGD RRVHSAELVA HHWLRGRKKA MLASFGVGTI DQLLFMGLKS RHVALRHLAL AGKVVIIDEA HAYDTYMNTY LDRVLSWLGA YRVPVIVLSA TLPAGRRREL LEAYAGTSGS GLTEVERAEG YPLLTAAAPG RSPVLQPVAP SGRCTAVHME RLEDDLDVLA DRLEEELSDG GCALVVRNTV DRVLQTAAHL RKRFGNVTVA HARFLDLDRM AKDAELLKLF GPPGKDTKRP KEAHIVVASQ VAEQSLDIDF DLLVTDLCPI DLLLQRMGRV HRHLRGEGQS ERPPRLRTAR CLVTGADWAA VPVEPVSGSR RIYRPYPLLR AAAVLEPYLS GSAETGHIVR LPEDISPLVQ KAYGHDPAGP AEWQKALEEA RRDQELHQEK QRKEAESFQL GKVGKDGRAL IGWLDAGVGD ADDTRAGRAQ VRDGEESLEV LVVQRLADGT VRTVPWLTEG RGGLELPTET APEPRLARIV AACGLRLPFQ FSTPEVLDRA IEELEAECIP AWQSKECHWL AGELILFLDE NCRTRLAGYD LHYTPTDGLE VTRAE
|
| |